@talkio/deepgram

Deepgram STT and TTS providers for talkio. Uses Deepgram's WebSocket APIs for streaming.

Installation

``bash bun add @talkio/deepgram

`or`


npm install @talkio/deepgram
or

pnpm add @talkio/deepgram

Peer Dependencies: This package requires talkio to be installed.

`Quick Start`

`typescript import { createAgent } from "talkio"; import { createDeepgram } from "@talkio/deepgram";

const deepgram = createDeepgram({ apiKey: process.env.DEEPGRAM_API_KEY, });

const agent = createAgent({ stt: deepgram.stt({ model: "nova-3" }), tts: deepgram.tts({ model: "aura-2-thalia-en" }), llm: myLLMProvider, });

agent.start();`

`Usage`

`$3`

Create a shared Deepgram instance to reuse configuration across STT and TTS:

`typescript import { createDeepgram } from "@talkio/deepgram";

const deepgram = createDeepgram({ apiKey: process.env.DEEPGRAM_API_KEY, baseUrl: "api.deepgram.com", // optional, for enterprise/proxy });

// Create providers with shared config const stt = deepgram.stt({ model: "nova-3" }); const tts = deepgram.tts({ model: "aura-2-thalia-en" });`

`$3`

For more control, import the factory functions directly:

`typescript import { createDeepgramSTT, createDeepgramTTS } from "@talkio/deepgram";

const stt = createDeepgramSTT({ apiKey: process.env.DEEPGRAM_API_KEY, model: "nova-3", language: "en", interimResults: true, punctuate: true, smartFormat: true, });

const tts = createDeepgramTTS({ apiKey: process.env.DEEPGRAM_API_KEY, model: "aura-2-thalia-en", });`

`API Reference`

`$3`

Creates a Deepgram provider instance with shared configuration.

`typescript interface DeepgramProviderSettings { apiKey?: string; // Falls back to DEEPGRAM_API_KEY env var baseUrl?: string; // Default: "api.deepgram.com" }`

Returns an object with stt() and tts() factory methods.

`$3`

Creates a Speech-to-Text provider.

`typescript interface DeepgramSTTOptions { // Required model: string; // e.g., "nova-3", "nova-2"

// Optional apiKey?: string; // Falls back to provider settings or env var baseUrl?: string; // Falls back to provider settings language?: string; // Default: "en" interimResults?: boolean; // Default: true punctuate?: boolean; // Default: true smartFormat?: boolean; // Default: true endpointing?: number | false; // Default: 300 (ms) utteranceEndMs?: number; // Default: 1000 (ms) keywords?: string[]; // Domain-specific terms to boost vad?: boolean; // Default: true }`

Supported Models:

- nova-3- Latest and most accurate -nova-2- Fast and accurate -nova- Original Nova model - See Deepgram Models for full list

Supported Input Formats:

| Encoding | Sample Rates | Channels | Default | | ---------- | ---------------------------- | -------- | -------- | |linear16| 8000, 16000, 24000, 48000 Hz | 1, 2 | 16000 Hz | |linear32| 16000, 24000, 48000 Hz | 1, 2 | - | |flac| 16000, 24000, 48000 Hz | 1, 2 | - | |opus| 8000, 16000, 24000, 48000 Hz | 1, 2 | - | |ogg-opus| 8000, 16000, 24000, 48000 Hz | 1, 2 | - | |speex| 8000, 16000, 32000 Hz | 1 | - | |mulaw| 8000 Hz | 1 | - | |alaw| 8000 Hz | 1 | - | |amr-nb| 8000 Hz | 1 | - | |amr-wb| 16000 Hz | 1 | - | |g729 | 8000 Hz | 1 | - |

The provider automatically uses its default format (linear16 at 16000 Hz, mono) unless you specify audio.input in createAgent().

`$3`

Creates a Text-to-Speech provider.

`typescript interface DeepgramTTSOptions { // Required model: string; // e.g., "aura-2-thalia-en"

// Optional apiKey?: string; // Falls back to provider settings or env var baseUrl?: string; // Falls back to provider settings encoding?: "linear16" | "mulaw" | "alaw"; // Default: "linear16" sampleRate?: 8000 | 16000 | 24000 | 32000 | 48000; // Default: 24000 for linear16, 8000 for mulaw/alaw }`

Voice Format:

- Aura 2 voices: aura-2-{voice}-{language} (e.g., aura-2-thalia-en) - See Deepgram TTS Models for available voices

Supported Output Formats:

| Encoding | Sample Rates | Channels | Default | | ---------- | ----------------------------------- | -------- | -------- | |linear16| 8000, 16000, 24000, 32000, 48000 Hz | 1 | 24000 Hz | |mulaw| 8000, 16000 Hz | 1 | 8000 Hz | |alaw | 8000, 16000 Hz | 1 | 8000 Hz |

The provider automatically uses its default format (linear16 at 24000 Hz, mono) unless you specify audio.output in createAgent().

`Configuration`

`$3`

The API key can be provided in three ways (in order of precedence):

1. Directly in options: createDeepgramSTT({ apiKey: "..." })2. Via provider settings:createDeepgram({ apiKey: "..." })3. Environment variable:DEEPGRAM_API_KEY

`$3`

For enterprise deployments or proxies:

`typescript const deepgram = createDeepgram({ apiKey: process.env.DEEPGRAM_API_KEY, baseUrl: "your-proxy.example.com", });`

`Examples`

`$3`

`typescript import { createAgent } from "talkio"; import { createDeepgram } from "@talkio/deepgram";

const deepgram = createDeepgram();

const agent = createAgent({ stt: deepgram.stt({ model: "nova-3", language: "en", keywords: ["voice-ai", "Deepgram"], // Boost recognition }), tts: deepgram.tts({ model: "aura-2-thalia-en", }), llm: myLLMProvider, onEvent: (event) => { if (event.type === "human-turn:transcript") { console.log("Transcript:", event.transcript); } }, });

agent.start();`

`$3`

Override the default audio formats when needed:

`typescript const agent = createAgent({ stt: deepgram.stt({ model: "nova-3" }), tts: deepgram.tts({ model: "aura-2-thalia-en" }), llm: myLLMProvider, // Specify custom formats audio: { input: { encoding: "linear16", sampleRate: 24000, channels: 1 }, output: { encoding: "linear16", sampleRate: 48000, channels: 1 }, }, });`

`$3`

`typescript const stt = createDeepgramSTT({ model: "nova-3", language: "es", // Spanish });``

See Deepgram Languages for supported languages.

License

Apache-2.0

@talkio/deepgram

Deepgram STT and TTS providers for talkio. Uses Deepgram's WebSocket APIs for streaming.

Installation

``bash bun add @talkio/deepgram

`or`


npm install @talkio/deepgram
or

pnpm add @talkio/deepgram

Peer Dependencies: This package requires talkio to be installed.

`Quick Start`

`typescript import { createAgent } from "talkio"; import { createDeepgram } from "@talkio/deepgram";

const deepgram = createDeepgram({ apiKey: process.env.DEEPGRAM_API_KEY, });

const agent = createAgent({ stt: deepgram.stt({ model: "nova-3" }), tts: deepgram.tts({ model: "aura-2-thalia-en" }), llm: myLLMProvider, });

agent.start();`

`Usage`

`$3`

Create a shared Deepgram instance to reuse configuration across STT and TTS:

`typescript import { createDeepgram } from "@talkio/deepgram";

const deepgram = createDeepgram({ apiKey: process.env.DEEPGRAM_API_KEY, baseUrl: "api.deepgram.com", // optional, for enterprise/proxy });

// Create providers with shared config const stt = deepgram.stt({ model: "nova-3" }); const tts = deepgram.tts({ model: "aura-2-thalia-en" });`

`$3`

For more control, import the factory functions directly:

`typescript import { createDeepgramSTT, createDeepgramTTS } from "@talkio/deepgram";

const stt = createDeepgramSTT({ apiKey: process.env.DEEPGRAM_API_KEY, model: "nova-3", language: "en", interimResults: true, punctuate: true, smartFormat: true, });

const tts = createDeepgramTTS({ apiKey: process.env.DEEPGRAM_API_KEY, model: "aura-2-thalia-en", });`

`API Reference`

`$3`

Creates a Deepgram provider instance with shared configuration.

`typescript interface DeepgramProviderSettings { apiKey?: string; // Falls back to DEEPGRAM_API_KEY env var baseUrl?: string; // Default: "api.deepgram.com" }`

Returns an object with stt() and tts() factory methods.

`$3`

Creates a Speech-to-Text provider.

`typescript interface DeepgramSTTOptions { // Required model: string; // e.g., "nova-3", "nova-2"

Supported Models:

- nova-3- Latest and most accurate -nova-2- Fast and accurate -nova- Original Nova model - See Deepgram Models for full list

Supported Input Formats:

The provider automatically uses its default format (linear16 at 16000 Hz, mono) unless you specify audio.input in createAgent().

`$3`

Creates a Text-to-Speech provider.

`typescript interface DeepgramTTSOptions { // Required model: string; // e.g., "aura-2-thalia-en"

Voice Format:

- Aura 2 voices: aura-2-{voice}-{language} (e.g., aura-2-thalia-en) - See Deepgram TTS Models for available voices

Supported Output Formats:

The provider automatically uses its default format (linear16 at 24000 Hz, mono) unless you specify audio.output in createAgent().

`Configuration`

`$3`

The API key can be provided in three ways (in order of precedence):

1. Directly in options: createDeepgramSTT({ apiKey: "..." })2. Via provider settings:createDeepgram({ apiKey: "..." })3. Environment variable:DEEPGRAM_API_KEY

`$3`

For enterprise deployments or proxies:

`typescript const deepgram = createDeepgram({ apiKey: process.env.DEEPGRAM_API_KEY, baseUrl: "your-proxy.example.com", });`

`Examples`

`$3`

`typescript import { createAgent } from "talkio"; import { createDeepgram } from "@talkio/deepgram";

const deepgram = createDeepgram();

agent.start();`

`$3`

Override the default audio formats when needed:

`$3`

`typescript const stt = createDeepgramSTT({ model: "nova-3", language: "es", // Spanish });``

See Deepgram Languages for supported languages.

License

Apache-2.0