# Browser Agent

A pair of experimental web components for integrating with Deepgram's Voice Agent API in a browser environment.

- The agent component is an all-in-one web component that manages the microphone, websocket, and
animation. Add it to any page and get chatting!
- The hoop component is the animation, standalone. More useful when you've got your own rules for
socket integration, and just want the look and feel!

## Installation

Install via GitHub by adding the package to your `package.json` dependencies:

```json
"@deepgram/browser-agent": "deepgram/browser-agent#main",
```

## Using the main component

Import the library anywhere to register the component as `deepgram-agent`:

```js
import "@deepgram/browser-agent";
```

Then, render it where you like!

```html
<deepgram-agent
  id="dg-agent"
  url="wss://agent.deepgram.com/v1/agent/converse"
  height="300"
  width="300"
  idle-timeout-ms="10000"
  output-sample-rate="24000"
></deepgram-agent>
```

Then add a `config` attribute after some user interaction to start a connection. See more in the Attributes section.

### Attributes

- `config` (optional): stringified JSON of a `SettingsConfiguration` to send the API on initialization
  - Adding or removing the `config` attribute will start or stop (respectively) the WebSocket connection to the Deepgram API.
  - Because this web component directly manages the user's microphone, it requires a user action to attempt a connection. For that reason, you most likely want to first render the element _without_ a config.
  - For better early API flexibility, there is no validation. Use our docs to ensure your configuration matches.
    - SettingsConfiguration
  - Whenever `deepgram-agent` disconnects, unset the config and wait for another user interaction to set it and retrigger connection.
- `width` (optional, default = "0"): the width of the canvas for agent animation
  - The animation will always take up a (roughly) square area, so this should typically be the same value as `height`.
- `height` (optional, default = "0"): the height of the canvas for agent animation
  - The animation will always take up a (roughly) square area, so this should typically be the same value as `width`.
- `auth-scheme` (optional, default = "bearer"): the auth scheme to use with your token
  - Use `bearer` for the Deepgram API when working with token-based authentication. For local development you may find it more convenient to use an API key (`token` scheme). **Never use API keys in a production browser application!**
- `url` (required): the API URL
  - Chances are you'll set this to `"https://api.deepgram.com/v1/agent"`!
- `idle-timeout-ms` (optional): how long to wait for user idleness before closing the socket
  - Timer starts whenever the user is expected to speak (meaning right when opening the connection, and right after each `AgentAudioDone` event).
- `output-sample-rate`: the output sample rate you'd like for playback
  - Should be the same as the output rate you've got in your Settings object. Unless you're trying to have a little fun.
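The start/stop behavior of the `config` attribute can be sketched as a pair of small helpers. This is a minimal sketch, not the library's own code: `startAgent`, `stopAgent`, and the `start-button` id are hypothetical names, and the settings object is a placeholder to be filled in from the Deepgram docs.

```js
// Hypothetical helper: setting `config` asks the element to open its connection.
function startAgent(agentEl, settings) {
  // Attributes are strings, so the settings object is stringified first.
  agentEl.setAttribute("config", JSON.stringify(settings));
}

// Hypothetical helper: removing `config` asks the element to close its connection.
function stopAgent(agentEl) {
  agentEl.removeAttribute("config");
}

// Browser-only wiring, skipped when no DOM is available.
if (typeof document !== "undefined") {
  const agentEl = document.getElementById("dg-agent");
  // "start-button" is an assumed id, used only for illustration.
  const button = document.getElementById("start-button");
  if (agentEl && button) {
    // Placeholder: supply a real SettingsConfiguration as described in the docs.
    const settings = {};
    // Connecting from a click handler satisfies the user-action requirement.
    button.addEventListener("click", () => startAgent(agentEl, settings));
  }
}
```

Rendering the element without `config` and only calling `startAgent` from a click handler keeps the microphone request tied to a user action.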

### Properties

- `token` (optional): the token to use for accessing the Deepgram `/agent` API. See the token-based auth docs for how to create safe-for-browser tokens.
  - If not provided, the `auth-scheme` will also be ignored. Only makes sense if your API URL is unauthenticated.
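A minimal sketch of supplying a token before connecting, assuming `token` is assigned as a JavaScript property on the element while `auth-scheme` is set as an attribute. `applyToken` is a hypothetical helper name, and how the token is obtained (for example from your own backend) is left to the caller.

```js
// Hypothetical helper: attach a token and its auth scheme to the element.
function applyToken(agentEl, token, scheme = "bearer") {
  // `auth-scheme` is documented as an attribute.
  agentEl.setAttribute("auth-scheme", scheme);
  // Assumption: `token` is a plain property rather than an attribute.
  agentEl.token = token;
}
```

Calling `applyToken` before setting `config` means the token is in place by the time the element attempts to connect.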

### Events

As an experimental tech, the `deepgram-agent` element emits a variety of events. You're more likely to run into some than others.

#### Common events

- "no url": emitted when trying to connect and API url is missing
- "no config": emitted when trying to connect and config is missing
- "invalid auth": emitted when trying to connect and the WebSocket rejects the auth scheme or token
- "socket open": socket successfully opened
- "socket close": socket successfully closed
- "connection timeout": socket failed to connect due to a timeout (10s)
- "failed to connect user media": couldn't gain access to user's microphone, usually due to permission rejection
- "structured message": got JSON from the API. This is the main event to pay attention to!
- "client message": sent a JSON message to the API. Useful for debugging.

#### Uncommon events

- "failed setup": some issue internal to the custom element occurred
- "empty audio": got an empty message when expecting audio data
- "unknown message": got a text message from the API that isn't valid JSON
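The events listed above can be observed with ordinary `addEventListener` calls. The sketch below is illustrative only: `watchAgent` is a hypothetical helper, and it assumes any payload is carried on `event.detail`, which the text above does not confirm. It also unsets `config` on "socket close", following the earlier advice to wait for a new user interaction before reconnecting.

```js
// Event names copied from the common and uncommon lists above.
const AGENT_EVENTS = [
  "no url", "no config", "invalid auth", "socket open", "socket close",
  "connection timeout", "failed to connect user media",
  "structured message", "client message",
  "failed setup", "empty audio", "unknown message",
];

// Hypothetical helper: log every event and reset `config` when the socket closes.
function watchAgent(agentEl, log = console.log) {
  for (const name of AGENT_EVENTS) {
    // Assumption: the payload, if any, is available as `event.detail`.
    agentEl.addEventListener(name, (event) => log(name, event.detail));
  }
  agentEl.addEventListener("socket close", () => {
    // Unset config so a later user interaction can set it again.
    agentEl.removeAttribute("config");
  });
}
```

In practice you would likely replace the generic logger with a handler dedicated to "structured message", since that event carries the API's JSON.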

### Methods

```ts
sendClientMessage(message: ArrayBuffer | string): void {}
```

Use this to send stringified JSON or binary data to the server. Calls are ignored while the WebSocket is closed.

```ts
connect(): Promise<void> {}
```

Use this to explicitly connect. Prefer to handle this by setting the `config` attribute.

```ts
disconnect(reason?: string): Promise<void> {}
```

Use this to explicitly disconnect. Prefer to handle this by _removing_ the `config` attribute.
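Since `sendClientMessage` takes a string or an `ArrayBuffer`, JSON messages need to be stringified by the caller. A minimal sketch, where `sendJson` is a hypothetical wrapper and the message shape is a placeholder rather than a real API message:

```js
// Hypothetical wrapper: stringify a message object and pass it to the element.
function sendJson(agentEl, message) {
  agentEl.sendClientMessage(JSON.stringify(message));
}

// Browser-only usage, skipped when no DOM is available.
if (typeof document !== "undefined") {
  const agentEl = document.getElementById("dg-agent");
  if (agentEl && typeof agentEl.sendClientMessage === "function") {
    // "ExampleMessage" is a made-up type; use a message defined in the API docs.
    sendJson(agentEl, { type: "ExampleMessage" });
  }
}
```

Because the element ignores sends while the socket is closed, a wrapper like this does not need its own connection check.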

## Using the hoop component

The animation alone is available as a granular import, automatically registered as `deepgram-hoop`:

```js
import "@deepgram/browser-agent/hoop";
```

Then, render it where you like!

```html
<deepgram-hoop
  id="dg-hoop"
  height="300"
  width="300"
  status="active"
></deepgram-hoop>
```

### Attributes

The hoop component applies some size oscillation based on audio information:

- The output, i.e. agent audio (`agent-volume` attribute), expands
- The input, i.e. user audio (`user-volume` attribute), collapses

To ease jitter, each drawn arc trails behind a leader. You must provide amplitude data for both the user and agent on a per-frame basis. See the `sendVolumeUpdates` function for a working example.
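As a rough illustration of per-frame updates, the sketch below averages a block of byte samples into a single loudness value and writes it to the two volume attributes. It is not the repository's `sendVolumeUpdates` function: `normalizedVolume` and `pushVolumes` are hypothetical names, and the 0 to 1 value range is an assumption about what the attributes expect.

```js
// Average a block of byte samples (0-255) into a loudness value between 0 and 1.
function normalizedVolume(bytes) {
  if (bytes.length === 0) return 0;
  let sum = 0;
  for (const b of bytes) sum += b;
  return sum / bytes.length / 255;
}

// Hypothetical helper: call once per animation frame with the latest samples.
function pushVolumes(hoopEl, userBytes, agentBytes) {
  // Assumption: the attributes accept a numeric string in the 0-1 range.
  hoopEl.setAttribute("user-volume", String(normalizedVolume(userBytes)));
  hoopEl.setAttribute("agent-volume", String(normalizedVolume(agentBytes)));
}
```

In a browser, the sample arrays could be filled with `AnalyserNode.getByteFrequencyData` and `pushVolumes` driven from a `requestAnimationFrame` loop.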

## Contributing

### Requirements

- Node v18 or 20 (though I recommend installing it through nvm)

### Development

Use `npm run vite` to start a dev server. You'll need to set a `DG_API_KEY` environment variable in
order to open a connection.
