# whspr

CLI tool for audio transcription with Groq Whisper API


A CLI tool that records audio from your microphone, transcribes it using Groq's Whisper API, and post-processes the transcription with AI to fix errors and apply custom vocabulary.

## Installation

```bash
npm install -g whspr
```

If you'd like to use `whisper` instead of `whspr`, add this to your shell config (`~/.zshrc` or `~/.bashrc`):

```bash
alias whisper="whspr"
```
## Requirements

- Node.js 18+
- FFmpeg (`brew install ffmpeg` on macOS)
- Groq API key (required for Whisper transcription)
- Anthropic API key (optional, for Anthropic post-processing models)
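A quick way to confirm the local tooling is in place (the Homebrew command is macOS-specific; use your platform's package manager otherwise):

```bash
# Verify the runtime prerequisites
node --version    # needs v18 or newer
ffmpeg -version   # if missing on macOS: brew install ffmpeg
```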
## Usage

```bash
# Set your API keys
export GROQ_API_KEY="your-api-key"
export ANTHROPIC_API_KEY="your-api-key" # Optional, for Anthropic models

# Start recording
whspr
```

Press Enter to stop recording.
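For reference, the transcription step talks to Groq's OpenAI-compatible Whisper endpoint. A rough curl equivalent of that call (an illustrative sketch only, not whspr's exact request; `recording.mp3` is a placeholder file name):

```bash
# Transcribe an audio file with Groq's Whisper API
curl -s https://api.groq.com/openai/v1/audio/transcriptions \
  -H "Authorization: Bearer $GROQ_API_KEY" \
  -F "file=@recording.mp3" \
  -F "model=whisper-large-v3-turbo" \
  -F "language=en"
```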
## Features

- Live audio waveform visualization in the terminal
- 15-minute max recording time
- Transcription via Groq Whisper API
- AI-powered post-processing to fix transcription errors
- Progress bar during post-processing
- Cost tracking for Anthropic models
- Custom vocabulary support via `WHSPR.md` (global and local)
- Configurable settings via `~/.whspr/settings.json`
- Automatic clipboard copy (or pipe to any command with `--pipe`)
- Optional auto-save for transcriptions and audio files
## Settings

Create `~/.whspr/settings.json` to customize whspr's behavior:

```json
{
  "verbose": false,
  "suffix": "\n\n(Transcribed via Whisper)",
  "transcriptionModel": "whisper-large-v3-turbo",
  "language": "en",
  "model": "groq:openai/gpt-oss-120b",
  "systemPrompt": "Your task is to clean up transcribed text...",
  "customPromptPrefix": "Here's my custom user prompt:",
  "transcriptionPrefix": "Here's my raw transcription output:",
  "alwaysSaveTranscriptions": false,
  "alwaysSaveAudio": false,
  "saveTranscriptionsToCwd": false
}
```
| Option                     | Type    | Default                                                         | Description                                                                       |
| -------------------------- | ------- | --------------------------------------------------------------- | --------------------------------------------------------------------------------- |
| `verbose`                  | boolean | `false`                                                         | Enable verbose output                                                              |
| `suffix`                   | string  | none                                                            | Text appended to all transcriptions                                                |
| `transcriptionModel`       | string  | `"whisper-large-v3-turbo"`                                      | Whisper model (`"whisper-large-v3"` or `"whisper-large-v3-turbo"`)                 |
| `language`                 | string  | `"en"`                                                          | ISO 639-1 language code (e.g., `"en"`, `"zh"`, `"es"`)                             |
| `model`                    | string  | `"groq:openai/gpt-oss-120b"`                                    | Post-processing model in `provider:model-name` format (see below)                 |
| `systemPrompt`             | string  | (built-in)                                                      | System prompt for AI post-processing                                               |
| `customPromptPrefix`       | string  | `"Here's my custom user prompt:"`                               | Prefix before custom prompt content                                                |
| `transcriptionPrefix`      | string  | `"Here's my raw transcription output that I need you to edit:"` | Prefix before raw transcription                                                    |
| `alwaysSaveTranscriptions` | boolean | `false`                                                         | Always save transcription text files to `~/.whspr/transcriptions/`                 |
| `alwaysSaveAudio`          | boolean | `false`                                                         | Always save audio MP3 files to `~/.whspr/recordings/`                              |
| `saveTranscriptionsToCwd`  | boolean | `false`                                                         | Save transcriptions to the current directory instead of `~/.whspr/transcriptions/` |
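The settings file is plain JSON, so any editor works. As one sketch, you could write a minimal config from the shell to transcribe Spanish and always keep a copy of each transcript (options shown are from the table above; anything omitted presumably falls back to its default):

```bash
# Create a minimal whspr settings file
mkdir -p ~/.whspr
cat > ~/.whspr/settings.json <<'EOF'
{
  "language": "es",
  "alwaysSaveTranscriptions": true
}
EOF
```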
## Post-Processing Models

The `model` setting uses a `provider:model-name` format. Supported providers:

| Provider    | API Key Required    |
| ----------- | ------------------- |
| `groq`      | `GROQ_API_KEY`      |
| `anthropic` | `ANTHROPIC_API_KEY` |
### Available Models

| Provider    | Model                              | Description                              |
| ----------- | ---------------------------------- | ---------------------------------------- |
| `anthropic` | `claude-sonnet-4-5`                | Balanced speed and quality (recommended) |
| `anthropic` | `claude-haiku-4-5`                 | Fastest responses, smaller model         |
| `anthropic` | `claude-opus-4-5`                  | Best quality, slower and more expensive  |
| `groq`      | `openai/gpt-oss-120b`              | Default model                            |
| `groq`      | `llama-3.3-70b-versatile`          | Fast, versatile Llama model              |
| `groq`      | `moonshotai/kimi-k2-instruct-0905` | Moonshot Kimi model                      |

> Note: Model names are set by the providers and may change at any time. Check Groq Models and Anthropic Models for the latest available models.
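Since model names drift, one way to check which IDs your Groq key can currently use is the provider's OpenAI-compatible model-listing endpoint (a sketch; assumes `jq` is installed):

```bash
# List model IDs currently available to your Groq account
curl -s https://api.groq.com/openai/v1/models \
  -H "Authorization: Bearer $GROQ_API_KEY" | jq -r '.data[].id'
```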
### Example: Switching to an Anthropic Model

```json
{
  "model": "anthropic:claude-sonnet-4-5",
  "suffix": "\n\n(Transcribed via Whisper, edited via Claude Sonnet 4.5)"
}
```

### Example: Saving Transcriptions to the Current Directory

```json
{
  "alwaysSaveTranscriptions": true,
  "saveTranscriptionsToCwd": true
}
```

## Pipe Output
Use `--pipe` (or `-p`) to send the transcription to any command instead of the clipboard:

```bash
# Pipe to Claude Code for further processing
whspr --pipe "claude"

# Append to a file
whspr --pipe "cat >> meeting-notes.txt"

# Send via curl
whspr --pipe "xargs -I {} curl -X POST -d 'text={}' https://api.example.com"
```

If the pipe command fails, whspr falls back to copying to the clipboard.
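Because the piped command reads the transcription from stdin, standard tools compose naturally. For example (assuming macOS `pbcopy`), to append to a notes file and still keep a clipboard copy:

```bash
whspr --pipe "tee -a meeting-notes.txt | pbcopy"
```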
## Custom Vocabulary

Create a `WHSPR.md` (or `WHISPER.md`) file to provide custom vocabulary, names, or instructions for the AI post-processor.

### Global Vocabulary
Place in `~/.whspr/WHSPR.md` for vocabulary that applies everywhere:

```markdown
# Global Vocabulary

- My name is "Alex" not "Alec"
- Common terms: API, CLI, JSON, OAuth
```

### Local Vocabulary
Place in your current directory (`./WHSPR.md`) for project-specific vocabulary:

```markdown
# Project Vocabulary

- PostgreSQL (not "post crest QL")
- Kubernetes (not "cooper netties")
- My colleague's name is "Priya" not "Maria"
```

When both exist, they are combined (global first, then local).
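If you prefer to scaffold the global file from the shell, a minimal sketch (contents taken from the example above):

```bash
mkdir -p ~/.whspr
cat > ~/.whspr/WHSPR.md <<'EOF'
# Global Vocabulary

- My name is "Alex" not "Alec"
- Common terms: API, CLI, JSON, OAuth
EOF
```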
## How It Works
1. Records audio from your default microphone using FFmpeg
2. Displays a live waveform visualization based on audio levels
3. Converts the recording to MP3
4. Sends audio to Groq's Whisper API for transcription
5. Loads custom prompts from `~/.whspr/WHSPR.md` and/or `./WHSPR.md`
6. Sends transcription + custom vocabulary to AI for post-processing (with progress bar)
7. Applies suffix (if configured)
8. Displays result with word count, character count, and cost estimate
9. Pipes the result to a command (`--pipe`) or copies it to the clipboard
10. Saves transcription/audio files (if configured)

If transcription fails, the recording is saved to `~/.whspr/recordings/` for manual recovery.

## License

MIT