# Scrapegoat Ex-Coder

Run Ex-Coder with any OpenRouter model.

```bash
npm install scrapegoat-ex-coder
```

Ex-Coder is a CLI tool that allows you to run Claude Code with any OpenRouter model by proxying requests through a local Anthropic API-compatible server.

## Features
- ✅ Cross-platform - Works with both Node.js and Bun (v1.3.0+)
- ✅ Universal compatibility - Use with npx or bunx - no installation required
- ✅ Interactive setup - Prompts for API key and model if not provided (zero config!)
- ✅ Monitor mode - Proxy to real Anthropic API and log all traffic (for debugging)
- ✅ Protocol compliance - 1:1 compatibility with Claude Code communication protocol
- ✅ Snapshot testing - Comprehensive test suite with 13/13 passing tests
- ✅ Headless mode - Automatic print mode for non-interactive execution
- ✅ Quiet mode - Clean output by default (no log pollution)
- ✅ JSON output - Structured data for tool integration
- ✅ Real-time streaming - See Claude Code output as it happens
- ✅ Parallel runs - Each instance gets isolated proxy
- ✅ Autonomous mode - Bypass all prompts with flags
- ✅ Context inheritance - Runs in current directory with same .claude settings
- ✅ Multiple models - 10+ prioritized OpenRouter models
- ✅ Agent support - Use Claude Code agents in headless mode with --agent
## Requirements

- Node.js 18+ or Bun 1.0+ - JavaScript runtime (either works!)
- Claude Code - Claude CLI must be installed
- OpenRouter API Key - Free tier available
✨ NEW in v1.3.0: Universal compatibility! Works with both Node.js and Bun.
## Installation

### Option 1: Use without installing (recommended)

```bash
# With Node.js (works everywhere)
npx ex-coder@latest --model x-ai/grok-code-fast-1 "your prompt"
```
### Option 2: Install globally

```bash
# With npm (Node.js)
npm install -g ex-coder

# With Bun (faster)
bun install -g ex-coder
```

### Landing page (development)

```bash
cd /Users/nathan/ex-coder/Excoder/landingpage
pnpm install
pnpm dev              # Vite dev server
pnpm build
pnpm preview
pnpm firebase:deploy
```

### Option 3: Install from source

```bash
cd mcp/ex-coder
bun install     # or: npm install
bun run build   # or: npm run build
bun link        # or: npm link
```

**Performance Note:** While ex-coder works with both runtimes, Bun offers faster startup times. Both provide identical functionality.
## Quick Start

### Step 1: Initialize your project (recommended)

```bash
# Navigate to your project directory
cd /path/to/your/project

# Install the ex-coder skill for automatic best practices
ex-coder --init

# Reload Claude Code to discover the skill
```

What this does:

- ✅ Installs the ex-coder usage skill in `.claude/skills/ex-coder-usage/`
- ✅ Enables automatic sub-agent delegation
- ✅ Enforces file-based instruction patterns
- ✅ Prevents context window pollution

After running `--init`, Claude will automatically:
- Use sub-agents when you mention external models (Grok, GPT-5, etc.)
- Follow best practices for ex-coder usage
- Suggest specialized agents for different tasks
### Step 2: Interactive mode (zero config)

```bash
# Just run it - it will prompt for API key and model
ex-coder

# Enter your OpenRouter API key when prompted
# Select a model from the list
# Start coding!
```

### Step 3: Non-interactive mode
```bash
# Set up environment
export OPENROUTER_API_KEY=sk-or-v1-...
export ANTHROPIC_API_KEY=sk-ant-api03-placeholder

# Run with a specific task
ex-coder "implement user authentication"

# Or with a specific model
ex-coder --model openai/gpt-5-codex "add tests"
```

**Note:** In interactive mode, if `OPENROUTER_API_KEY` is not set, you'll be prompted to enter it. This makes first-time usage super simple!

## AI Agent Usage
For AI agents running within Claude Code: Use the dedicated AI agent guide for comprehensive instructions on file-based patterns and sub-agent delegation.
```bash
# Print the complete AI agent usage guide
ex-coder --help-ai

# Save the guide to a file for reference
ex-coder --help-ai > ex-coder-agent-guide.md
```

**Quick Reference for AI Agents:**
### Recommended workflow
1. Get available models:

```bash
# List all models, or search
ex-coder --models
ex-coder --models gemini

# Get top recommended models (JSON)
ex-coder --top-models --json
```

2. Run ex-coder through a sub-agent (recommended pattern):
```typescript
// Don't run ex-coder directly in the main conversation.
// Use the Task tool to delegate to a sub-agent.
const result = await Task({
  subagent_type: "general-purpose",
  description: "Implement feature with Grok",
  prompt: `STEPS:
1. Create instruction file: /tmp/ex-coder-task-${Date.now()}.md
2. Write feature requirements to the file
3. Run: ex-coder --model x-ai/grok-code-fast-1 --stdin < /tmp/ex-coder-task-*.md
4. Read the result and return ONLY a summary (2-3 sentences)
DO NOT return the full implementation. Keep the response under 300 tokens.`,
});
```
3. File-based instruction pattern (avoids context pollution):

```typescript
// Write instructions to a file
const instructionFile = `/tmp/ex-coder-task-${Date.now()}.md`;
const resultFile = `/tmp/ex-coder-result-${Date.now()}.md`;

await Write({
  file_path: instructionFile,
  content: `# Task
Your task description here

# Output
Write results to: ${resultFile}`,
});

// Run ex-coder with stdin
await Bash(`ex-coder --model x-ai/grok-code-fast-1 --stdin < ${instructionFile}`);

// Read the result
const result = await Read({ file_path: resultFile });

// Return a summary only
return extractSummary(result);
```
Key Principles:
- ✅ Use file-based patterns to avoid context window pollution
- ✅ Delegate to sub-agents instead of running directly
- ✅ Return summaries only (not full conversation transcripts)
- ✅ Choose appropriate model for task (see --models or --top-models)
Resources:

- Full AI agent guide: `ex-coder --help-ai`
- Skill document: `skills/excoder_usage/SKILL.md` (in repository root)
- Model integration: (in repository root)
## CLI Reference

```bash
ex-coder [OPTIONS]
```
| Flag | Description | Default |
|------|-------------|---------|
| `-i, --interactive` | Run in interactive mode (persistent session) | Single-shot mode |
| `-m, --model` | OpenRouter model to use | `x-ai/grok-code-fast-1` |
| `-p, --port` | Proxy server port | Random (3000-9000) |
| `-q, --quiet` | Suppress `[ex-coder]` log messages | Quiet in single-shot |
| `-v, --verbose` | Show `[ex-coder]` log messages | Verbose in interactive |
| `--json` | Output in JSON format (implies `--quiet`) | `false` |
| `-d, --debug` | Enable debug logging to file | `false` |
| `--no-auto-approve` | Disable auto-approve (require prompts) | Auto-approve enabled |
| `--dangerous` | Pass `--dangerouslyDisableSandbox` | `false` |
| `--agent` | Use a specific agent (e.g., `frontend:developer`) | - |
| `--models` | List all models or search (e.g., `--models gemini`) | - |
| `--top-models` | Show top recommended programming models | - |
| `--list-agents` | List available agents in the current project | - |
| `--force-update` | Force refresh of the model cache | - |
| `--init` | Install the ex-coder skill in the current project | - |
| `--help-ai` | Show the AI agent usage guide | - |
| `-h, --help` | Show help message | - |
## Environment Variables

| Variable | Description | Required |
|----------|-------------|----------|
| `OPENROUTER_API_KEY` | Your OpenRouter API key | ⚡ Optional in interactive mode (will prompt if not set); ✅ Required in non-interactive mode |
| `GROQ_API_KEY` | API key for Groq when `--provider groq` | ✅ Required for Groq |
| `HF_TOKEN` | API token for HuggingFace Inference when `--provider huggingface` | ✅ Required for HuggingFace |
| `API_KEY_OPENAI` | API key for OpenAI-compatible SDK mode when `--provider openai-sdk` | ✅ Required for OpenAI SDK |
| `EXCODER_PROVIDER` | Default provider (`openrouter` \| `groq` \| `huggingface` \| `openai-sdk`) | ❌ No |
| `EXCODER_BASE_URL` | Override base URL for OpenAI-compatible providers | ❌ No |
| `ANTHROPIC_API_KEY` | Placeholder to prevent Claude Code dialog (not used for auth) | ✅ Required |
| `EXCODER_MODEL` | Default model to use | ❌ No |
| `EXCODER_PORT` | Default proxy port | ❌ No |
| `EXCODER_ACTIVE_MODEL_NAME` | Automatically set by Ex-Coder to show the active model in the status line (read-only) | ❌ No |
| `ex-coder_*` | Legacy equivalents (still honored for compatibility) | ❌ No |
**Important Notes:**

- NEW in v1.3.0: In interactive mode, if `OPENROUTER_API_KEY` is not set, you'll be prompted to enter it.
- You MUST set `ANTHROPIC_API_KEY=sk-ant-api03-placeholder` (or any value). Without it, Claude Code will show a dialog.
## Supported Models

Ex-Coder supports 5 OpenRouter models in priority order:
1. x-ai/grok-code-fast-1 (Default)
- Fast coding-focused model from xAI
- Best for quick iterations
2. openai/gpt-5-codex
- Advanced coding model from OpenAI
- Best for complex implementations
3. minimax/minimax-m2
- High-performance model from MiniMax
- Good for general coding tasks
4. zhipu-ai/glm-4.6
- Advanced model from Zhipu AI
- Good for multilingual code
5. qwen/qwen3-vl-235b-a22b-instruct
- Vision-language model from Alibaba
- Best for UI/visual tasks
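The priority ordering above can be sketched as a simple fallback chain. This is an illustrative helper, not the actual ex-coder implementation; `pickModel` is a hypothetical name.

```typescript
// Hypothetical sketch: choose the highest-priority model from the shortlist
// that is currently available, falling back to the default (Grok).
const PRIORITY_MODELS = [
  "x-ai/grok-code-fast-1",
  "openai/gpt-5-codex",
  "minimax/minimax-m2",
  "zhipu-ai/glm-4.6",
  "qwen/qwen3-vl-235b-a22b-instruct",
] as const;

function pickModel(available: Set<string>): string {
  for (const model of PRIORITY_MODELS) {
    if (available.has(model)) return model;
  }
  // Default when nothing from the shortlist is available
  return PRIORITY_MODELS[0];
}

console.log(pickModel(new Set(["minimax/minimax-m2", "zhipu-ai/glm-4.6"])));
// → minimax/minimax-m2
```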
List models anytime with:

```bash
ex-coder --models
```
## Agent Support

Run specialized agents in headless mode with direct agent selection:

```bash
# Use the frontend developer agent
ex-coder --model x-ai/grok-code-fast-1 --agent frontend:developer "create a React button component"
```
**Agent Features:**

- ✅ Direct agent selection - No need to ask Claude to use an agent
- ✅ Automatic prefixing - Adds `@agent-` automatically (`frontend:developer` → `@agent-frontend:developer`)
- ✅ Project-specific agents - Works with any agents installed in `.claude/agents/`
- ✅ Agent discovery - List all available agents with `--list-agents`

## Status Line Display
ex-coder automatically shows critical information in the Claude Code status bar - no setup required!
**Ultra-Compact Format:**

```
directory • model-id • $cost • ctx%
```

**Visual Design:**
- 🔵 Directory (bright cyan, bold) - Where you are
- 🟡 Model ID (bright yellow) - Actual OpenRouter model ID
- 🟢 Cost (bright green) - Real-time session cost from OpenRouter
- 🟣 Context (bright magenta) - % of context window remaining
- ⚪ Separators (dim) - Visual dividers
Examples:

- `ex-coder • x-ai/grok-code-fast-1 • $0.003 • 95%` - Using Grok, $0.003 spent, 95% context left
- `my-project • openai/gpt-5-codex • $0.12 • 67%` - Using GPT-5, $0.12 spent, 67% context left
- `backend • minimax/minimax-m2 • $0.05 • 82%` - Using MiniMax M2, $0.05 spent, 82% left
- `test • openrouter/auto • $0.01 • 90%` - Using any custom model, $0.01 spent, 90% left

**Critical Tracking (Live Updates):**
- 💰 Cost tracking - Real-time USD from Claude Code session data
- 📊 Context monitoring - Percentage of model's context window remaining
- ⚡ Performance optimized - Ultra-compact to fit with thinking mode UI
Thinking Mode Optimized:
- ✅ Ultra-compact - Directory limited to 15 chars (leaves room for everything)
- ✅ Critical first - Most important info (directory, model) comes first
- ✅ Smart truncation - Long directories shortened with "..."
- ✅ Space reservation - Reserves ~40 chars for Claude's thinking mode UI
- ✅ Color-coded - Instant visual scanning
- ✅ No overflow - Fits perfectly even with thinking mode enabled
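The format and truncation rules above can be sketched as follows. This is illustrative only; the real status line is a bash script, and `statusLine` is a hypothetical name, but the 15-character directory limit and "..." truncation follow the description above.

```typescript
// Illustrative sketch of the ultra-compact status line:
// directory • model-id • $cost • ctx%
function statusLine(dir: string, model: string, costUsd: number, ctxPct: number): string {
  // Smart truncation: directories over 15 chars are shortened to 12 chars + "..."
  const shortDir = dir.length > 15 ? dir.slice(0, 12) + "..." : dir;
  return `${shortDir} • ${model} • $${costUsd} • ${ctxPct}%`;
}

console.log(statusLine("my-project", "openai/gpt-5-codex", 0.12, 67));
// → my-project • openai/gpt-5-codex • $0.12 • 67%
```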
Custom Model Support:
- ✅ ANY OpenRouter model - Not limited to the shortlist (e.g., `openrouter/auto`, custom models)
- ✅ Actual model IDs - Shows exact OpenRouter model ID (no translation)
- ✅ Context fallback - Unknown models use 100k context window (safe default)
- ✅ Shortlist optimized - Our recommended models have accurate context sizes
- ✅ Future-proof - Works with new models added to OpenRouter

**How it works:**
- Each ex-coder instance creates a temporary settings file with custom status line
- Settings use the `--settings` flag (doesn't modify the global Claude Code config)
- Status line uses simple bash script with ANSI colors (no external dependencies!)
- Displays actual OpenRouter model ID from ex-coder_ACTIVE_MODEL_NAME env var
- Context tracking uses model-specific sizes for our shortlist, 100k fallback for others
- Temp files are automatically cleaned up when ex-coder exits
- Each instance is completely isolated - run multiple in parallel!

**Per-instance isolation:**
- ✅ Doesn't modify `~/.claude/settings.json`
- ✅ Each instance has its own config
- ✅ Safe to run multiple ex-coder instances in parallel
- ✅ Standard Claude Code unaffected
- ✅ Temp files auto-cleanup on exit
- ✅ No external dependencies (bash only, no jq!)

## Examples
### Basic usage

```bash
# Simple prompt
ex-coder "fix the bug in user.ts"

# Multi-word prompt
ex-coder "implement user authentication with JWT tokens"
```

### Choosing a model
```bash
# Use Grok for fast coding
ex-coder --model x-ai/grok-code-fast-1 "add error handling"

# Use GPT-5 Codex for complex tasks
ex-coder --model openai/gpt-5-codex "refactor entire API layer"

# Use Qwen for UI tasks
ex-coder --model qwen/qwen3-vl-235b-a22b-instruct "implement dashboard UI"
```

### Autonomous mode
Auto-approve is enabled by default. For fully autonomous mode, add `--dangerous`:

```bash
# Basic usage (auto-approve already enabled)
ex-coder "delete unused files"

# Fully autonomous (auto-approve + dangerous sandbox disabled)
ex-coder --dangerous "install dependencies"

# Disable auto-approve if you want prompts
ex-coder --no-auto-approve "make important changes"
```

### Custom port
```bash
# Use a specific port
ex-coder --port 3000 "analyze codebase"

# Or set a default
export ex-coder_PORT=3000
ex-coder "your task"
```

### Combining flags
```bash
# Verbose mode
ex-coder "debug issue" --verbose

# Custom working directory
ex-coder "analyze code" --cwd /path/to/project

# Multiple flags
ex-coder --model openai/gpt-5-codex "task" --verbose --debug
```

### Monitor mode
NEW! ex-coder now includes a monitor mode to help you understand how Claude Code works internally.
```bash
# Enable monitor mode (requires a real Anthropic API key)
ex-coder --monitor --debug "implement a feature"
```

**What Monitor Mode Does:**
- ✅ Proxies to REAL Anthropic API (not OpenRouter) - Uses your actual Anthropic API key
- ✅ Logs ALL traffic - Captures complete requests and responses
- ✅ Both streaming and JSON - Logs SSE streams and JSON responses
- ✅ Debug logs to file - Saves to `logs/ex-coder_*.log` when `--debug` is used
- ✅ Pass-through proxy - No translation; forwards as-is to Anthropic

**When to use Monitor Mode:**
- 🔍 Understanding Claude Code's API protocol
- 🐛 Debugging integration issues
- 📊 Analyzing Claude Code's behavior
- 🔬 Research and development
**Requirements:**

```bash
# Monitor mode requires a REAL Anthropic API key (not the placeholder)
export ANTHROPIC_API_KEY='sk-ant-api03-...'

# Use with --debug to save logs to a file
ex-coder --monitor --debug "your task"

# Logs are saved to: logs/ex-coder_TIMESTAMP.log
```

**Example Output:**
```
[Monitor] Server started on http://127.0.0.1:8765
[Monitor] Mode: Passthrough to real Anthropic API
[Monitor] All traffic will be logged for analysis

=== [MONITOR] Claude Code → Anthropic API Request ===
{
  "model": "claude-sonnet-4.5",
  "messages": [...],
  "max_tokens": 4096,
  ...
}
=== End Request ===

=== [MONITOR] Anthropic API → Claude Code Response (Streaming) ===
event: message_start
data: {"type":"message_start",...}

event: content_block_start
data: {"type":"content_block_start",...}
...
=== End Streaming Response ===
```

**Note:** Monitor mode charges your Anthropic account (not OpenRouter). Use the `--debug` flag to save logs for analysis.

### Output modes
ex-coder supports three output modes for different use cases:
#### 1. Quiet Mode (Default in Single-Shot)
Clean output with no `[ex-coder]` logs - perfect for piping to other tools:

```bash
# Quiet by default in single-shot
ex-coder "what is 2+2?"
# Output: 2 + 2 equals 4.

# Use in pipelines
ex-coder "list 3 colors" | grep -i blue

# Redirect to file
ex-coder "analyze code" > analysis.txt
```

#### 2. Verbose Mode
Show all `[ex-coder]` log messages for debugging:

```bash
# Verbose mode
ex-coder --verbose "what is 2+2?"

# Output:
# [ex-coder] Starting Claude Code with openai/gpt-4o
# [ex-coder] Proxy URL: http://127.0.0.1:8797
# [ex-coder] Status line: dir • openai/gpt-4o • $cost • ctx%
# ...
# 2 + 2 equals 4.
# [ex-coder] Shutting down proxy server...
# [ex-coder] Done

# Interactive mode is verbose by default
ex-coder --interactive
```

#### 3. JSON Output Mode
Structured output perfect for automation and tool integration:

```bash
# JSON output (always quiet)
ex-coder --json "what is 2+2?"
# Output: {"type":"result","result":"2 + 2 equals 4.","total_cost_usd":0.068,"usage":{...}}

# Extract just the result with jq
ex-coder --json "list 3 colors" | jq -r '.result'

# Get cost and token usage
ex-coder --json "analyze code" | jq '{result, cost: .total_cost_usd, tokens: .usage.input_tokens}'

# Use in scripts
RESULT=$(ex-coder --json "check if tests pass" | jq -r '.result')
echo "AI says: $RESULT"

# Track costs across multiple runs
for task in task1 task2 task3; do
  ex-coder --json "$task" | jq -r '"\(.total_cost_usd)"'
done | awk '{sum+=$1} END {print "Total: $"sum}'
```
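For consumers in code rather than jq, the same JSON can be parsed directly. This is a hedged sketch: the interface below covers only the fields documented in this README, and `parseResult` is a hypothetical helper, with a hard-coded sample standing in for captured `ex-coder --json` output.

```typescript
// Minimal typed view of ex-coder's --json output (fields per this README).
interface ExCoderResult {
  type: string;
  result: string;
  total_cost_usd: number;
  usage: { input_tokens: number; output_tokens: number };
}

function parseResult(runOutput: string): ExCoderResult {
  return JSON.parse(runOutput) as ExCoderResult;
}

// Sample stands in for the string captured from `ex-coder --json "..."`.
const sample =
  '{"type":"result","result":"2 + 2 equals 4.","total_cost_usd":0.068,"usage":{"input_tokens":10,"output_tokens":8}}';
const parsed = parseResult(sample);
console.log(`${parsed.result} (cost: $${parsed.total_cost_usd})`);
```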
**JSON Output Fields:**

- `result` - The AI's response text
- `total_cost_usd` - Total cost in USD
- `usage.input_tokens` - Input tokens used
- `usage.output_tokens` - Output tokens used
- `duration_ms` - Total duration in milliseconds
- `num_turns` - Number of conversation turns
- `modelUsage` - Per-model usage breakdown

## How It Works
### Lifecycle

```
ex-coder "your prompt"
    ↓
1. Parse arguments (--model, --no-auto-approve, --dangerous, etc.)
2. Find an available port (random or specified)
3. Start a local proxy on http://127.0.0.1:PORT
4. Spawn: claude --auto-approve --env ANTHROPIC_BASE_URL=http://127.0.0.1:PORT
5. Proxy translates: Anthropic API → OpenRouter API
6. Stream output in real-time
7. Clean up the proxy on exit
```

### Request flow
**Normal Mode (OpenRouter):**

```
Claude Code → Anthropic API format → Local Proxy → OpenRouter API format → OpenRouter
                                                                               ↓
Claude Code ← Anthropic API format ← Local Proxy ← OpenRouter API format ← OpenRouter
```

**Monitor Mode (Anthropic Passthrough):**

```
Claude Code → Anthropic API format → Local Proxy (logs) → Anthropic API
                                                               ↓
Claude Code ← Anthropic API format ← Local Proxy (logs) ← Anthropic API
```

### Per-instance isolation
Each `ex-coder` invocation:

- Gets a unique random port
- Starts an isolated proxy server
- Runs an independent Claude Code instance
- Cleans up on exit

This allows multiple parallel runs:

```bash
# Terminal 1
ex-coder --model x-ai/grok-code-fast-1 "task A"

# Terminal 2
ex-coder --model openai/gpt-5-codex "task B"

# Terminal 3
ex-coder --model minimax/minimax-m2 "task C"
```

## Extended Thinking Support
NEW in v1.1.0: ex-coder now fully supports models with extended thinking/reasoning capabilities (Grok, o1, etc.) with complete Anthropic Messages API protocol compliance.
### Provider thinking translation

ex-coder includes a sophisticated Thinking Translation Model that aligns Claude Code's native thinking budget with the unique requirements of every major AI provider.

When you set a thinking budget in Claude (e.g., `budget: 16000`), ex-coder automatically translates it:

| Provider | Model | Translation Logic |
| :--- | :--- | :--- |
| OpenAI | o1, o3 | Maps budget to `reasoning_effort` (minimal/low/medium/high) |
| Google | Gemini 3 | Maps to `thinking_level` (low/high) |
| Google | Gemini 2.x | Passes exact `thinking_budget` (capped at 24k) |
| xAI | Grok 3 Mini | Maps to `reasoning_effort` (low/high) |
| Qwen | Qwen 2.5 | Enables `enable_thinking` + exact budget |
| MiniMax | M2 | Enables `reasoning_split` (interleaved thinking) |
| DeepSeek | R1 | Automatically manages reasoning (params stripped for safety) |

This ensures you can use standard Claude Code thinking controls with ANY supported model, without worrying about API specifics.
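Two rows of the table can be sketched as simple functions. The effort thresholds follow the "ultrathink"/"think hard"/"think" tiers this README documents for v1.4.0; the function names are illustrative, not ex-coder internals.

```typescript
type ReasoningEffort = "low" | "medium" | "high";

// OpenAI o1/o3: translate a token budget into reasoning_effort.
function budgetToEffort(budgetTokens: number): ReasoningEffort {
  if (budgetTokens >= 32_000) return "high";   // "ultrathink"
  if (budgetTokens >= 16_000) return "medium"; // "think hard"
  return "low";                                // "think"
}

// Gemini 2.x: pass the exact budget, capped at 24k.
function budgetForGemini2(budgetTokens: number): number {
  return Math.min(budgetTokens, 24_000);
}

console.log(budgetToEffort(16_000));     // → medium
console.log(budgetForGemini2(50_000));   // → 24000
```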
### What is extended thinking?
Some AI models (like Grok and OpenAI's o1) can show their internal reasoning process before providing the final answer. This "thinking" content helps you understand how the model arrived at its conclusion.
### How thinking is displayed

ex-coder implements the Anthropic Messages API's interleaved-thinking protocol:

**Thinking Blocks (Hidden):**
- Contains model's reasoning process
- Automatically collapsed in Claude Code UI
- Shows "Claude is thinking..." indicator
- User can expand to view reasoning
Text Blocks (Visible):
- Contains final response
- Displayed normally
- Streams incrementally
### Supported reasoning models
- ✅ x-ai/grok-code-fast-1 - Grok's reasoning mode
- ✅ openai/gpt-5-codex - o1 reasoning (when enabled)
- ✅ openai/o1-preview - Full reasoning support
- ✅ openai/o1-mini - Compact reasoning
- ⚠️ Other models may support reasoning in future
### Implementation details
**Streaming Protocol (V2 - Protocol Compliant):**

```
1. message_start
2. content_block_start (text, index=0)       ← IMMEDIATE! (required)
3. ping
4. [If reasoning arrives]
   - content_block_stop (index=0)            ← Close initial empty block
   - content_block_start (thinking, index=1) ← Reasoning
   - thinking_delta events × N
   - content_block_stop (index=1)
5. content_block_start (text, index=2)       ← Response
6. text_delta events × M
7. content_block_stop (index=2)
8. message_delta + message_stop
```

**Critical:** `content_block_start` must be sent immediately after `message_start`, before `ping`. This is required by the Anthropic Messages API protocol for proper UI initialization.

**Key Features:**
- ✅ Separate thinking and text blocks (proper indices)
- ✅ `thinking_delta` vs `text_delta` event types
- ✅ Thinking content hidden by default
- ✅ Smooth transitions between blocks
- ✅ Full Claude Code UI compatibility

### Before and after
Before (v1.0.0 - No Thinking Support):
- Reasoning visible as regular text
- Confusing output with internal thoughts
- No progress indicators
- "All at once" message updates
After (v1.1.0 - Full Protocol Support):
- ✅ Reasoning hidden/collapsed
- ✅ Clean, professional output
- ✅ "Claude is thinking..." indicator shown
- ✅ Smooth incremental streaming
- ✅ Message headers/structure visible
- ✅ Protocol compliant with Anthropic Messages API
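The critical ordering invariant above (`content_block_start` immediately after `message_start`, before `ping`) can be expressed as a tiny check. This is an illustrative sketch, not the project's actual snapshot test suite.

```typescript
// Check the documented invariant: the first event must be message_start,
// and content_block_start must follow it directly (before ping).
function startsCorrectly(events: string[]): boolean {
  return events[0] === "message_start" && events[1] === "content_block_start";
}

const good = [
  "message_start", "content_block_start", "ping",
  "text_delta", "content_block_stop", "message_delta", "message_stop",
];
const bad = ["message_start", "ping", "content_block_start"];

console.log(startsCorrectly(good)); // → true
console.log(startsCorrectly(bad));  // → false
```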
### Protocol documentation
For complete protocol documentation, see:
- STREAMING_PROTOCOL.md - Complete SSE protocol spec
- PROTOCOL_FIX_V2.md - Critical V2 protocol fix (event ordering)
- COMPREHENSIVE_UX_ISSUE_ANALYSIS.md - Technical analysis
- THINKING_BLOCKS_IMPLEMENTATION.md - Implementation summary
## Dynamic Reasoning Support (NEW in v1.4.0)
ex-coder now intelligently adapts to ANY reasoning model!
No more hardcoded lists or manual flags. ex-coder dynamically queries OpenRouter metadata to enable thinking capabilities for any model that supports them.
### How it works
1. Auto-Detection:
- Automatically checks model capabilities at startup
- Enables Extended Thinking UI only when supported
- Future-proof: Works instantly with new models (e.g., `deepseek-r1` or `minimax-m2`)

2. Smart Parameter Mapping:
- Claude: Passes token budget directly (e.g., 16k tokens)
- OpenAI (o1/o3): Translates budget to `reasoning_effort`
  - "ultrathink" (≥32k) → high
  - "think hard" (16k-32k) → medium
  - "think" (<16k) → low
- Gemini & Grok: Preserves thought signatures and XML traces automatically

3. Universal Compatibility:
- Use "ultrathink" or "think hard" prompts with ANY supported model
- ex-coder handles the translation layer for you
## Context Scaling & Auto-Compaction

NEW in v1.2.0: ex-coder now intelligently manages token counting to support ANY context window size (from 128k to 2M+) while preserving Claude Code's native auto-compaction behavior.

### The problem
Claude Code naturally assumes a fixed context window (typically 200k tokens for Sonnet).
- Small Models (e.g., Grok 128k): Claude might overuse context and crash.
- Massive Models (e.g., Gemini 2M): Claude would compact way too early (at 10% usage), wasting the model's potential.
### The solution
ex-coder implements a "Dual-Accounting" system:
1. Internal Scaling (For Claude):
- We fetch the real context limit from OpenRouter (e.g., 1M tokens).
- We scale reported token usage so Claude thinks 1M tokens is 200k.
- Result: Auto-compaction triggers at the correct percentage of usage (e.g., 90% full), regardless of the actual limit.
2. Accurate Reporting (For You):
- The status line displays the Real Unscaled Usage and Real Context %.
- You see specific costs and limits, while Claude remains blissfully unaware and stable.
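The internal-scaling step above is just a proportion. The sketch below illustrates the arithmetic under the stated assumption that Claude Code expects a 200k window; `scaleForClaude` is a hypothetical name, not ex-coder's actual API.

```typescript
// Dual-accounting sketch: scale real token usage so Claude Code (which
// assumes a 200k window) compacts at the right percentage of the model's
// REAL context limit.
const CLAUDE_ASSUMED_WINDOW = 200_000;

function scaleForClaude(realTokensUsed: number, realContextLimit: number): number {
  return Math.round(realTokensUsed * (CLAUDE_ASSUMED_WINDOW / realContextLimit));
}

// A 1M-context model at 900k used (90% full) is reported as 180k of 200k,
// so Claude's auto-compaction still triggers at the same 90% point.
console.log(scaleForClaude(900_000, 1_000_000)); // → 180000
```

The same formula protects small models: a 128k model at 64k used (50%) is reported as 100k of 200k, so Claude never overruns the real limit.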
Benefits:
- ✅ Works with ANY model size (128k, 1M, 2M, etc.)
- ✅ Unlocks massive context windows (Claude Code becomes 10x more powerful with Gemini!)
- ✅ Prevents crashes on smaller models (Grok)
- ✅ Native behavior (compaction just works)
## Development

### Project structure

```
mcp/ex-coder/
├── src/
│   ├── index.ts          # Main entry point
│   ├── cli.ts            # CLI argument parser
│   ├── proxy-server.ts   # Hono-based proxy server
│   ├── transform.ts      # API format translation (from claude-code-proxy)
│   ├── claude-runner.ts  # Claude CLI runner (creates temp settings)
│   ├── port-manager.ts   # Port utilities
│   ├── config.ts         # Constants and defaults
│   └── types.ts          # TypeScript types
├── tests/                # Test files
├── package.json
├── tsconfig.json
└── biome.json
```

### Proxy implementation
ex-coder uses a Hono-based proxy server inspired by claude-code-proxy:
- Framework: Hono - Fast, lightweight web framework
- API Translation: Converts Anthropic API format ↔ OpenAI format
- Streaming: Full support for Server-Sent Events (SSE)
- Tool Calling: Handles Claude's tool_use ↔ OpenAI's tool_calls
- Battle-tested: Based on production-ready claude-code-proxy implementation
Why Hono?
- Native Bun support (no adapters needed)
- Extremely fast and lightweight
- Middleware support (CORS, logging, etc.)
- Works across Node.js, Bun, and Cloudflare Workers
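The core Anthropic ↔ OpenAI reshaping that the proxy performs can be sketched in a few lines. This is deliberately minimal and not the real `transform.ts` (which also handles tools, content blocks, and streaming); the interfaces and `toOpenAI` name are illustrative.

```typescript
// Minimal request shapes for illustration only.
interface AnthropicRequest {
  model: string;
  system?: string;
  max_tokens: number;
  messages: { role: "user" | "assistant"; content: string }[];
}

interface OpenAIRequest {
  model: string;
  max_tokens: number;
  messages: { role: "system" | "user" | "assistant"; content: string }[];
}

function toOpenAI(req: AnthropicRequest, targetModel: string): OpenAIRequest {
  const messages: OpenAIRequest["messages"] = [];
  // Anthropic carries the system prompt in a top-level field;
  // OpenAI-style APIs expect it as the first message.
  if (req.system) messages.push({ role: "system", content: req.system });
  messages.push(...req.messages);
  // The proxy also swaps the Anthropic model name for the OpenRouter model.
  return { model: targetModel, max_tokens: req.max_tokens, messages };
}

const out = toOpenAI(
  {
    model: "claude-sonnet-4.5",
    system: "Be brief.",
    max_tokens: 4096,
    messages: [{ role: "user", content: "hi" }],
  },
  "x-ai/grok-code-fast-1",
);
console.log(out.messages[0].role); // → system
```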
### Development commands

```bash
# Install dependencies
bun install

# Development mode
bun run dev "test prompt"

# Build
bun run build

# Lint
bun run lint

# Format
bun run format

# Type check
bun run typecheck

# Run tests
bun test
```

### Snapshot testing
ex-coder includes a comprehensive snapshot testing system to ensure 1:1 compatibility with the official Claude Code protocol:
```bash
# Run snapshot tests (13/13 passing ✅)
bun test tests/snapshot.test.ts

# Full workflow: capture fixtures + run tests
./tests/snapshot-workflow.sh --full

# Capture new test fixtures from monitor mode
./tests/snapshot-workflow.sh --capture

# Debug SSE events
bun tests/debug-snapshot.ts
```

**What Gets Tested:**
- ✅ Event sequence (message_start → content_block_start → deltas → stop → message_delta → message_stop)
- ✅ Content block indices (sequential: 0, 1, 2, ...)
- ✅ Tool input streaming (fine-grained JSON chunks)
- ✅ Usage metrics (present in message_start and message_delta)
- ✅ Stop reasons (always present and valid)
- ✅ Cache metrics (creation and read tokens)
Documentation:
- Quick Start Guide - Get started with testing
- Snapshot Testing Guide - Complete testing documentation
- Implementation Details - Technical implementation summary
- Protocol Compliance Plan - Detailed compliance roadmap
### Global link

```bash
# Link for global use
bun run install:global

# Now use anywhere
ex-coder "your task"
```

## Troubleshooting
### "claude" command not found

Install Claude Code:

```bash
npm install -g claude-code
# or visit: https://claude.com/claude-code
```

### Missing OPENROUTER_API_KEY
Set your API key:

```bash
export OPENROUTER_API_KEY=sk-or-v1-...
```

Or add it to your shell profile (`~/.zshrc`, `~/.bashrc`):

```bash
echo 'export OPENROUTER_API_KEY=sk-or-v1-...' >> ~/.zshrc
source ~/.zshrc
```

### Port already in use
Specify a custom port:

```bash
ex-coder --port 3000 "your task"
```

Or increase the port range in `src/config.ts`.

### Model errors
Check OpenRouter API status:
- https://openrouter.ai/status
Verify your API key works:
- https://openrouter.ai/keys
### Status line doesn't show the model
If the status line doesn't show the model name:
1. Check if the `--settings` flag is being passed:

```bash
# Look for this in ex-coder output:
# [ex-coder] Instance settings: /tmp/ex-coder-settings-{timestamp}.json
```

2. Verify the environment variable is set:

```bash
# Should be set automatically by ex-coder
echo $ex-coder_ACTIVE_MODEL_NAME
# Should output something like: xAI/Grok-1
```

3. Test the status line command manually:

```bash
export ex-coder_ACTIVE_MODEL_NAME="xAI/Grok-1"
cat > /dev/null && echo "[$ex-coder_ACTIVE_MODEL_NAME] 📁 $(basename "$(pwd)")"
# Should output: [xAI/Grok-1] 📁 your-directory-name
```

4. Check the temp settings file:

```bash
# Files are created as /tmp/ex-coder-settings-*.json
ls -la /tmp/ex-coder-settings-*.json 2>/dev/null | tail -1
cat /tmp/ex-coder-settings-*.json | head -1
```

5. Verify bash is available:

```bash
which bash
# Should show the path to bash (usually /bin/bash or /usr/bin/bash)
```

**Note:** Temp settings files are automatically cleaned up when ex-coder exits. If you see multiple files, you may have crashed instances; they're safe to delete manually.
## Comparison with Claude Code
| Feature | Claude Code | ex-coder |
|---------|-------------|----------|
| Model | Anthropic models only | Any OpenRouter model |
| API | Anthropic API | OpenRouter API |
| Cost | Anthropic pricing | OpenRouter pricing |
| Setup | API key → direct | API key → proxy → OpenRouter |
| Speed | Direct connection | ~Same (local proxy) |
| Features | All Claude Code features | All Claude Code features |
When to use ex-coder:
- ✅ Want to try different models (Grok, GPT-5, etc.)
- ✅ Need OpenRouter-specific features
- ✅ Prefer OpenRouter pricing
- ✅ Testing model performance
When to use Claude Code:
- ✅ Want latest Anthropic models only
- ✅ Need official Anthropic support
- ✅ Simpler setup (no proxy)
## Contributing

Contributions welcome! Please:

1. Fork the repo
2. Create a feature branch: `git checkout -b feature/amazing`
3. Commit changes: `git commit -m 'Add amazing feature'`
4. Push to the branch: `git push origin feature/amazing`
5. Open a Pull Request

## License

MIT © Scrapegoat Pro Contributors
## Acknowledgments
ex-coder's proxy implementation is based on claude-code-proxy by @kiyo-e. We've adapted their excellent Hono-based API translation layer for OpenRouter integration.
Key contributions from claude-code-proxy:

- Anthropic ↔ OpenAI API format translation (`transform.ts`)

Thank you to the claude-code-proxy team for building a robust, production-ready foundation! 🙏

## Links
- GitHub: https://github.com/ex-coder/ex-coder
- OpenRouter: https://openrouter.ai
- Claude Code: https://claude.com/claude-code
- Bun: https://bun.sh
- Hono: https://hono.dev
- claude-code-proxy: https://github.com/kiyo-e/claude-code-proxy
---
Made with ❤️ by the Ex-Coder core team