@runhuman/mcp

![npm version](https://www.npmjs.com/package/@runhuman/mcp)
![License: ISC](https://opensource.org/licenses/ISC)
![MCP](https://modelcontextprotocol.io/)

A Model Context Protocol (MCP) server that enables AI agents to orchestrate on-demand human QA testing through the Runhuman service.

> Note: This is the MCP server package. For the main CLI tool, see runhuman (coming soon).

What is this?

This MCP server allows AI agents (like Claude, GPT, or any MCP-compatible AI) to request human testing of web applications. The AI defines what to test and what data format it needs back, and real humans perform the testing with structured results extracted automatically.

Perfect for:
- AI coding agents that need human verification of their work
- Automated workflows requiring human-in-the-loop testing
- CI/CD pipelines with human QA gates
- Visual/UX testing that requires real human judgment

Quick Start

$3

1. Get your API key at runhuman.com/dashboard

2. Add to your Claude Desktop config:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json

``json { "mcpServers": { "runhuman": { "command": "npx", "args": ["-y", "@runhuman/mcp", "--api-key=qa_live_xxxxxxxxxxxxx"] } } }`

3. Restart Claude Desktop

That's it! Claude can now request human testing:

> "Can someone test my checkout page at https://myapp.com/checkout and verify the payment flow works?"

`$3`

For other MCP clients (VS Code extensions, custom agents, etc.):

`bash

`Install globally`


npm install -g @runhuman/mcp
Run with your API key

runhuman-mcp --api-key=qa_live_xxxxxxxxxxxxx


Or use npx without installation:

`bash npx @runhuman/mcp --api-key=qa_live_xxxxxxxxxxxxx`

`Features`

`$3`

#### create_jobCreate a new human QA testing job.

Parameters: -url(string, required): The URL to test -description(string, required): Natural language instructions for the human tester describing what to test -schema(object, optional): JSON schema defining the structured output format you want back -targetDurationMinutes (number, optional): Time limit for tester (default: 5, range: 1-60)

Example:`typescript { url: "https://myapp.com/checkout", description: "Test the checkout flow: add item to cart, proceed to checkout, and verify payment options are displayed correctly.", schema: { type: "object", properties: { checkout_loaded: { type: "boolean" }, payment_options_visible: { type: "boolean" }, any_errors: { type: "string" } } }, targetDurationMinutes: 5 }`

Returns: -jobId: Unique identifier for tracking the job -status: Current job status (pending, claimed, in_progress, completed, failed, timeout) -estimatedCompletionTime: When the test is expected to complete -url: The URL being tested -description: Test instructions

#### wait_for_resultCheck status and retrieve results for a QA job.

Parameters: -jobId(string, required): The ID returned from create_job -waitSeconds (number, optional): How long to wait before checking (default: 30, range: 1-300)

Behavior: 1. Checks status immediately (returns right away if already complete) 2. Waits for the specified duration if not complete 3. Checks status again after waiting 4. Returns results if complete, otherwise suggests calling again

Returns when complete: -result: Structured test results matching your schema -status: "completed" -costUsd: Exact cost in USD (e.g., 0.396) -testDurationSeconds: Time spent by tester (rounded up) -testerResponse: Raw natural language feedback from the human -testerAlias: Anonymized tester name (e.g., "Tester Alpha") -testerAvatarUrl: Avatar image URL -testerColor: Hex color for theming - Additional metadata (timestamps, etc.)

Example workflow:`typescript // Create job const job = await create_job({ url: "https://myapp.com", description: "Test the login page" });

// Wait for result with increasing intervals let result = await wait_for_result(job.jobId, { waitSeconds: 30 }); if (result.status !== 'completed') { result = await wait_for_result(job.jobId, { waitSeconds: 60 }); }`

`Configuration`

`$3`

By default, the server connects to the production API at https://runhuman.com. For local development or testing:

`json { "mcpServers": { "runhuman": { "command": "npx", "args": [ "-y", "@runhuman/mcp", "--api-key=qa_live_xxxxx", "--api-url=http://localhost:3000" ] } } }`

`$3`

Alternatively, use environment variables:

`bash export RUNHUMAN_API_KEY=qa_live_xxxxxxxxxxxxx export RUNHUMAN_API_URL=https://runhuman.com # optional`

`Pricing`

Runhuman uses pay-per-second pricing: - $0.0018 per second of tester time - Duration is rounded up to the nearest second - Example: 220 seconds of testing = $0.396

Costs are calculated exactly and returned in the API response. No monthly fees, no minimums.

`How It Works`

1. AI creates job - Your AI agent calls create_jobwith test instructions 2. Job posted to testers - Request goes to the human tester pool via Slack 3. Human performs test - A real person tests the application (video/screenshots recorded) 4. Human reports findings - Tester describes what they observed in natural language 5. AI extracts data - GPT-4o processes feedback into structured JSON matching your schema 6. AI gets results -wait_for_result returns clean, typed data ready for automation

`Example Use Cases`

`$3`


User: "Can you update my checkout page and verify it works?"
Agent: [Updates code, deploys, then uses create_job to verify]
Agent: "✅ Updated and verified by human tester. Payment flow works correctly."

$3

yaml
- name: Human QA Gate
  run: |
    # Deploy preview
    # Call Runhuman API via MCP
    # Fail build if test fails

$3


Agent: "Please verify the new homepage design looks good on mobile"
[Human tester provides UX feedback]
Agent: "Human feedback: Design looks great, minor spacing issue noted..."


Development
$3

`bash

`Clone the repo`


git clone https://github.com/volter-ai/runhuman.git
cd runhuman
Install dependencies

npm install
Build the MCP server

npm run build --workspace=@runhuman/mcp
Run with local API

cd packages/mcp-server
node dist/index.js --api-key=qa_live_test_key_123 --api-url=http://localhost:3000

$3

`bash

`Run tests`


npm run test --workspace=@runhuman/mcp
Use MCP Inspector (interactive testing)

npm run test:inspector --workspace=@runhuman/mcp

$3

`packages/mcp-server/ ├── src/ │ ├── index.ts # CLI entry point (stdio) │ ├── mcp-server-factory.ts # Core server factory │ ├── lib.ts # Library exports │ ├── types.ts # TypeScript types │ └── tools/ # Tool implementations │ ├── create-job.ts │ └── wait-for-result.ts ├── dist/ # Compiled output ├── tests/ # Test suite ├── docs/ # Developer documentation ├── package.json ├── tsconfig.json └── README.md # This file`

`Documentation`

- Model Context Protocol - Learn about MCP - Runhuman Website - Product information - API Documentation - REST API reference - GitHub Repository - Full source code

`$3`

- How Agents Use MCP - Understanding MCP tool discovery - Tool Response Best Practices - Writing effective tool responses - API Authentication - Authentication details

`Integration Examples`

`$3`

Once installed, Claude can naturally use the tools:

User: "Test my app at https://staging.myapp.com and check if the login works"

Claude:`I'll create a QA job to test your login page. [Calls create_job tool] Job created! Waiting for a human tester... [Calls wait_for_result periodically] ✅ Test complete! The login page works correctly. The tester was able to log in successfully with test credentials.`

`$3`

`typescript import { createMcpServer } from '@runhuman/mcp/factory';

const server = createMcpServer({ mode: 'direct', apiUrl: 'https://runhuman.com', apiKey: 'qa_live_xxxxx' });

// Use with your MCP client await server.connect(transport);`

`Troubleshooting`

`$3`

Make sure you've: 1. Created an account at runhuman.com 2. Generated an API key in the dashboard 3. Added the key to your config with the correct format (--api-key=qa_live_...)

`$3`

API keys must start with: -qa_live_for production -qa_test_ for testing

Get a valid key at runhuman.com/dashboard

`$3`

If you're having trouble connecting:

1. Check your API URL is correct (default: https://runhuman.com) 2. Verify your network connection 3. Test the API directly:`bash curl https://runhuman.com/api/jobs \ -H "Authorization: Bearer qa_live_xxxxx" \ -H "Content-Type: application/json" \ -d '{"url":"https://example.com","description":"test","outputSchema":{}}'`

`$3`

1. Verify config file location is correct for your OS 2. Check JSON syntax is valid (use a JSON validator) 3. Restart Claude Desktop completely 4. Check Claude's logs for errors

`Contributing`

This package is part of the Runhuman monorepo. Contributions welcome!

`bash

`Clone the full repo`


git clone https://github.com/volter-ai/runhuman.git
cd runhuman
Install dependencies

npm install
Make changes to packages/mcp-server/
Build and test

npm run build --workspace=@runhuman/mcp
npm run test --workspace=@runhuman/mcp
Submit PR

Support

- Email: hey@runhuman.com
- Issues: GitHub Issues
- Website: runhuman.com

License

ISC License - See LICENSE for details

About Runhuman

Runhuman is an AI-orchestrated human QA testing service. Part of the Volter AI ecosystem.

Key Features:
- On-demand human testing (no hiring/managing)
- AI-powered orchestration and result extraction
- Pay-per-second pricing ($0.0018/sec)
- Custom output schemas for structured results
- Browser automation with video/screenshot recording
- GitHub Actions integration
- REST API and MCP server for any workflow

---

Built with ❤️ by the Volter AI team

@runhuman/mcp

![npm version](https://www.npmjs.com/package/@runhuman/mcp)
![License: ISC](https://opensource.org/licenses/ISC)
![MCP](https://modelcontextprotocol.io/)

A Model Context Protocol (MCP) server that enables AI agents to orchestrate on-demand human QA testing through the Runhuman service.

> Note: This is the MCP server package. For the main CLI tool, see runhuman (coming soon).

What is this?

Quick Start

$3

1. Get your API key at runhuman.com/dashboard

2. Add to your Claude Desktop config:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json

``json { "mcpServers": { "runhuman": { "command": "npx", "args": ["-y", "@runhuman/mcp", "--api-key=qa_live_xxxxxxxxxxxxx"] } } }`

3. Restart Claude Desktop

That's it! Claude can now request human testing:

> "Can someone test my checkout page at https://myapp.com/checkout and verify the payment flow works?"

`$3`

For other MCP clients (VS Code extensions, custom agents, etc.):

`bash

`Install globally`


npm install -g @runhuman/mcp
Run with your API key

runhuman-mcp --api-key=qa_live_xxxxxxxxxxxxx


Or use npx without installation:

`bash npx @runhuman/mcp --api-key=qa_live_xxxxxxxxxxxxx`

`Features`

`$3`

#### create_jobCreate a new human QA testing job.

#### wait_for_resultCheck status and retrieve results for a QA job.

Parameters: -jobId(string, required): The ID returned from create_job -waitSeconds (number, optional): How long to wait before checking (default: 30, range: 1-300)

Example workflow:`typescript // Create job const job = await create_job({ url: "https://myapp.com", description: "Test the login page" });

`Configuration`

`$3`

By default, the server connects to the production API at https://runhuman.com. For local development or testing:

`json { "mcpServers": { "runhuman": { "command": "npx", "args": [ "-y", "@runhuman/mcp", "--api-key=qa_live_xxxxx", "--api-url=http://localhost:3000" ] } } }`

`$3`

Alternatively, use environment variables:

`bash export RUNHUMAN_API_KEY=qa_live_xxxxxxxxxxxxx export RUNHUMAN_API_URL=https://runhuman.com # optional`

`Pricing`

Runhuman uses pay-per-second pricing: - $0.0018 per second of tester time - Duration is rounded up to the nearest second - Example: 220 seconds of testing = $0.396

Costs are calculated exactly and returned in the API response. No monthly fees, no minimums.

`How It Works`

`Example Use Cases`

`$3`


User: "Can you update my checkout page and verify it works?"
Agent: [Updates code, deploys, then uses create_job to verify]
Agent: "✅ Updated and verified by human tester. Payment flow works correctly."

$3

yaml
- name: Human QA Gate
  run: |
    # Deploy preview
    # Call Runhuman API via MCP
    # Fail build if test fails

$3


Agent: "Please verify the new homepage design looks good on mobile"
[Human tester provides UX feedback]
Agent: "Human feedback: Design looks great, minor spacing issue noted..."


Development
$3

`bash

`Clone the repo`


git clone https://github.com/volter-ai/runhuman.git
cd runhuman
Install dependencies

npm install
Build the MCP server

npm run build --workspace=@runhuman/mcp
Run with local API

cd packages/mcp-server
node dist/index.js --api-key=qa_live_test_key_123 --api-url=http://localhost:3000

$3

`bash

`Run tests`


npm run test --workspace=@runhuman/mcp
Use MCP Inspector (interactive testing)

npm run test:inspector --workspace=@runhuman/mcp

$3

`Documentation`

- Model Context Protocol - Learn about MCP - Runhuman Website - Product information - API Documentation - REST API reference - GitHub Repository - Full source code

`$3`

- How Agents Use MCP - Understanding MCP tool discovery - Tool Response Best Practices - Writing effective tool responses - API Authentication - Authentication details

`Integration Examples`

`$3`

Once installed, Claude can naturally use the tools:

User: "Test my app at https://staging.myapp.com and check if the login works"

`$3`

`typescript import { createMcpServer } from '@runhuman/mcp/factory';

const server = createMcpServer({ mode: 'direct', apiUrl: 'https://runhuman.com', apiKey: 'qa_live_xxxxx' });

// Use with your MCP client await server.connect(transport);`

`Troubleshooting`

`$3`

Make sure you've: 1. Created an account at runhuman.com 2. Generated an API key in the dashboard 3. Added the key to your config with the correct format (--api-key=qa_live_...)

`$3`

API keys must start with: -qa_live_for production -qa_test_ for testing

Get a valid key at runhuman.com/dashboard

`$3`

If you're having trouble connecting:

`$3`

1. Verify config file location is correct for your OS 2. Check JSON syntax is valid (use a JSON validator) 3. Restart Claude Desktop completely 4. Check Claude's logs for errors

`Contributing`

This package is part of the Runhuman monorepo. Contributions welcome!

`bash

`Clone the full repo`


git clone https://github.com/volter-ai/runhuman.git
cd runhuman
Install dependencies

npm install
Make changes to packages/mcp-server/
Build and test

npm run build --workspace=@runhuman/mcp
npm run test --workspace=@runhuman/mcp
Submit PR

Support

- Email: hey@runhuman.com
- Issues: GitHub Issues
- Website: runhuman.com

License

ISC License - See LICENSE for details

About Runhuman

Runhuman is an AI-orchestrated human QA testing service. Part of the Volter AI ecosystem.

---

Built with ❤️ by the Volter AI team