A CLI tool for testing AI coding agents
npm install agent-gauntlet





> Don't just review the agent's code — put it through the gauntlet.
Agent Gauntlet is a configurable “feedback loop” runner for AI-assisted development workflows.
You configure which paths in your repo should trigger which validations — shell commands like tests and linters, plus AI-powered code reviews. When files change, Gauntlet automatically runs the relevant validations and reports results.
For AI reviews, it uses the CLI tool of your choice: Gemini, Codex, Claude Code, GitHub Copilot, or Cursor.
- Agent validation loop: Keep your coding agent on track with automated feedback loops. Detect problems — deterministically and/or non-deterministically — and let your agent fix and Gauntlet verify.
- Multi-agent collaboration: Enable one AI agent to automatically request code reviews from another. For example, if Claude made changes, Gauntlet can request a review from Codex or Gemini — spreading token usage across your subscriptions instead of burning through one.
- Leverage existing subscriptions: Agent Gauntlet is free and tool-agnostic, leveraging the AI CLI tools you already have installed.
- Easy CI setup: Define your checks once, run them locally and in GitHub.
Unlike traditional code review tools designed for PR workflows, Agent Gauntlet provides real-time feedback loops for autonomous coding agents.
| Use Case | Recommended |
| :--- | :--- |
| Autonomous agent development | Agent Gauntlet |
| Traditional PR review with human reviewers | Other tools |
| IDE-integrated review while coding | Other tools |
| Enterprise with strict compliance requirements | Other tools |
| Budget-conscious teams with existing AI CLI tools | Agent Gauntlet |
Agent Gauntlet supports three workflows, ranging from simple CLI execution to fully autonomous agentic integration:
- CLI Mode — Run checks via command line; ideal for CI pipelines and scripts.
- Assistant Mode — AI assistant runs validation loop, fixing issues iteratively.
- Agentic Mode — Autonomous agent validates and fixes in real-time via stop hook.
1. Install: bun add -g agent-gauntlet
2. Initialize: agent-gauntlet init
3. Run: agent-gauntlet run
For basic usage and configuration guide, see the Quick Start Guide.
- Quick Start Guide — installation, basic usage, and config layout
- User Guide — full usage details
- Configuration Reference — all configuration fields + defaults
- Stop Hook Guide — integrate with Claude Code's stop hook
- CLI Invocation Details — how we securely invoke AI CLIs
- Feature Comparison — how Agent Gauntlet compares to other tools
- Development Guide — how to build and develop this project