GitHub - pacaplan/agent-gauntlet: Don't just review the agent's code; put it through the gauntlet.

Don't just review the agent's code — put it through the gauntlet.

Agent Gauntlet is a configurable “feedback loop” runner for AI-assisted development workflows.

You configure which paths in your repo should trigger which validations — shell commands like tests and linters, plus AI-powered local code reviews. When files change, Gauntlet automatically runs the relevant validations and reports results.

For AI reviews, it uses the CLI tool of your choice: Gemini, Codex, Claude Code, GitHub Copilot, or Cursor.

Features

Agent validation loop: Keep your coding agent on track with automated feedback loops. Detect problems — deterministically and/or non-deterministically — and let your agent fix and Gauntlet verify.
Local cross-agent code reviews: Enable one AI agent to automatically request code reviews from another. For example, if Claude made changes, Gauntlet can request a review from Codex — spreading token usage across your subscriptions instead of burning through one.
- Multiple AI review adapters have been evaluated for quality and efficiency. Claude and Codex deliver optimal review quality with superior token efficiency. For detailed metrics, see Eval Results.
Leverage existing subscriptions: Agent Gauntlet is free and tool-agnostic, leveraging the AI CLI tools you already have installed.
Easy CI setup: Define your CI gates once, run them locally and in GitHub.

Common Workflows

Agent Gauntlet supports three workflows, ranging from simple CLI execution to fully autonomous agentic integration:

CLI Mode — Run checks via command line; ideal for CI pipelines and scripts.
Assistant Mode — AI assistant runs validation loop, fixing issues iteratively.
Agentic Mode — Autonomous agent validates and fixes in real-time via stop hook (experimental).

Example Workflow

Claude implements a feature
Agent Gauntlet reports quality issues detected by static code analysis and Codex reviewer agent
Claude fixes issues
Agent Gauntlet verifies

Comparison vs Other Tools

AI Code Review Tools

Agent Gauntlet is not a replacement for tools that provide AI pull request code reviews. It provides real-time feedback loops for autonomous coding agents, combining deterministic static checks (build, lint, test) with multi-agent AI reviews in a single pipeline. This enables agents to iterate and self-correct until all checks and reviews pass, without human intervention.

Full comparison →

Spec-Driven Workflow Tools

It is recommended to use Agent Gauntlet in conjunction with other spec-driven development tools. We believe is the ideal implementation of the validation step in any Spec → Implement → Validate workflow.

Quick Start

For basic usage and configuration guide, see the Quick Start Guide.

Documentation

Quick Start Guide — installation, basic usage, and config layout
User Guide — full usage details
Configuration Reference — all configuration fields + defaults
Stop Hook Guide — integrate with Claude Code's stop hook (experimental).
CLI Invocation Details — how we securely invoke AI CLIs
Feature Comparison — how Agent Gauntlet compares to other tools
Development Guide — how to build and develop this project

Name		Name	Last commit message	Last commit date
Latest commit History 439 Commits
.agent/workflows		.agent/workflows
.changeset		.changeset
.claude		.claude
.codescene		.codescene
.config		.config
.cursor		.cursor
.gauntlet		.gauntlet
.gemini/commands/openspec		.gemini/commands/openspec
.github/workflows		.github/workflows
docs		docs
evals		evals
openspec		openspec
src		src
test		test
.bunfig.toml		.bunfig.toml
.cursorrules		.cursorrules
.gitignore		.gitignore
.markdownlintignore		.markdownlintignore
.npmrc		.npmrc
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
biome.json		biome.json
build.ts		build.ts
bun.lock		bun.lock
package.json		package.json
test_filter.ts		test_filter.ts
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Features

Common Workflows

Example Workflow

Comparison vs Other Tools

AI Code Review Tools

Spec-Driven Workflow Tools

Quick Start

Documentation

About

Uh oh!

Releases 8

Packages

Contributors 4

Uh oh!

Languages

License

pacaplan/agent-gauntlet

Folders and files

Latest commit

History

Repository files navigation

Features

Common Workflows

Example Workflow

Comparison vs Other Tools

AI Code Review Tools

Spec-Driven Workflow Tools

Quick Start

Documentation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 8

Packages 0

Contributors 4

Uh oh!

Languages

Packages