LIVE
ANTHROPICOpus 4.7 benchmarks published2m ago
CLAUDEOK142ms
OPUS 4.7$15 / $75per Mtok
CHATGPTOK89ms
HACKERNEWSWhy has not AI improved design quality the way it improved dev speed?14m ago
MMLU-PROleader Opus 4.788.4
GEMINIDEGRADED312ms
MISTRALMistral Medium 3 released6m ago
GPT-4o$5 / $15per Mtok
ARXIVCompositional reasoning in LRMs22m ago
BEDROCKOK178ms
GEMINI 2.5$3.50 / $10.50per Mtok
THE VERGEFrontier Model Forum expansion announced38m ago
SWE-BENCHleader Claude Opus 4.772.1%
MISTRALOK104ms
ANTHROPICOpus 4.7 benchmarks published2m ago
CLAUDEOK142ms
OPUS 4.7$15 / $75per Mtok
CHATGPTOK89ms
HACKERNEWSWhy has not AI improved design quality the way it improved dev speed?14m ago
MMLU-PROleader Opus 4.788.4
GEMINIDEGRADED312ms
MISTRALMistral Medium 3 released6m ago
GPT-4o$5 / $15per Mtok
ARXIVCompositional reasoning in LRMs22m ago
BEDROCKOK178ms
GEMINI 2.5$3.50 / $10.50per Mtok
THE VERGEFrontier Model Forum expansion announced38m ago
SWE-BENCHleader Claude Opus 4.772.1%
MISTRALOK104ms
All harnesses

Claude Code

Anthropic

Anthropic's official terminal agent for Claude. Ships as a CLI installable from npm, with first-class support for MCP servers, custom hooks, slash commands, subagent orchestration, and a project-level CLAUDE.md memory file. The harness benefits from being co-designed with Claude's agentic post-training, which is the main reason it tops most agentic-coding leaderboards when paired with Opus 4.7 or Sonnet 4.6.

Type
cli
License
Proprietary
Model story
Anthropic models only
Vendor
Anthropic

Leaderboard Placements

BenchmarkBest base modelScoreRank
SWE-bench Verified Claude Opus 4.774.5#1 / 15
Terminal-Bench Claude Opus 4.752.3#1 / 13
Aider Polyglot Claude Opus 4.784.2#1 / 7
SWE-Lancer Claude Opus 4.741.8#1 / 5

Distribution

CLI installable via npm. Also available as a VS Code and JetBrains extension, plus a desktop app.

Model Story

Anthropic models only. Routes Opus 4.7 by default, with Sonnet 4.6 and Haiku 4.5 selectable. No bring-your-own-key for non-Anthropic models.

Pricing

Subscription tied to Anthropic API credits or a flat-rate Pro/Max plan. No per-tool fees.

Who It's For

Engineers running long, autonomous coding sessions in a terminal where MCP integrations and project-aware memory pay off.

Notable Features

  • Native MCP server orchestration (single config, hot reload)
  • CLAUDE.md per-project and ~/.claude/CLAUDE.md global memory
  • Hooks: pre-tool, post-tool, on-stop
  • Subagent delegation with isolated worktrees
  • Plan mode with approval gating
Vendor site for Claude Code:https://www.anthropic.com/claude-code

Other Harnesses