Aider

Paul Gauthier

Aider is one of the original edit-by-diff coding agents and the harness behind the widely cited Aider Polyglot leaderboard. It runs locally, works with any model exposed through an OpenAI-compatible API, and is known for applying edits as diffs rather than rewriting whole files, which keeps token costs down on long sessions.

Type: CLI
License: Open source
Model story: Multi-model, BYOK
Vendor: Paul Gauthier

Leaderboard Placements

Benchmark             Best base model    Score    Rank
SWE-bench Verified
Terminal-Bench        Claude Opus 4.7    31.2     #8 / 13
Aider Polyglot        Claude Opus 4.7    84.2     #2 / 7
SWE-Lancer

Distribution

Open-source Python CLI. Install via pip. Apache 2.0 license.

Model Story

Multi-model. Anthropic, OpenAI, Google, DeepSeek, and any OpenAI-compatible endpoint. Bring your own key.

Pricing

Free harness; you pay for the underlying API tokens.
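
Because the only cost is API tokens, a rough session estimate is simple arithmetic. The sketch below is illustrative only: the function name, the token counts, and the per-million-token prices are all placeholder assumptions, not figures from Aider or any provider. It compares a session where the model re-emits whole files with one where it emits only diffs.

```python
def session_cost_usd(input_tokens: int, output_tokens: int,
                     input_price_per_mtok: float,
                     output_price_per_mtok: float) -> float:
    """Estimate API cost in USD from token counts and per-million-token prices."""
    total = (input_tokens * input_price_per_mtok
             + output_tokens * output_price_per_mtok)
    return total / 1_000_000


# Placeholder prices (USD per million tokens); substitute your provider's rates.
INPUT_PRICE = 3.0
OUTPUT_PRICE = 15.0

# Hypothetical session: same prompt context, different amounts of model output.
whole_file_cost = session_cost_usd(200_000, 20_000, INPUT_PRICE, OUTPUT_PRICE)
diff_cost = session_cost_usd(200_000, 2_000, INPUT_PRICE, OUTPUT_PRICE)

print(f"whole-file rewrites: ${whole_file_cost:.2f}")
print(f"diff edits:          ${diff_cost:.2f}")
```

Under these assumed numbers the saving comes entirely from output tokens, which are typically priced higher than input tokens.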

Who It's For

Engineers who want a local-first, model-agnostic terminal agent and care about token efficiency on long edits.

Notable Features

  • Edit-by-diff over whole-file rewrites
  • Self-published Polyglot leaderboard (225 hardest Exercism tasks)
  • Git-aware: every edit is a commit
  • Voice mode
  • Architect / Editor split-model routing
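
To make the edit-by-diff idea concrete, here is a minimal sketch of applying a search/replace style edit to a file's contents. It is a simplified illustration, not Aider's actual implementation: `SearchReplaceEdit` and `apply_edit` are hypothetical names, and real edit formats and matching logic are more involved.

```python
from dataclasses import dataclass


@dataclass
class SearchReplaceEdit:
    """One hypothetical edit: swap an exact snippet for new text."""
    search: str
    replace: str


def apply_edit(source: str, edit: SearchReplaceEdit) -> str:
    """Apply one edit, requiring the search text to match exactly once."""
    count = source.count(edit.search)
    if count != 1:
        raise ValueError(f"expected exactly 1 match, found {count}")
    return source.replace(edit.search, edit.replace, 1)


original = (
    "def greet(name):\n"
    "    print('Hello ' + name)\n"
    "\n"
    "def farewell(name):\n"
    "    print('Bye ' + name)\n"
)

# The model only has to emit the changed snippet, not the whole file.
edit = SearchReplaceEdit(
    search="    print('Hello ' + name)\n",
    replace="    print(f'Hello, {name}!')\n",
)

updated = apply_edit(original, edit)
print(updated)
```

The edit payload stays small no matter how large the file is, which is where the token savings over whole-file rewrites come from. Rejecting ambiguous or missing matches is a design choice in this sketch so that a bad edit fails loudly instead of silently changing the wrong place.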

Vendor site for Aider: https://aider.chat

Other Harnesses