Last Updated: April 2026

The Frontier Model Wars

Real-time tracking of the race between Anthropic, OpenAI, Google, and every lab pushing the edge.

15
Active models
70
Benchmark scores
...
Last update

Every frontier AI lab is running the same race. Scale up compute, scale up data, push a model through pre-training, run an increasingly elaborate post-training pipeline, stamp a release candidate, and ship. The top of the pack has never been more crowded. Anthropic, OpenAI, and Google DeepMind trade the overall lead every few weeks. xAI has closed more ground than most people expected. Meta is running the open weight strategy and winning it. DeepSeek keeps proving that efficiency is a category of its own.

This page is where we track the state of that race in public. The leaderboard updates daily from our benchmark pipeline. The recent releases section pulls straight from the pricing database and filters to anything shipped in the last ninety days. The winners by category read off the latest benchmark scores and pricing data. The news feed at the bottom filters the full TensorFeed stream for model release coverage only.

Current Frontier Leaderboard

Ranked by average score across MMLU-Pro, HumanEval, GPQA Diamond, MATH, and SWE-bench. Models without complete benchmark coverage are omitted.

RankModelProviderReleasedAvg ScorePricing (in / out, per 1M)
#1Claude Opus 4.6AnthropicMar 202683.2$15.00 / $75.00
#2o1OpenAIDec 202482.4$15.00 / $60.00
#3Gemini 2.5 ProGoogleMar 202581.4$1.25 / $10.00
#4Claude Sonnet 4.6AnthropicMar 202677.5$3.00 / $15.00
#5Llama 4 MaverickMetaApr 202576.9$0.00 / $0.00
#6o3-miniOpenAIJan 202574.5$1.10 / $4.40
#7GPT-4oOpenAIMay 202473.3$2.50 / $10.00
#8Mistral LargeMistralJan 202572.0$2.00 / $6.00
#9Llama 4 ScoutMetaApr 202571.0$0.00 / $0.00
#10Gemini 2.0 FlashGoogleFeb 202569.4$0.10 / $0.40
#11Claude Haiku 4.5AnthropicJun 202567.3$0.80 / $4.00
#12Mistral SmallMistralJan 202561.8$0.10 / $0.30

Want to drill into a specific benchmark? See the full benchmarks page.

Head to Head Matchups

The three most searched frontier matchups, at a glance. Each card pulls live scores from the benchmark database.

Claude Opus 4.6

Anthropic
83.2
Average across 5 benchmarks
MMLU-Pro92.4
HumanEval95.1
GPQA Diamond74.2
MATH91.8
SWE-bench62.3

GPT-4.5

OpenAI
79.3
Average across 5 benchmarks
MMLU-Pro90.1
HumanEval93.4
GPQA Diamond68.7
MATH88.2
SWE-bench56.1

Gemini 2.5 Pro

Google
81.4
Average across 5 benchmarks
MMLU-Pro91.2
HumanEval93.8
GPQA Diamond71.9
MATH90.5
SWE-bench59.4

Build your own matchup on the compare tool.

Recent Releases (Last 90 Days)

Every frontier model released in the last quarter, newest first. Auto-updated from the model database.

Mar 2026Claude Opus 4.6
Anthropic
Mar 2026Claude Sonnet 4.6
Anthropic

Who Is Winning at What?

Category leaders pulled live from the benchmark and pricing databases. Scores update daily.

Best at Coding
Claude Opus 4.6
Anthropic · SWE-bench
62.3 / 100
Best at Reasoning
Claude Opus 4.6
Anthropic · GPQA Diamond
74.2 / 100
Best at Math
o1
OpenAI · MATH benchmark
94.6 / 100
Best Long Context
Llama 4 Scout
Meta · Context window
10M tokens
Best Value
Mistral Small
Mistral · Lowest blended price
$0.10 in / $0.30 out per 1M
Best Open Source
Llama 4 Maverick
Meta · Top open weight
Open weights available

Incoming Models

What the rumor mill says is coming next. Treat everything here as unofficial unless linked directly to a lab announcement.

Claude MythosAnthropic
Private preview

In closed preview with 40 launch partners. Pitched as a step change in long horizon agentic reasoning. No public release date.

GPT-5.5OpenAI
Rumored Q2 2026

Multiple leaks describe a unified model that folds in reasoning, voice, and tool use by default. Training reportedly completed in late 2025.

Gemini 3.5Google
Expected 2026

Google has signaled the next Gemini generation at upcoming I/O. Focus areas are deep research agents, video understanding, and multi step tool use.

Grok 4xAI
Announced

xAI has publicly targeted a next generation Grok trained on the Memphis cluster. Elon Musk has framed it as 'smarter than any human.'

Llama 5Meta
Rumored late 2026

Meta has not confirmed timing, but internal comments suggest the next open weight Llama release will push past Llama 4 Maverick on reasoning.

Provider Spotlights

Each frontier lab is running a different strategy. Understanding the strategic posture helps predict the next move.

Anthropic

Anthropic has leaned hard into agentic coding as the wedge. Claude has become the default model for serious developer tools, which creates a revenue base that funds frontier training. The company pairs this with the most aggressive public stance on safety of any major lab. Responsible scaling commitments, detailed model cards, and constitutional AI research are all part of a single story: if you believe the most capable models are coming soon, your commercial strategy should be inseparable from your safety strategy.

OpenAI

OpenAI still owns the largest consumer surface area in AI. ChatGPT is the default chatbot for a huge fraction of the market, which creates data, revenue, and distribution. The o-series reasoning models were the first public bet on test time compute as a primary capability lever, and the results shifted how every other lab thinks about reasoning. Expect continued emphasis on multimodal, voice, and vertical integration through Operator, Sora, and Codex.

Google DeepMind

Google has structural advantages no one else has. Custom TPU infrastructure, a search index, YouTube, and decades of research depth at DeepMind. Gemini has closed most of the capability gap and owns the long context and multimodal categories. The real leverage is distribution: Gemini is shipping into every Google surface, from Search to Workspace to Android. Every Google user becomes a Gemini user by default.

xAI

xAI has compressed an enormous amount of capability into a short timeline. The Memphis training cluster came online with unusual speed, and Grok 3 closed most of the gap to the frontier. Distribution through X gives xAI a feedback loop that other labs do not have. The open question is how long the pace can be sustained.

Meta

Meta is the single biggest reason open weight models exist at the frontier. The Llama family has forced every other lab to compete on value, not just capability. Mark Zuckerberg has framed open weights as a strategic asset, not a charity move, and the Llama 4 family proved that open weight models can be competitive on real benchmarks.

Latest Model Wars News

Live stream of model release and frontier capability coverage. Filtered from the full TensorFeed news feed.

Frequently Asked Questions

Which AI model is the best in 2026?

There is no single best model. Claude Opus 4.6 leads on coding and agentic benchmarks. OpenAI o1 and o3 lead on math reasoning. Gemini 2.5 Pro leads on long context and multimodal. The right answer depends on the workload. See our benchmark leaderboard for category winners.

Who is winning the frontier AI race?

As of April 2026, Anthropic, OpenAI, and Google DeepMind are effectively tied at the top of the frontier, with each lab leading on specific categories. xAI has closed a lot of ground with Grok 3. Meta leads the open weight race with Llama 4. Deepseek has continued to punch above its weight on efficiency.

How often do new frontier models ship?

Frontier releases now land every four to eight weeks on average. Anthropic, OpenAI, and Google each ship a major update every quarter, interspersed with point releases. Open weight launches from Meta, Mistral, and DeepSeek add to the cadence.

What is a frontier model?

A frontier model is one of the most capable AI systems publicly available at a given moment, typically defined by benchmark performance, compute used during training, and agentic capability. The frontier moves constantly as new releases ship.

Which frontier model is cheapest?

Among frontier tier models, Claude Haiku 4.5, Gemini Flash, and GPT-4o mini consistently offer the best price per token. For open weights, Llama 4 and DeepSeek V3 are close to free when you self host.

Related Hubs

Keep tracking the frontier from every angle.