Last Updated: April 2026

The Frontier Model Wars

Real-time tracking of the race between Anthropic, OpenAI, Google, and every lab pushing the edge.

Every frontier AI lab is running the same race. Scale up compute, scale up data, push a model through pre-training, run an increasingly elaborate post-training pipeline, stamp a release candidate, and ship. The top of the pack has never been more crowded. Anthropic, OpenAI, and Google DeepMind trade the overall lead every few weeks. xAI has closed more ground than most people expected. Meta is running the open weight strategy and winning it. DeepSeek keeps proving that efficiency is a category of its own.

This page is where we track the state of that race in public. The leaderboard updates daily from our benchmark pipeline. The recent releases section pulls straight from the pricing database and filters to anything shipped in the last ninety days. The winners by category read off the latest benchmark scores and pricing data. The news feed at the bottom filters the full TensorFeed stream for model release coverage only.

Current Frontier Leaderboard

Ranked by average score across MMLU-Pro, HumanEval, GPQA Diamond, MATH, and SWE-bench. Models without complete benchmark coverage are omitted.

Rank	Model	Provider	Released	Avg Score	Pricing (input / output, USD per 1M tokens)
#1	GPT-5.5	OpenAI	Apr 2026	86.8	$5.00 / $30.00
#2	Claude Opus 4.7	Anthropic	Apr 2026	85.0	$15.00 / $75.00
#3	Claude Opus 4.6	Anthropic	Mar 2026	83.2	$15.00 / $75.00
#4	DeepSeek V4 Pro	DeepSeek	Apr 2026	83.1	$1.74 / $3.48
#5	o1	OpenAI	Dec 2024	82.4	$15.00 / $60.00
#6	Gemini 2.5 Pro	Google	Mar 2025	81.4	$1.25 / $10.00
#7	Claude Sonnet 4.6	Anthropic	Mar 2026	77.5	$3.00 / $15.00
#8	Llama 4 Maverick	Meta	Apr 2025	76.9	$0.00 / $0.00
#9	o3-mini	OpenAI	Jan 2025	74.5	$1.10 / $4.40
#10	GPT-4o	OpenAI	May 2024	73.3	$2.50 / $10.00
#11	DeepSeek V4 Flash	DeepSeek	Apr 2026	72.9	$0.14 / $0.28
#12	Mistral Large	Mistral	Jan 2025	72.0	$2.00 / $6.00
#13	Llama 4 Scout	Meta	Apr 2025	71.0	$0.00 / $0.00
#14	Gemini 2.0 Flash	Google	Feb 2025	69.4	$0.10 / $0.40
#15	Claude Haiku 4.5	Anthropic	Jun 2025	67.3	$0.80 / $4.00
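
The ranking rule stated above (average over the five benchmarks, with incomplete models omitted) can be sketched in a few lines of Python. This is an illustration only, not the pipeline behind this page: the `rank_models` helper and the data layout are assumptions, rounding to one decimal is inferred from the table, and the scores are the per-benchmark figures quoted in the head-to-head section of this page. The entry marked hypothetical is invented purely to exercise the omission rule.

```python
BENCHMARKS = ("MMLU-Pro", "HumanEval", "GPQA Diamond", "MATH", "SWE-bench")

# Per-benchmark scores as quoted in the head-to-head section of this page.
SCORES = {
    "Claude Opus 4.7": {"MMLU-Pro": 93.8, "HumanEval": 96.2,
                        "GPQA Diamond": 76.5, "MATH": 93.1, "SWE-bench": 65.4},
    "GPT-4.5": {"MMLU-Pro": 90.1, "HumanEval": 93.4,
                "GPQA Diamond": 68.7, "MATH": 88.2, "SWE-bench": 56.1},
    "Gemini 2.5 Pro": {"MMLU-Pro": 91.2, "HumanEval": 93.8,
                       "GPQA Diamond": 71.9, "MATH": 90.5, "SWE-bench": 59.4},
    # Hypothetical entry with missing benchmarks; not a real model or score.
    "Partial Model (hypothetical)": {"MMLU-Pro": 80.0, "MATH": 70.0},
}


def rank_models(scores, benchmarks=BENCHMARKS):
    """Return (model, average) pairs, best first, skipping incomplete models."""
    ranked = []
    for model, results in scores.items():
        # Omit any model that lacks a score for one of the benchmarks.
        if not all(name in results for name in benchmarks):
            continue
        average = sum(results[name] for name in benchmarks) / len(benchmarks)
        ranked.append((model, round(average, 1)))
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


LEADERBOARD = rank_models(SCORES)
for position, (model, average) in enumerate(LEADERBOARD, start=1):
    print(f"#{position} {model}: {average}")
```

With these inputs the computed averages (85.0, 81.4, 79.3) match the figures shown on this page for the same three models, and the incomplete entry is dropped.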

Want to drill into a specific benchmark? See the full benchmarks page.

Head to Head Matchups

The three most searched frontier matchups, at a glance. Each card pulls live scores from the benchmark database.

Model	Provider	Avg (5 benchmarks)	MMLU-Pro	HumanEval	GPQA Diamond	MATH	SWE-bench
Claude Opus 4.7	Anthropic	85.0	93.8	96.2	76.5	93.1	65.4
GPT-4.5	OpenAI	79.3	90.1	93.4	68.7	88.2	56.1
Gemini 2.5 Pro	Google	81.4	91.2	93.8	71.9	90.5	59.4

Build your own matchup on the compare tool.

Recent Releases (Last 90 Days)

Every frontier model released in the last quarter, newest first. Auto-updated from the model database.

Released	Model	Provider
May 2026	Gemini 3.1 Flash-Lite	Google
May 2026	Gemini 3.5 Flash	Google
May 2026	Qwen3.7-Max	Alibaba
Apr 2026	Claude Opus 4.7	Anthropic
Apr 2026	GPT-5.5	OpenAI
Apr 2026	DeepSeek V4 Pro	DeepSeek
Apr 2026	DeepSeek V4 Flash	DeepSeek
Apr 2026	Nemotron 3 Nano Omni	NVIDIA
Mar 2026	Claude Opus 4.6	Anthropic
Mar 2026	Claude Sonnet 4.6	Anthropic
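
The ninety-day filter behind this list could look roughly like the sketch below. Everything here is an assumption for illustration: the `recent_releases` helper, the tuple layout, and the `AS_OF` reference date are hypothetical, and because this page lists only release months, each date uses the first of the month as a placeholder day.

```python
from datetime import date, timedelta


def recent_releases(releases, as_of, window_days=90):
    """Keep releases dated within window_days before as_of, newest first."""
    cutoff = as_of - timedelta(days=window_days)
    kept = [entry for entry in releases if cutoff <= entry[0] <= as_of]
    return sorted(kept, key=lambda entry: entry[0], reverse=True)


# Months come from this page; the day-of-month values are placeholders.
RELEASES = [
    (date(2026, 3, 1), "Claude Opus 4.6", "Anthropic"),
    (date(2026, 5, 1), "Qwen3.7-Max", "Alibaba"),
    (date(2026, 4, 1), "GPT-5.5", "OpenAI"),
    (date(2025, 3, 1), "Gemini 2.5 Pro", "Google"),
]

AS_OF = date(2026, 5, 15)  # hypothetical reference date, not from the page
RECENT = recent_releases(RELEASES, AS_OF)
for released, model, provider in RECENT:
    print(released.strftime("%b %Y"), model, provider)
```

Passing the reference date in as a parameter, rather than calling `date.today()` inside the function, keeps the filter easy to test against a fixed date.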

Who Is Winning at What?

Category leaders pulled live from the benchmark and pricing databases. Scores update daily.

Category	Model	Provider	Metric	Result
Best at Coding	GPT-5.5	OpenAI	SWE-bench	68.7 / 100
Best at Reasoning	GPT-5.5	OpenAI	GPQA Diamond	78.3 / 100
Best at Math	GPT-5.5	OpenAI	MATH benchmark	95.8 / 100
Best Long Context	Llama 4 Scout	Meta	Context window	10M tokens
Best Value	Mistral Small	Mistral	Lowest blended price	$0.10 in / $0.30 out per 1M
Best Open Source	DeepSeek V4 Pro	DeepSeek	Top open weight	Open weights available
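
Reading a category leader off the benchmark scores amounts to taking the maximum on one benchmark. The sketch below shows that step for the three score-based categories above. It is an assumption about how such a lookup might work, not the site's actual code: `category_leaders` and the dictionary layout are hypothetical, and the data covers only the four models whose per-benchmark scores appear on this page.

```python
# Scores quoted on this page for the four models with published figures.
SCORES = {
    "GPT-5.5": {"SWE-bench": 68.7, "GPQA Diamond": 78.3, "MATH": 95.8},
    "Claude Opus 4.7": {"SWE-bench": 65.4, "GPQA Diamond": 76.5, "MATH": 93.1},
    "Gemini 2.5 Pro": {"SWE-bench": 59.4, "GPQA Diamond": 71.9, "MATH": 90.5},
    "GPT-4.5": {"SWE-bench": 56.1, "GPQA Diamond": 68.7, "MATH": 88.2},
}

# Category label -> benchmark used to decide it, as listed above.
CATEGORIES = {
    "Best at Coding": "SWE-bench",
    "Best at Reasoning": "GPQA Diamond",
    "Best at Math": "MATH",
}


def category_leaders(scores, categories):
    """Map each category label to the (model, score) with the top score."""
    leaders = {}
    for label, benchmark in categories.items():
        best = max(scores, key=lambda name: scores[name][benchmark])
        leaders[label] = (best, scores[best][benchmark])
    return leaders


LEADERS = category_leaders(SCORES, CATEGORIES)
for label, (model, score) in LEADERS.items():
    print(f"{label}: {model} ({score} / 100)")
```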

Incoming Models

What the rumor mill says is coming next. Treat everything here as unofficial unless linked directly to a lab announcement.

Claude Mythos · Anthropic

Private preview

In closed preview with 40 launch partners. Pitched as a step change in long horizon agentic reasoning. No public release date.

GPT-5.5 · OpenAI

Rumored Q2 2026

Multiple leaks describe a unified model that folds in reasoning, voice, and tool use by default. Training reportedly completed in late 2025.

Gemini 3.5 · Google

Expected 2026

Google has signaled the next Gemini generation at upcoming I/O. Focus areas are deep research agents, video understanding, and multi step tool use.

Grok 4 · xAI

Announced

xAI has publicly targeted a next generation Grok trained on the Memphis cluster. Elon Musk has framed it as 'smarter than any human.'

Llama 5 · Meta

Rumored late 2026

Meta has not confirmed timing, but internal comments suggest the next open weight Llama release will push past Llama 4 Maverick on reasoning.

Provider Spotlights

Each frontier lab is running a different strategy. Understanding the strategic posture helps predict the next move.

Anthropic

Anthropic has leaned hard into agentic coding as the wedge. Claude has become the default model for serious developer tools, which creates a revenue base that funds frontier training. The company pairs this with the most aggressive public stance on safety of any major lab. Responsible scaling commitments, detailed model cards, and constitutional AI research are all part of a single story: if you believe the most capable models are coming soon, your commercial strategy should be inseparable from your safety strategy.

OpenAI

OpenAI still owns the largest consumer surface area in AI. ChatGPT is the default chatbot for a huge fraction of the market, which creates data, revenue, and distribution. The o-series reasoning models were the first public bet on test time compute as a primary capability lever, and the results shifted how every other lab thinks about reasoning. Expect continued emphasis on multimodal, voice, and vertical integration through Operator, Sora, and Codex.

Google DeepMind

Google has structural advantages no one else has. Custom TPU infrastructure, a search index, YouTube, and decades of research depth at DeepMind. Gemini has closed most of the capability gap and owns the long context and multimodal categories. The real leverage is distribution: Gemini is shipping into every Google surface, from Search to Workspace to Android. Every Google user becomes a Gemini user by default.

xAI

xAI has compressed an enormous amount of capability into a short timeline. The Memphis training cluster came online with unusual speed, and Grok 3 closed most of the gap to the frontier. Distribution through X gives xAI a feedback loop that other labs do not have. The open question is how long the pace can be sustained.

Latest Model Wars News

Live stream of model release and frontier capability coverage. Filtered from the full TensorFeed news feed.

Frequently Asked Questions

Which AI model is the best in 2026?

There is no single best model. On this page's leaderboard, GPT-5.5 has the highest average score and leads the coding (SWE-bench), reasoning (GPQA Diamond), and math (MATH) categories. Claude Opus 4.7 ranks second overall and ships with a 1M context window. Gemini 2.5 Pro leads on price per token at long context. The right answer depends on the workload. See our benchmark leaderboard for category winners.

Who is winning the frontier AI race?

As of April 2026, Anthropic, OpenAI, and Google DeepMind are effectively tied at the top of the frontier, with each lab leading on specific categories. xAI has closed a lot of ground with Grok 3. Meta leads the open-weight race with Llama 4. DeepSeek has continued to punch above its weight on efficiency.

How often do new frontier models ship?

Frontier releases now land every four to eight weeks on average. Anthropic, OpenAI, and Google each ship a major update every quarter, interspersed with point releases. Open weight launches from Meta, Mistral, and DeepSeek add to the cadence.

What is a frontier model?

A frontier model is one of the most capable AI systems publicly available at a given moment, typically defined by benchmark performance, compute used during training, and agentic capability. The frontier moves constantly as new releases ship.

Which frontier model is cheapest?

Among frontier-tier models, Claude Haiku 4.5, Gemini Flash, and GPT-4o mini consistently offer the best price per token. For open weights, Llama 4 and DeepSeek V3 are close to free when you self-host.
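
Because input and output tokens are priced separately, comparing models on price usually means collapsing the two numbers into one blended figure. The sketch below shows one way to do that. The 3:1 input-to-output token mix, the `blended_price` and `cheapest` helpers, and the choice of models are all assumptions made for illustration; this page does not say how its own blended price is computed. Prices are taken from the leaderboard on this page.

```python
# (input, output) prices in USD per 1M tokens, from the leaderboard above.
PRICES = {
    "Claude Haiku 4.5": (0.80, 4.00),
    "Gemini 2.0 Flash": (0.10, 0.40),
    "Mistral Large": (2.00, 6.00),
    "GPT-4o": (2.50, 10.00),
}


def blended_price(input_price, output_price, input_share=0.75):
    """Weighted price per 1M tokens for an assumed input/output token mix.

    input_share=0.75 encodes the assumed 3:1 input-to-output ratio.
    """
    return input_share * input_price + (1 - input_share) * output_price


def cheapest(prices, input_share=0.75):
    """Return the model with the lowest blended price under the given mix."""
    return min(prices, key=lambda m: blended_price(*prices[m], input_share))


for model, (inp, out) in PRICES.items():
    print(f"{model}: ${blended_price(inp, out):.3f} per 1M blended tokens")
print("Cheapest:", cheapest(PRICES))
```

The ranking can shift with the assumed mix, so an output-heavy workload (a lower `input_share`) is worth checking separately before picking a model on price.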
