Skip to content
LIVE
OPUS 4.8$5 / $25per Mtok
GPT-5.5$5 / $30per Mtok
GEMINI 2.5 PRO$1.25 / $10per Mtok
SONNET 4.6$3 / $15per Mtok
SWE-BENCHleader GPT-5.568.7%
MMLU-PROleader GPT-5.594.2
GPQAleader GPT-5.578.3
AFTAv1.0 whitepaper live at /whitepaper
OPUS 4.8$5 / $25per Mtok
GPT-5.5$5 / $30per Mtok
GEMINI 2.5 PRO$1.25 / $10per Mtok
SONNET 4.6$3 / $15per Mtok
SWE-BENCHleader GPT-5.568.7%
MMLU-PROleader GPT-5.594.2
GPQAleader GPT-5.578.3
AFTAv1.0 whitepaper live at /whitepaper
All systems operational0 AI providers monitored, polled every 2 minutes
Live status

MiniMax is the Shanghai-based AI lab known for shipping open-weight models with aggressive long-context engineering. Its June 1, 2026 release, MiniMax M3, is built on MiniMax Sparse Attention (MSA), which replaces full attention with KV-block selection and cuts per-token compute at 1M context to roughly one twentieth of the previous generation. M3 takes text, image, and video input across a 1,048,576 token context window, reports 59% on SWE-Bench Pro and 83.5 on BrowseComp, and is priced at $0.30 per million input and $1.20 per million output, around 5 to 10 percent of the cost of proprietary flagships. The headline benchmark runs used MiniMax's own infrastructure and agent scaffolding, so independent verification is pending, and the open weights are due on Hugging Face within about ten days of launch.

Founded

2021

Headquarters

Shanghai, China

CEO

Yan Junjie

Models

1 active

Key Products

MiniMax M3MiniMax M2MiniMax API platformHailuo AI video

Strengths

  • Sparse attention long-context efficiency
  • Ultra-low pricing
  • Open-weight release cadence
  • Multimodal input at 1M context
  • Strong agentic coding claims

MiniMax Models

ModelInput / 1MOutput / 1MContextCapabilities
MiniMax M30.301.201.0Mtext, vision, video, code, tool-use

Prices per 1M tokens in USD. See the full pricing guide for detailed analysis.

Comparisons

Other Providers

Visit MiniMaxCompare ModelsAll Models