LIVE
OPUS 4.7: $15 / $75 per Mtok
SONNET 4.6: $3 / $15 per Mtok
GPT-5.5: $10 / $30 per Mtok
GEMINI 3.1: $3.50 / $10.50 per Mtok
SWE-BENCH leader: Claude Opus 4.7, 72.1%
MMLU-PRO leader: Opus 4.7, 88.4
VALS FINANCE leader: Opus 4.7, 64.4%
AFTA v1.0 whitepaper live at /whitepaper

Gemini 3.1 Flash-Lite

Budget

by Google

Gemini 3.1 Flash-Lite is Google's most cost-efficient model to date, released in preview on May 5, 2026. At $0.25 per million input tokens and $1.50 per million output tokens, it costs roughly half as much as Gemini 3 Flash while keeping a 1,048,576-token context window and reasoning support. Google positions it for high-volume developer workloads where latency, throughput, and price per task matter more than frontier reasoning.

Input Price

$0.25

per 1M tokens

Output Price

$1.50

per 1M tokens

Context Window

1.0M

tokens

Released

2026-05

API access

Capabilities

text, vision, tool-use, code, reasoning

Key Strengths

  • $0.25 per 1M input tokens
  • 1M-token context window
  • 2.5x faster time-to-first-token vs Gemini 2.5 Flash
  • Reasoning, vision, and tool use included

Best For

  • High-volume classification and extraction
  • Customer support routing
  • RAG over very long contexts
  • Batch document processing

Pricing Details

Input tokens

$0.25

per 1M tokens

Output tokens

$1.50

per 1M tokens

Estimated cost per 1K requests

$1.00

~1K input + ~500 output tokens avg

Prices are subject to change. Check the official documentation for current pricing. See the cost calculator for detailed estimates.
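
The per-1K-request estimate above can be reproduced with a few lines of arithmetic. The following is a minimal sketch, assuming the listed prices and the stated averages of about 1K input and 500 output tokens per request. The function name `estimate_cost` and the constants are illustrative and are not part of any official SDK or of this site's cost calculator.

```python
from decimal import Decimal

# Prices as listed on this page, in USD per 1M tokens (subject to change).
INPUT_PRICE_PER_MTOK = Decimal("0.25")
OUTPUT_PRICE_PER_MTOK = Decimal("1.50")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost(requests: int, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate USD cost for `requests` calls, given average tokens per call.

    Hypothetical helper for illustration only.
    """
    input_cost = Decimal(requests * input_tokens) / TOKENS_PER_MTOK * INPUT_PRICE_PER_MTOK
    output_cost = Decimal(requests * output_tokens) / TOKENS_PER_MTOK * OUTPUT_PRICE_PER_MTOK
    return input_cost + output_cost


if __name__ == "__main__":
    # 1,000 requests x 1,000 input tokens = 1M input tokens   -> $0.25
    # 1,000 requests x   500 output tokens = 0.5M output tokens -> $0.75
    cost = estimate_cost(requests=1_000, input_tokens=1_000, output_tokens=500)
    print(f"${cost:.2f}")  # $1.00
```

Output tokens account for three quarters of this estimate, so workloads with longer responses will scale in cost faster than the input price alone suggests.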
