Gemini 3.1 Flash-Lite
Budgetby Google
Gemini 3.1 Flash-Lite is Google's most cost-efficient model to date, released in preview on May 5, 2026. At $0.25 per million input tokens and $1.50 per million output tokens, it costs roughly half of Gemini 3 Flash while keeping a 1,048,576 token context window and reasoning support. Google positions it for high-volume developer workloads where latency, throughput, and price-per-task matter more than frontier reasoning.
Input Price
$0.25
per 1M tokens
Output Price
$1.50
per 1M tokens
Context Window
1.0M
tokens
Released
2026-05
API access
Capabilities
Key Strengths
- ✓$0.25 per 1M input tokens
- ✓1M token context window
- ✓2.5x faster time-to-first-token vs 2.5 Flash
- ✓Reasoning, vision, and tool use included
Best For
- ▸High-volume classification and extraction
- ▸Customer support routing
- ▸RAG over very long contexts
- ▸Batch document processing
Pricing Details
Input tokens
$0.25
per 1M tokens
Output tokens
$1.50
per 1M tokens
Estimated cost per 1K requests
$1.00
~1K input + ~500 output tokens avg
Prices are subject to change. Check the official documentation for current pricing. See the cost calculator for detailed estimates.