Gemini 3.5 Flash
Mid-tierby Google
Gemini 3.5 Flash went generally available on May 19, 2026 at Google I/O. At $1.50 per million input tokens and $9.00 per million output, with a 1,048,576 token input window and 65,536 output tokens, it is the first Flash-tier release that beats the previous Pro flagship (Gemini 3.1 Pro) on agentic coding suites (Terminal-Bench 2.1 at 76.2%, MCP Atlas at 83.6%, CharXiv Reasoning at 84.2%) while running roughly 4x faster at around 289 output tokens per second. Google positions it as the new daily-driver for coding agents and tool-use workflows where throughput and price-per-task matter.
Input Price
$1.50
per 1M tokens
Output Price
$9.00
per 1M tokens
Context Window
1.0M
tokens
Released
2026-05
API access
Capabilities
Key Strengths
- ✓Beats Gemini 3.1 Pro on agentic coding
- ✓1M token input context
- ✓Roughly 4x faster than prior frontier tier
- ✓Reasoning, vision, and tool use included
Best For
- ▸Agentic coding and SWE-style tool use
- ▸High-throughput RAG over long contexts
- ▸Live customer-facing assistants
- ▸Computer-use and MCP-driven agents
Pricing Details
Input tokens
$1.50
per 1M tokens
Output tokens
$9.00
per 1M tokens
Estimated cost per 1K requests
$6.00
~1K input + ~500 output tokens avg
Prices are subject to change. Check the official documentation for current pricing. See the cost calculator for detailed estimates.
Related Models
Gemini 2.5 Pro
Flagship$1.25 in / $10.00 out
Gemini 2.0 Flash
Budget$0.10 in / $0.40 out
Gemini 3.1 Flash-Lite
Budget$0.25 in / $1.50 out
Claude Sonnet 4.6
Mid-tierAnthropic
$3.00 in / $15.00 out
o3-mini
Mid-tierOpenAI
$1.10 in / $4.40 out
Llama 4 Maverick
Mid-tierMeta
Free
Nemotron 3 Nano Omni
Mid-tierNVIDIA
Free