MAI-Code-1-Flash
Budget · by Microsoft
MAI-Code-1-Flash is Microsoft's first in-house coding model, announced at Build 2026 on June 2, 2026. It targets fast, low-cost code generation inside GitHub Copilot, where it is rolling out as a model picker option across the Free, Pro, Pro+, and Max tiers. Microsoft says it uses roughly 60% fewer tokens than comparable models on hard tasks. GitHub lists it at $0.75 per million input tokens ($0.075 per million when cached) and $4.50 per million output tokens, undercutting Claude Haiku 4.5 on price-to-performance. It ships with a 256K-token context window and is distributed via Microsoft Foundry to third-party providers including Fireworks AI, Baseten, and OpenRouter. Pricing is listed as still being finalized, so treat these numbers as launch-window figures.
Input Price
$0.75
per 1M tokens
Output Price
$4.50
per 1M tokens
Context Window
256K
tokens
Released
2026-06
API access
Capabilities
Key Strengths
- ✓ 256K-token context window
- ✓ Cheap token-based billing ($0.75 input / $4.50 output per 1M tokens)
- ✓ Roughly 60% fewer tokens than comparable models on hard coding tasks, per Microsoft
- ✓ Native GitHub Copilot integration
- ✓ Available via Fireworks AI, Baseten, and OpenRouter
Best For
- ▸ High-volume code completion
- ▸ Copilot-style IDE assistance
- ▸ Refactoring sessions with long file context
- ▸ Budget agentic coding pipelines
Pricing Details
Input tokens
$0.75
per 1M tokens
Output tokens
$4.50
per 1M tokens
Estimated cost per 1K requests
$3.00
Assumes an average of ~1K input and ~500 output tokens per request
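The $3.00 estimate follows directly from the listed rates: 1,000 requests at ~1K input tokens each is 1M input tokens ($0.75), and 1,000 requests at ~500 output tokens each is 0.5M output tokens ($2.25). A minimal sketch of that arithmetic is below. The function name `estimate_cost` and the `cached_fraction` parameter are illustrative assumptions, not part of any official SDK, and the rates are the launch-window figures quoted on this page.

```python
# Launch-window list prices quoted on this page, in USD per 1M tokens.
# These may change; they are hard-coded here only for illustration.
INPUT_PER_M = 0.75
CACHED_INPUT_PER_M = 0.075
OUTPUT_PER_M = 4.50


def estimate_cost(requests, input_tokens, output_tokens, cached_fraction=0.0):
    """Estimate total USD cost for a batch of requests.

    requests        -- number of requests
    input_tokens    -- average input tokens per request
    output_tokens   -- average output tokens per request
    cached_fraction -- hypothetical share of input tokens billed at the
                       cached rate (0.0 to 1.0); an assumption for this sketch
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0.0 and 1.0")

    total_input = requests * input_tokens
    total_output = requests * output_tokens

    cached_input = total_input * cached_fraction
    fresh_input = total_input - cached_input

    cost_per_million_units = (
        fresh_input * INPUT_PER_M
        + cached_input * CACHED_INPUT_PER_M
        + total_output * OUTPUT_PER_M
    )
    return cost_per_million_units / 1_000_000


if __name__ == "__main__":
    # Reproduces the page's figure: 1K requests, ~1K input + ~500 output each.
    print(f"${estimate_cost(1000, 1000, 500):.2f}")
```

With no caching, this reproduces the $3.00 figure. If half of the input tokens were billed at the cached rate, the same workload would come to about $2.66, since output tokens account for most of the cost at these rates.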
Prices are subject to change. Check the official documentation for current pricing. See the cost calculator for detailed estimates.