MAI-Code-1-Flash

Budget

MAI-Code-1-Flash is Microsoft's first in-house coding model, announced at Build 2026 on June 2. It targets fast, cheap code generation inside GitHub Copilot, where it is rolling out across the Free, Pro, Pro+, and Max tiers as a model-picker option. Microsoft says it uses roughly 60% fewer tokens than comparable models on hard tasks. GitHub lists it at $0.75 per million input tokens ($0.075 for cached input) and $4.50 per million output tokens, undercutting Claude Haiku 4.5 on price-to-performance. It ships with a 256K-token context window and is available through Microsoft Foundry as well as third-party providers including Fireworks AI, Baseten, and OpenRouter. Pricing is listed as still being finalized, so treat the numbers as launch-window figures.
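To make the cached-input discount concrete, here is a minimal sketch of the blended input price at different cache-hit rates. It uses only the two input prices quoted above. The function name `effective_input_price` and the assumption that cached and uncached input tokens are billed as a simple weighted mix are illustrative, not taken from Microsoft or GitHub documentation.

```python
# Launch-window list prices quoted on this page, in USD per 1M input tokens.
INPUT_PRICE_PER_M = 0.75
CACHED_INPUT_PRICE_PER_M = 0.075


def effective_input_price(cache_hit_fraction: float) -> float:
    """Blended USD price per 1M input tokens.

    Assumption (not from the source): a fraction of input tokens is billed
    at the cached rate and the remainder at the standard rate.
    """
    if not 0.0 <= cache_hit_fraction <= 1.0:
        raise ValueError("cache_hit_fraction must be between 0 and 1")
    cached_part = cache_hit_fraction * CACHED_INPUT_PRICE_PER_M
    uncached_part = (1.0 - cache_hit_fraction) * INPUT_PRICE_PER_M
    return cached_part + uncached_part


if __name__ == "__main__":
    for fraction in (0.0, 0.5, 0.9):
        price = effective_input_price(fraction)
        print(f"{fraction:.0%} cached: ${price:.4f} per 1M input tokens")
```

Under these assumptions, workloads that resend the same long file context (for example, repeated refactoring passes) would see most of their input billed at the lower rate.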

Input Price: $0.75 per 1M tokens

Output Price: $4.50 per 1M tokens

Context Window: 256K tokens

Released: 2026-06

API access

Capabilities

text, code, tool-use

Key Strengths

✓ 256K-token context window
✓ Cheap token-based billing ($0.75 input / $4.50 output per 1M tokens)
✓ Roughly 60% fewer tokens than comparable models on hard coding tasks, per Microsoft
✓ Native GitHub Copilot integration
✓ Available via Fireworks AI, Baseten, and OpenRouter

Best For

▸ High-volume code completion
▸ Copilot-style IDE assistance
▸ Refactoring sessions with long file context
▸ Budget agentic coding pipelines

Pricing Details

Input tokens: $0.75 per 1M tokens

Output tokens: $4.50 per 1M tokens

Estimated cost per 1K requests: $3.00, assuming an average of ~1K input and ~500 output tokens per request
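The $3.00 estimate follows directly from the listed prices: 1K requests at ~1K input tokens each is 1M input tokens ($0.75), and 1K requests at ~500 output tokens each is 0.5M output tokens ($2.25). A small sketch of that arithmetic, where the function name `estimate_cost` is hypothetical and cached-input discounts are ignored:

```python
# Launch-window list prices quoted on this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.75
OUTPUT_PRICE_PER_M = 4.50


def estimate_cost(requests: int, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost for `requests` calls with the given average
    input and output token counts per call (no caching assumed)."""
    input_cost = requests * input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = requests * output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # 1K requests, ~1K input and ~500 output tokens each.
    print(f"${estimate_cost(1_000, 1_000, 500):.2f}")  # prints $3.00
```

Changing the token counts shows how quickly output-heavy workloads dominate the bill, since output tokens cost six times as much as input tokens at these prices.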

Prices are subject to change. Check the official documentation for current pricing. See the cost calculator for detailed estimates.

Related Models

Claude Haiku 4.5 (Anthropic, Budget): $0.80 in / $4.00 out

GPT-5.6 Luna (OpenAI, Budget): $1.00 in / $6.00 out

GPT-4o-mini (OpenAI, Budget): $0.15 in / $0.60 out

Gemini 2.0 Flash (Google, Budget): $0.10 in / $0.40 out
