Qwen3.7-Max
Flagship by Alibaba
Qwen3.7-Max is Alibaba's May 2026 flagship model. It landed on the Alibaba Cloud API on May 19 and was unveiled at the 2026 Cloud Summit on May 20. It is priced at $2.50 per million input tokens and $7.50 per million output tokens, with a 1,000,000-token context window and up to 65,536 output tokens. It includes an extended-thinking mode and is claimed to operate autonomously for up to 35 hours on long-horizon agentic tasks. At release it posted the top result on the public Artificial Analysis Intelligence Index, with a score of 57, and roughly 1,475 Elo on the LM Arena text leaderboard. Cached input drops to $0.25 per million tokens via OpenRouter, a 90 percent discount on the $2.50 input rate that makes repeated long-context calls substantially cheaper.
Input Price
$2.50
per 1M tokens
Output Price
$7.50
per 1M tokens
Context Window
1M
tokens
Released
2026-05
API access
Capabilities
Key Strengths
- ✓ 1M-token context window
- ✓ Top score on the public Artificial Analysis Intelligence Index (57, #1 at release)
- ✓ Extended thinking mode
- ✓ Claimed autonomous agentic operation for up to 35 hours
- ✓ 90 percent cache discount on input (via OpenRouter)
Best For
- ▸ Long-horizon agentic workflows
- ▸ Deep multi-step reasoning
- ▸ Agentic coding with sustained context
- ▸ Document and codebase analysis at 1M scale
Pricing Details
Input tokens
$2.50
per 1M tokens
Output tokens
$7.50
per 1M tokens
Estimated cost per 1K requests
$6.25
~1K input + ~500 output tokens avg
Prices are subject to change. Check the official documentation for current pricing. See the cost calculator for detailed estimates.
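The estimate above can be reproduced by hand: 1K requests at ~1K input tokens each is 1M input tokens ($2.50), and 1K requests at ~500 output tokens each is 0.5M output tokens ($3.75), for $6.25 in total. The sketch below shows that arithmetic, using only the prices listed on this page. The function name `estimate_cost` and the `cached_fraction` parameter are hypothetical and not part of any official SDK. The sketch also assumes the $0.25 cached-input rate quoted for OpenRouter is billed alongside the listed output rate, which may not match how a given provider actually bills.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from the listing above.
INPUT_PER_M = Decimal("2.50")
OUTPUT_PER_M = Decimal("7.50")
# Cached-input rate quoted for OpenRouter; applying it here is an assumption.
CACHED_INPUT_PER_M = Decimal("0.25")

MILLION = Decimal(1_000_000)


def estimate_cost(requests, input_tokens, output_tokens, cached_fraction="0"):
    """Return the estimated USD cost for a batch of requests.

    requests        -- number of API calls
    input_tokens    -- average input tokens per call
    output_tokens   -- average output tokens per call
    cached_fraction -- hypothetical share (0..1) of input tokens billed
                       at the cached rate
    """
    cached_fraction = Decimal(str(cached_fraction))
    total_in = Decimal(requests) * Decimal(input_tokens)
    total_out = Decimal(requests) * Decimal(output_tokens)
    cached_in = total_in * cached_fraction
    fresh_in = total_in - cached_in
    total = (
        fresh_in * INPUT_PER_M
        + cached_in * CACHED_INPUT_PER_M
        + total_out * OUTPUT_PER_M
    )
    return total / MILLION


if __name__ == "__main__":
    # The page's example: 1K requests, ~1K input and ~500 output tokens each.
    print(estimate_cost(1000, 1000, 500))
    # Same workload if every input token were served from cache.
    print(estimate_cost(1000, 1000, 500, cached_fraction="1"))
```

Decimal is used instead of float so that per-token prices multiply without rounding noise. For real budgeting, substitute measured token counts and the provider's current rates.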