Cheapest Active Model
Gemini 2.5 Flash-Lite
Runner-up: Gemini 3.1 Flash-Lite Preview ($0.25/$1.50)
In: $0.10 | Out: $0.40 per 1M tokens
(Example: 1M in + 1M out = $0.50 total)
Why This Model?
- Lowest cost on an actively supported model
- 1M token context window
- Includes thinking tokens in output
- High-volume, at-scale workloads
Best Overall Value
GPT-4o Mini
Runner-up: Gemini 2.5 Flash
In: $0.15 | Out: $0.60 per 1M tokens
(Example: 1M in + 1M out = $0.75 total)
Why This Model?
- GPT-4 quality at a fraction of the cost
- Multimodal: vision, audio, and text
- 128K context window
- Gemini 2.5 Flash: best if you need 1M context + reasoning
Most Capable
Claude Opus 4.6
Runner-up: Gemini 3.1 Pro Preview
In: $5.00 | Out: $25.00 per 1M tokens
(Example: 1M in + 1M out = $30 total)
Why This Model?
- Anthropic's latest and most powerful model
- 1M token context window (standard pricing)
- Best-in-class for complex reasoning & agentic tasks
- Gemini 3.1 Pro: strong multimodal alternative at $2/$12

