Related Models
Anthropic: Claude Fable 5, 2026-06-09
Anthropic: Claude Opus 4.8, 2026-05-27
Anthropic: Claude Opus 4.8 (Fast), 2026-05-27
Anthropic: Claude Opus 4.7 (Fast), 2026-05-12
Anthropic: Claude Opus 4.7, 2026-04-16
Claude Opus 4.7 (Non-reasoning, High Effort), 2026-04-16
Anthropic: Claude Opus 4.6 (Fast), 2026-04-07
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort), 2026-02-17
Pricing
Input: $15.00 per 1M tokens
Output: $75.00 per 1M tokens
Blended: $30.00 per 1M tokens
Cheaper than 2% of models. Median price is $0.54/1M tokens.
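The blended figure is consistent with a 3:1 weighting of input to output tokens. That ratio is an assumption, not something the page states; the sketch below, with a hypothetical `blended_price` helper, only shows that it reproduces the listed $30.00.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The 3:1 input:output default is an assumption, chosen because it
    reproduces the blended price listed on the page.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


print(blended_price(15.00, 75.00))  # 30.0
```

A workload with a different input/output mix can pass its own weights, for example `blended_price(15.00, 75.00, input_weight=1.0, output_weight=1.0)` for an even split.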
Cost Calculator
Tokens per day: 1M (range: 100K to 100M)
Daily: $30.00
Monthly: $900.00
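The calculator's numbers follow from simple arithmetic. A minimal sketch, assuming the blended rate of $30.00 per 1M tokens and a 30-day month (both inferred from the figures shown, not stated on the page); the function names are hypothetical.

```python
TOKENS_PER_MILLION = 1_000_000


def daily_cost(tokens_per_day: int, price_per_million: float) -> float:
    """Cost of one day's usage at a flat per-1M-token price."""
    return tokens_per_day / TOKENS_PER_MILLION * price_per_million


def monthly_cost(tokens_per_day: int, price_per_million: float,
                 days_per_month: int = 30) -> float:
    """Daily cost scaled to a month; the 30-day month is an assumption."""
    return daily_cost(tokens_per_day, price_per_million) * days_per_month


print(daily_cost(1_000_000, 30.00))    # 30.0
print(monthly_cost(1_000_000, 30.00))  # 900.0
```

Under the same assumptions, 100K and 100M tokens per day work out to roughly $3 and $3,000 per day.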
vs. Similar Models
GPT-5 mini (medium), Q: -0.1, $0.69 (-98%)
DeepSeek V4 Pro (Non-reasoning), Q: +0.2, $0.54 (-98%)
GPT-5 (low), Q: +0.2, $3.44 (-89%)
MiMo-V2-Flash (Reasoning), Q: +0.2, $0.15 (-100%)
Performance
34 tokens/sec, faster than 3% of models
10.22 seconds, faster than 12% of models
10.22 seconds, faster than 40% of models
Market Median: 94 tok/s (64% slower)
Median TTFT: 1.11s (819% slower)
Throughput/Dollar: 1 tok/s per $/1M
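The comparison figures can be approximately reproduced from the displayed values. This sketch assumes that "slower" means percent difference relative to the market median, that the first 10.22-second figure is time to first token (TTFT), and that Throughput/Dollar divides output speed by the blended price. None of these definitions is stated on the page, and `pct_diff` is a hypothetical helper.

```python
def pct_diff(value: float, baseline: float) -> float:
    """Percent difference of value relative to baseline."""
    return (value - baseline) / baseline * 100.0


# Output speed: 34 tok/s against a 94 tok/s market median.
speed_vs_median = pct_diff(34, 94)      # negative: lower throughput than median

# TTFT: 10.22 s against a 1.11 s median (assumed pairing, see lead-in).
ttft_vs_median = pct_diff(10.22, 1.11)  # positive: longer wait than median

# Throughput per dollar: tok/s divided by blended $ per 1M tokens (assumed definition).
throughput_per_dollar = 34 / 30.00

print(round(speed_vs_median))
print(round(ttft_vs_median))
print(round(throughput_per_dollar))
```

From these rounded inputs the speed gap comes out at about 64% and the TTFT gap at about 821%, slightly above the listed 819%; the page presumably computes from unrounded values.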
Speed Comparison
Gemma 3 4B Instruct, 34 tok/s (-0%)
OpenAI: o3 Pro, 34 tok/s (-1%)
Llama 3.1 Instruct 70B, 35 tok/s (+2%)
Benchmarks
MMLU-Pro: 87.3%
GPQA Diamond: 79.6%
HLE: 11.7%
LiveCodeBench: 63.6%
SciCode: 39.8%
TerminalBench Hard: 31.1%
MATH-500: 98.2%
AIME: 75.7%
AIME 2025: 73.3%
IFBench: 53.7%
Long Context Recall: 33.7%
Tau2: 73.4%