Related Models
Anthropic: Claude Fable 5, 2026-06-09
Anthropic: Claude Opus 4.8, 2026-05-27
Anthropic: Claude Opus 4.8 (Fast), 2026-05-27
Anthropic: Claude Opus 4.7 (Fast), 2026-05-12
Anthropic: Claude Opus 4.7, 2026-04-16
Claude Opus 4.7 (Non-reasoning, High Effort), 2026-04-16
Anthropic: Claude Opus 4.6 (Fast), 2026-04-07
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort), 2026-02-17
Pricing
Input
$15.00
per 1M tokens
Output
$75.00
per 1M tokens
Blended
$30.00
per 1M tokens
Cheaper than 2% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day: 1M (range 100K to 100M)
Daily
$30.00
Monthly
$900.00
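The pricing and calculator figures above are consistent with a blended price that weights input and output tokens 3:1, and with a 30-day month. Neither convention is stated on the page; both are assumptions inferred from the numbers. A minimal sketch under those assumptions (function names are illustrative):

```python
INPUT_PRICE = 15.00   # USD per 1M input tokens (from the Pricing section)
OUTPUT_PRICE = 75.00  # USD per 1M output tokens (from the Pricing section)


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens.

    The 3:1 input-to-output weighting is an assumption that happens to
    reproduce the listed blended price; the page does not state it.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def daily_cost(tokens_per_day, price_per_million):
    """Cost in USD for one day of usage at a flat per-1M-token price."""
    return tokens_per_day / 1_000_000 * price_per_million


def monthly_cost(tokens_per_day, price_per_million, days=30):
    """Cost in USD for a month; the 30-day month is an assumption."""
    return daily_cost(tokens_per_day, price_per_million) * days


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"Blended: ${blended:.2f} per 1M tokens")
print(f"Daily:   ${daily_cost(1_000_000, blended):.2f}")
print(f"Monthly: ${monthly_cost(1_000_000, blended):.2f}")
```

At 1M tokens per day this gives the $30.00 daily and $900.00 monthly values shown in the calculator. A workload with a different input/output mix would need different weights.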
vs. Similar Models
MiniMax: MiniMax M2.5 (Q: 0.0)
$0.21 (-99%)
Qwen: Qwen3.5 397B A17B (Q: 0.0)
$0.90 (-97%)
GPT-5 (medium) (Q: 0.0)
$3.44 (-89%)
Qwen: Qwen3.5-27B (Q: +0.1)
$0.54 (-98%)
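The percentages in this comparison appear to be each model's price relative to the $30.00 blended price, rounded to a whole percent. That reading is an assumption (the page does not define the column), sketched here with an illustrative helper:

```python
REFERENCE_PRICE = 30.00  # USD per 1M tokens, the blended price listed above

# Prices per 1M tokens as listed in the comparison.
similar_models = {
    "MiniMax: MiniMax M2.5": 0.21,
    "Qwen: Qwen3.5 397B A17B": 0.90,
    "GPT-5 (medium)": 3.44,
    "Qwen: Qwen3.5-27B": 0.54,
}


def price_delta_pct(other_price, reference_price=REFERENCE_PRICE):
    """Signed price difference relative to the reference, in whole percent."""
    return round((other_price - reference_price) / reference_price * 100)


for name, price in similar_models.items():
    print(f"{name}: ${price:.2f} ({price_delta_pct(price):+d}%)")
```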
Performance
40
tokens/sec
Faster than 8% of models
8.38
seconds
Faster than 13% of models
8.38
seconds
Faster than 42% of models
Market Median
94 tok/s
57% slower
Median TTFT
1.11s
652% slower
Throughput/Dollar
1
tok/s per $/1M
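The relative figures above can be approximated from the listed values. This sketch assumes that "slower" means the gap divided by the market median, that the 8.38-second figure compared against the median TTFT is itself a TTFT, and that throughput per dollar divides tokens/sec by the $30.00 blended price. None of these definitions is stated on the page.

```python
def throughput_pct_slower(value, median):
    """Percent by which a throughput falls below the median (higher is better)."""
    return (median - value) / median * 100


def latency_pct_slower(value, median):
    """Percent by which a latency exceeds the median (lower is better)."""
    return (value - median) / median * 100


def throughput_per_dollar(tokens_per_sec, price_per_million):
    """Tokens/sec obtained per USD of per-1M-token price."""
    return tokens_per_sec / price_per_million


print(round(throughput_pct_slower(40, 94)))   # throughput vs. 94 tok/s median
print(round(latency_pct_slower(8.38, 1.11)))  # latency vs. 1.11 s median TTFT
print(round(throughput_per_dollar(40, 30.00)))
```

The throughput and throughput-per-dollar results round to the listed 57% and 1. The latency formula yields roughly 655% on the displayed figures rather than the listed 652%; the gap may come from rounding of the displayed median, which the page does not explain.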
Speed Comparison
Hermes 4 - Llama-3.1 405B (Reasoning)
40 tok/s (-1%)
Qwen3.5 4B (Non-reasoning)
40 tok/s (-1%)
Microsoft: Phi 4
40 tok/s (-1%)
Benchmarks
MMLU-Pro
88.0%
GPQA Diamond
80.9%
HLE
11.9%
LiveCodeBench
65.4%
SciCode
40.9%
TerminalBench Hard
34.3%
MATH-500
Not evaluated
AIME
Not evaluated
AIME 2025
80.3%
IFBench
55.4%
Long Context Recall
66.3%
Tau2
71.4%