Related Models
Hermes 4 - Llama-3.1 70B (Reasoning): 2025-08-27
Hermes 4 - Llama-3.1 405B (Non-reasoning): 2025-08-27
Hermes 4 - Llama-3.1 70B (Non-reasoning): 2025-08-27
Nous: Hermes 4 405B: 2025-08-26
Nous: Hermes 4 70B: 2025-08-26
Nous: Hermes 3 70B Instruct: 2024-08-18
Nous: Hermes 3 405B Instruct: 2024-08-16
Nous: Hermes 3 405B Instruct (free): 2024-08-16
Pricing
Input: $1.00 per 1M tokens
Output: $3.00 per 1M tokens
Blended: $1.50 per 1M tokens
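The blended figure is consistent with a weighted average of the input and output prices at a 3:1 input-to-output token ratio. That ratio is an assumption; the page does not state how the blend is computed. A minimal sketch, with the hypothetical helper `blended_price`:

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The 3:1 default weighting is an assumption chosen because it
    reproduces the listed blended price; it is not stated on the page.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


# Listed prices: $1.00 input, $3.00 output (per 1M tokens).
print(f"${blended_price(1.00, 3.00):.2f} per 1M tokens")
```

A workload with a different input/output mix can pass its own weights to get a more representative figure.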
Cheaper than 29% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day: 1M (range: 100K to 100M)
Daily: $1.50
Monthly: $45.00
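The calculator figures can be reproduced by multiplying daily token volume by the blended price. This sketch assumes a 30-day month, which matches the listed monthly cost but is not stated on the page; the function names are hypothetical.

```python
def daily_cost(tokens_per_day: int, price_per_million: float) -> float:
    """Cost in dollars for one day of usage at a per-1M-token price."""
    return tokens_per_day / 1_000_000 * price_per_million


def monthly_cost(tokens_per_day: int, price_per_million: float,
                 days_per_month: int = 30) -> float:
    """Cost in dollars for one month; 30 days is an assumption."""
    return daily_cost(tokens_per_day, price_per_million) * days_per_month


# 1M tokens per day at the $1.50 blended price.
print(f"Daily:   ${daily_cost(1_000_000, 1.50):.2f}")
print(f"Monthly: ${monthly_cost(1_000_000, 1.50):.2f}")
```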
vs. Similar Models
ERNIE 4.5 300B A47B: $0.48 (-68%), Q: 0.0
Ministral 3 8B: $0.15 (-90%), Q: 0.0
NVIDIA Nemotron Nano 12B v2 VL (Reasoning): $0.30 (-80%), Q: 0.0
GLM-4.5V (Reasoning): $0.90 (-40%), Q: +0.1
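The percentages in this comparison appear to be each model's price relative to the $1.50 blended price of the model on this page. A short sketch of that calculation, assuming this baseline (the page does not name it explicitly) and using a hypothetical helper `percent_difference`:

```python
def percent_difference(other_price: float, baseline_price: float) -> float:
    """Signed percentage by which other_price differs from baseline_price."""
    return (other_price - baseline_price) / baseline_price * 100


BASELINE = 1.50  # blended price per 1M tokens listed on this page

similar_models = {
    "ERNIE 4.5 300B A47B": 0.48,
    "Ministral 3 8B": 0.15,
    "NVIDIA Nemotron Nano 12B v2 VL (Reasoning)": 0.30,
    "GLM-4.5V (Reasoning)": 0.90,
}

for name, price in similar_models.items():
    print(f"{name}: ${price:.2f} ({percent_difference(price, BASELINE):+.0f}%)")
```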
Performance
Throughput: 40 tokens/sec (faster than 8% of models)
Time to first token (TTFT): 0.78 seconds (faster than 66% of models)
51.01 seconds (faster than 6% of models)
Market median: 94 tok/s (this model is 58% slower)
Median TTFT: 1.11s (this model is 30% faster)
Throughput/Dollar: 27 tok/s per $/1M
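The throughput-per-dollar figure matches throughput divided by the blended price per 1M tokens. The sketch below assumes that definition, which the page does not spell out; `throughput_per_dollar` is a hypothetical name.

```python
def throughput_per_dollar(tokens_per_second: float, price_per_million: float) -> float:
    """Tokens per second obtained per dollar of per-1M-token price."""
    return tokens_per_second / price_per_million


# 40 tok/s at the $1.50 blended price; the page rounds to a whole number.
value = throughput_per_dollar(40, 1.50)
print(f"{value:.0f} tok/s per $/1M")
```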
Speed Comparison
Qwen3.5 4B (Non-reasoning): 40 tok/s (-1%)
Microsoft: Phi 4: 40 tok/s (-1%)
Claude 4.1 Opus (Reasoning): 40 tok/s (+1%)
Benchmarks
MMLU-Pro: 82.9%
GPQA Diamond: 72.7%
HLE: 10.3%
LiveCodeBench: 68.6%
SciCode: 25.2%
TerminalBench Hard: 11.4%
MATH-500: Not evaluated
AIME: Not evaluated
AIME 2025: 69.7%
IFBench: 32.7%
Long Context Recall: 20.7%
Tau2: 22.2%