Skip to main content
Back to Explore

Granite 4.0 H Small

IBM·Released 2025-09-22
Open Source

Pricing

Input

$0.06

per 1M tokens

Output

$0.25

per 1M tokens

Blended

$0.11

per 1M tokens

Cheaper than 82% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M
100K100M

Daily

$0.11

Monthly

$3.21

vs. Similar Models

Jamba 1.7 LargeQ:+0.1
$3.50+3171%
Jamba 1.5 LargeQ:-0.1
$3.50+3171%
Nous: Hermes 3 70B InstructQ:-0.1
$0.70+554%
Qwen3 8B (Non-reasoning)Q:-0.1
$0.18+73%

Performance

441

tokens/sec

Faster than 99% of models

8.72

seconds

Faster than 13% of models

8.72

seconds

Faster than 42% of models

Market Median

94 tok/s

367% faster

Median TTFT

1.11s

684% slower

Throughput/Dollar

4117

tok/s per $/1M

Speed Comparison

HyperNova 60B 2605
417 tok/s-5%
StepFun: Step 3.7 Flash
400 tok/s-9%
LFM2.5-VL-1.6B
508 tok/s+15%

Benchmarks

MMLU-Pro
62.4%
GPQA Diamond
41.6%
HLE
3.7%
LiveCodeBench
25.1%
SciCode
20.9%
TerminalBench Hard
2.3%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025
13.7%
IFBench
31.5%
Long Context Recall
9.0%
Tau2
17.3%
Market AverageTop Score

Open Source

Quick Compare

Similar Models

Compare all 7 models