Loading...
Loading...
Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks...
Input
$0.05
per 1M tokens
Output
$0.10
per 1M tokens
Blended
$0.06
per 1M tokens
Cheaper than 88% of models. Median price is $0.56/1M tokens.
Daily
$0.06
Monthly
$1.88
110
tokens/sec
Faster than 62% of models
0.41
seconds
Faster than 91% of models
0.41
seconds
Faster than 94% of models
Market Median
86 tok/s
28% faster
Median TTFT
1.07s
62% faster
Throughput/Dollar
1755
tok/s per $/1M
Speed Comparison
Context Window
131K
tokens
Larger than 33% of models
Max Output
131K
tokens
100% of context
Quality Index
12.4
384th of 507
Top 76%
Coding Index
7.3
354th of 417
Top 85%
Price/1M
$0.06
76th cheapest
89% below median
Top 12%
Speed
110 tok/s
Top 38%
TTFT
0.41s
Context Window
131K
201st largest
Top 67%