About
Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks...
Pricing
Input: $0.05 per 1M tokens
Output: $0.10 per 1M tokens
Blended: $0.06 per 1M tokens
Cheaper than 88% of models. Median price is $0.54/1M tokens.
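The listing does not say how the blended price is weighted. As a rough sketch, assuming a 3:1 input-to-output token ratio (an assumption, not stated on this page; `blended_price` is a hypothetical helper), the weighted average of the input and output prices comes to $0.0625 per 1M tokens, which is consistent with the $0.06 shown after rounding.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The default 3:1 weighting is an assumption for illustration only.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


# Listed prices per 1M tokens: $0.05 input, $0.10 output.
price = blended_price(0.05, 0.10)
print(f"${price:.4f} per 1M tokens")
```

A different weighting shifts the result: a 1:1 mix of the same prices would give $0.075 per 1M tokens.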
Cost Calculator
Daily: $0.06
Monthly: $1.88
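The calculator's inputs did not survive on this page, so the workload behind these figures is unknown. A minimal sketch, assuming a hypothetical workload of 750K input and 250K output tokens per day and a 30-day month (both assumptions), comes to about $0.0625 per day and $1.875 per month, in line with the listed figures after rounding. The function names are illustrative.

```python
# Prices per 1M tokens, taken from the listing.
INPUT_PRICE = 0.05
OUTPUT_PRICE = 0.10


def daily_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost in dollars for one day's token usage."""
    input_cost = (input_tokens / 1_000_000) * INPUT_PRICE
    output_cost = (output_tokens / 1_000_000) * OUTPUT_PRICE
    return input_cost + output_cost


def monthly_cost(input_tokens: int, output_tokens: int, days: int = 30) -> float:
    """Daily cost scaled to a month; the 30-day month is an assumption."""
    return daily_cost(input_tokens, output_tokens) * days


# Hypothetical workload: 750K input + 250K output tokens per day.
daily = daily_cost(750_000, 250_000)
monthly = monthly_cost(750_000, 250_000)
print(f"daily ~ ${daily:.4f}, monthly ~ ${monthly:.3f}")
```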
Performance
119 tokens/sec (faster than 62% of models)
0.47 seconds (faster than 89% of models)
0.47 seconds (faster than 93% of models)
Market Median: 94 tok/s (this model is 27% faster)
Median TTFT: 1.10s (this model is 58% faster)
Throughput/Dollar: 1897 tok/s per $/1M
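The page does not define how these comparisons are computed. The sketch below assumes the simplest definitions: throughput gain as the ratio to the median minus one, latency gain as the fraction of the median time saved, and throughput per dollar as tokens/sec divided by the blended price per 1M tokens. It also assumes the 0.47-second figure is the time to first token and that the unrounded blended price is $0.0625. With the rounded values shown on the page it gives roughly 26.6%, 57.3%, and 1904, close to the listed 27%, 58%, and 1897; the small gaps presumably come from the page using unrounded measurements.

```python
def percent_faster_throughput(model_tps: float, median_tps: float) -> float:
    """How much higher the model's throughput is than the median, in percent."""
    return (model_tps / median_tps - 1) * 100


def percent_lower_latency(model_seconds: float, median_seconds: float) -> float:
    """How much shorter the model's latency is than the median, in percent."""
    return (1 - model_seconds / median_seconds) * 100


def throughput_per_dollar(tps: float, price_per_million: float) -> float:
    """Tokens/sec per dollar of blended price per 1M tokens."""
    return tps / price_per_million


# Rounded figures from the listing; 0.0625 is an assumed unrounded blended price.
speed_gain = percent_faster_throughput(119, 94)
ttft_gain = percent_lower_latency(0.47, 1.10)
value = throughput_per_dollar(119, 0.0625)
print(f"{speed_gain:.1f}% faster, {ttft_gain:.1f}% lower TTFT, {value:.0f} tok/s per $/1M")
```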
Context Window: 131K tokens (larger than 27% of models)
Max Output: 131K tokens (100% of context)