About
Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featuring 128 experts with 8 active per token. Engineered for efficient reasoning over long contexts (131k) with robust function...
Related Models
Pricing
Input
$0.04
per 1M tokens
Output
$0.15
per 1M tokens
Blended
$0.07
per 1M tokens
Cheaper than 87% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day1M
100K100M
Daily
$0.07
Monthly
$2.14
vs. Similar Models
Meta: Llama 3.2 1B Instruct
$0.07-1%
NVIDIA Nemotron Nano 9B V2 (Reasoning)
$0.07-2%
Llama 3 Instruct 8B
$0.07-2%
NVIDIA Nemotron Nano 9B v2
$0.07-2%
Performance
Context Window
131K
tokens
Larger than 27% of models
Max Output
131K
tokens
100% of context
Context Window Comparison
DeepSeek: DeepSeek V3.2
131KSame
OpenAI: gpt-oss-120b
131KSame
MoonshotAI: Kimi K2 0711
131KSame