About
MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mixture-of-Experts (MoE) model with 309B total parameters and 15B active parameters, and it adopts a hybrid attention architecture. MiMo-V2-Flash supports a...
Pricing
Input: $0.10 per 1M tokens
Output: $0.30 per 1M tokens
Blended: $0.15 per 1M tokens
Cheaper than 75% of models. Median price is $0.54/1M tokens.
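The blended price is consistent with a weighted average of the input and output prices at a 3:1 input-to-output token ratio. The page does not state its weighting, so the ratio below is an assumption, and `blended_price` is an illustrative name rather than anything the site exposes. A minimal sketch:

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The 3:1 default weighting is an assumption; the page does not state it.
    """
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


price = blended_price(0.10, 0.30)
print(f"${price:.2f} per 1M tokens")  # $0.15 per 1M tokens
```

A workload with a different input/output mix can pass its own weights; for example, an even 1:1 mix gives $0.20 per 1M tokens.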
Cost Calculator
Daily: $0.15
Monthly: $4.50
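The calculator figures match 1M tokens per day at the blended price over a 30-day month. Both the daily volume and the month length are inferred from the numbers rather than stated on the page, so treat them as assumptions. `estimate_cost` is a hypothetical helper for trying other volumes:

```python
BLENDED_PRICE_PER_1M = 0.15  # USD per 1M tokens, from the Pricing section


def estimate_cost(tokens_per_day: int,
                  price_per_1m: float = BLENDED_PRICE_PER_1M,
                  days_per_month: int = 30) -> tuple[float, float]:
    """Return (daily, monthly) cost in USD for a given daily token volume.

    The 30-day month is an assumption inferred from the page's figures.
    """
    daily = tokens_per_day / 1_000_000 * price_per_1m
    return daily, daily * days_per_month


daily, monthly = estimate_cost(1_000_000)
print(f"Daily: ${daily:.2f}, Monthly: ${monthly:.2f}")  # Daily: $0.15, Monthly: $4.50
```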
Performance
92 tokens/sec (faster than 49% of models)
1.88 seconds (faster than 24% of models)
1.88 seconds (faster than 51% of models)
Market Median: 94 tok/s (this model is 2% slower)
Median TTFT: 1.10s (this model is 70% slower)
Throughput/Dollar: 614 tok/s per $/1M
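The comparison figures can be rechecked from the rounded values shown on the page. The sketch below assumes that percent differences are taken relative to the median and that throughput per dollar is tokens/sec divided by the blended price; the page states neither formula, and the function names are illustrative.

```python
def percent_slower_throughput(model_tps: float, median_tps: float) -> float:
    """Percent by which the model's throughput falls below the median."""
    return (median_tps - model_tps) / median_tps * 100


def percent_slower_latency(model_seconds: float, median_seconds: float) -> float:
    """Percent by which the model's latency exceeds the median."""
    return (model_seconds - median_seconds) / median_seconds * 100


def throughput_per_dollar(tokens_per_sec: float, price_per_1m: float) -> float:
    """Tokens/sec per dollar of blended price (assumed definition)."""
    return tokens_per_sec / price_per_1m


print(f"{percent_slower_throughput(92, 94):.1f}% slower throughput")
print(f"{percent_slower_latency(1.88, 1.10):.1f}% slower TTFT")
print(f"{throughput_per_dollar(92, 0.15):.0f} tok/s per $/1M")
```

With these rounded inputs the results come out to roughly 2.1%, 70.9%, and 613, which lines up with the displayed 2%, 70%, and 614 if the page computes from unrounded measurements.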
Context Window
262K tokens (larger than 62% of models)
Max Output: 66K tokens (25% of context)
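These two limits can be combined to size a request before sending it. The sketch below assumes the rounded "262K" and "66K" stand for 262,144 and 65,536 tokens (which gives exactly 25%), and that prompt and output tokens share the same context window. Neither assumption is stated on the page, and `max_completion_tokens` is a hypothetical helper.

```python
CONTEXT_WINDOW = 262_144  # assumed exact value behind "262K"
MAX_OUTPUT = 65_536       # assumed exact value behind "66K"


def max_completion_tokens(prompt_tokens: int) -> int:
    """Largest output budget that fits, or 0 if the prompt alone is too long.

    Assumes prompt and output tokens count against one shared window.
    """
    remaining = CONTEXT_WINDOW - prompt_tokens
    return max(0, min(MAX_OUTPUT, remaining))


print(MAX_OUTPUT / CONTEXT_WINDOW)     # 0.25
print(max_completion_tokens(10_000))   # 65536
print(max_completion_tokens(250_000))  # 12144
```

Short prompts are capped by the output limit, while prompts near the window size are capped by the remaining context.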