About
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...
Related Models
Pricing
Input
$0.09
per 1M tokens
Output
$0.18
per 1M tokens
Blended
$0.11
per 1M tokens
Cheaper than 82% of models. Median price is $0.54/1M tokens.
Cost Calculator
Daily
$0.11
Monthly
$3.38
vs. Similar Models
Performance
112
tokens/sec
Faster than 60% of models
0.99
seconds
Faster than 57% of models
51.22
seconds
Faster than 6% of models
Market Median
95 tok/s
18% faster
Median TTFT
1.11s
11% faster
Throughput/Dollar
993
tok/s per $/1M
Speed Comparison
Context Window
1.0M
tokens
Larger than 90% of models
Max Output
66K
tokens
6% of context