Llama 3.2 Instruct 1B

Meta · Released 2024-09-25
Open Source · Multimodal

Pricing

Input: $0.05 per 1M tokens
Output: $0.05 per 1M tokens
Blended: $0.05 per 1M tokens

Cheaper than 90% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day: 1M (slider range: 100K to 100M)
Daily: $0.05
Monthly: $1.50
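
The calculator figures above can be reproduced with a short sketch. It assumes the blended price of $0.05 per 1M tokens and a 30-day month (an assumption, chosen because it matches $1.50 = 30 × $0.05); the function name `estimate_cost` and its parameters are hypothetical, not part of any published API.

```python
def estimate_cost(tokens_per_day: float,
                  price_per_million: float = 0.05,
                  days_per_month: int = 30) -> tuple[float, float]:
    """Return (daily, monthly) cost in dollars.

    price_per_million is the blended price listed on this page;
    days_per_month = 30 is an assumption, not a documented value.
    """
    daily = tokens_per_day / 1_000_000 * price_per_million
    monthly = daily * days_per_month
    return daily, monthly


# Default calculator setting: 1M tokens per day.
daily, monthly = estimate_cost(1_000_000)
print(f"Daily: ${daily:.2f}, Monthly: ${monthly:.2f}")
```

Passing a different `tokens_per_day` (for example `100_000_000` for the top of the slider range) scales both figures linearly.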

vs. Similar Models

Gemma 3 4B Instruct | Q: 0.0 | $0.05 | 0%
Gemma 3n E4B Instruct | Q: +0.1 | $0.03 | -50%
Llama 3 Instruct 8B | Q: +0.1 | $0.07 | +40%
Apertus 8B Instruct | Q: -0.1 | $0.13 | +150%

Performance

86 tokens/sec (faster than 45% of models)
0.60 seconds (faster than 78% of models)
0.60 seconds (faster than 85% of models)

Market Median: 94 tok/s (this model is 9% slower)
Median TTFT: 1.07s (this model is 44% faster)
Throughput/Dollar: 1716 tok/s per $/1M
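
As a rough check on the comparisons above, the sketch below recomputes them from the rounded values shown on this page. The helper name `percent_difference` is hypothetical. With rounded inputs the throughput-per-dollar figure comes out near 1720, not 1716; the page presumably uses unrounded measurements, which is an assumption.

```python
def percent_difference(value: float, reference: float) -> float:
    """Signed difference of value relative to reference, in percent."""
    return (value - reference) / reference * 100


# Rounded figures taken from this page.
speed_tok_s = 86          # output speed
median_tok_s = 94         # market median speed
ttft_s = 0.60             # time to first token
median_ttft_s = 1.07      # median TTFT
blended_price = 0.05      # dollars per 1M tokens

speed_gap = percent_difference(speed_tok_s, median_tok_s)   # negative: slower
ttft_gap = percent_difference(ttft_s, median_ttft_s)        # negative: lower TTFT, i.e. faster
throughput_per_dollar = speed_tok_s / blended_price         # tok/s per $/1M

print(f"Speed vs. median: {speed_gap:.0f}%")
print(f"TTFT vs. median: {ttft_gap:.0f}%")
print(f"Throughput/Dollar: {throughput_per_dollar:.0f}")
```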

Speed Comparison

Mistral: Mistral Medium 3.1 | 85 tok/s | -1%
MiMo-V2-Omni | 86 tok/s | +1%
Ring-flash-2.0 | 86 tok/s | +1%

Benchmarks

MMLU-Pro: 20.0%
GPQA Diamond: 19.6%
HLE: 5.3%
LiveCodeBench: 1.9%
SciCode: 1.7%
TerminalBench Hard: 0.0%
MATH-500: 14.0%
AIME: 0.0%
AIME 2025: 0.0%
IFBench: 22.8%
Long Context Recall: 5.0%
Tau2: 0.0%