Skip to main content
Back to Explore

Llama 3.2 Instruct 11B (Vision)

Meta·Released 2024-09-25
Open SourceMultimodal

Pricing

Input

$0.24

per 1M tokens

Output

$0.24

per 1M tokens

Blended

$0.24

per 1M tokens

Cheaper than 67% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M
100K100M

Daily

$0.24

Monthly

$7.35

vs. Similar Models

Gemma 3 12B InstructQ:+0.1
$0.14-43%
Llama 3 Instruct 70BQ:+0.2
$1.18+380%
Microsoft: Phi 4 Mini InstructQ:-0.3
$0.15-40%
Command-R+ (Apr '24)Q:-0.3
$6.00+2349%

Performance

87

tokens/sec

Faster than 46% of models

0.49

seconds

Faster than 87% of models

0.49

seconds

Faster than 91% of models

Market Median

94 tok/s

8% slower

Median TTFT

1.12s

56% faster

Throughput/Dollar

353

tok/s per $/1M

Speed Comparison

OpenAI: GPT-5.2
87 tok/s-0%
Llama 3.2 Instruct 1B
87 tok/s+0%
Ring-flash-2.0
86 tok/s-0%

Benchmarks

MMLU-Pro
46.4%
GPQA Diamond
22.1%
HLE
5.2%
LiveCodeBench
11.0%
SciCode
11.2%
TerminalBench Hard
0.8%
MATH-500
51.6%
AIME
9.3%
AIME 2025
1.7%
IFBench
30.4%
Long Context Recall
11.7%
Tau2
14.6%
Market AverageTop Score

Open Source

Quick Compare

Similar Models

Compare all 7 models