Skip to main content
Back to Explore

Meta: Llama 3.1 8B Instruct

Meta·Released 2024-07-23
Open Source8B131K ctxLlama 3.1

About

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...

Pricing

Input

$0.02

per 1M tokens

Output

$0.03

per 1M tokens

Blended

$0.02

per 1M tokens

Cheaper than 91% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M
100K100M

Daily

$0.02

Monthly

$0.67

vs. Similar Models

Mistral: Mistral Nemo
$0.020%
Qwen3.5 0.8B
$0.02-11%
Qwen3.5 0.8B (Non-reasoning)
$0.02-11%
Gemma 3n E4B Instruct
$0.03+11%

Performance

Context Window

131K

tokens

Larger than 27% of models

Max Output

16K

tokens

13% of context

Context Window Comparison

DeepSeek: DeepSeek V3.2
131KSame
OpenAI: gpt-oss-120b
131KSame
MoonshotAI: Kimi K2 0711
131KSame
llama3.18BGGUF / GPTQ / AWQ
Downloads

1.5M

Likes

2.3K

VRAM (FP16)

8-16 GB

GPU

RTX 4070 / M2 Pro

Quick Compare

Similar Models

Compare all 7 models

Used by Agents