
Llama 3.1 8B Instruct

Meta·Released 2024-07-18
Open Source · 8B · 16K ctx · Llama 3.1 · Multimodal

Pricing

Input

$0.02

per 1M tokens

Output

$0.05

per 1M tokens

Blended

$0.03

per 1M tokens

Cheaper than 91% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day: 1M (range: 100K to 100M)

Daily

$0.03

Monthly

$0.82
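
The calculator figures above can be approximated with a short sketch. The page does not state how the blended price is weighted or how many days it counts per month, so the 3:1 input-to-output token mix (`input_share=0.75`) and the 30-day month used here are assumptions, and the function names are illustrative only. Under those assumptions the result lands close to the listed $0.03 daily and $0.82 monthly.

```python
# Prices taken from the listing, in USD per 1M tokens.
INPUT_PRICE = 0.02
OUTPUT_PRICE = 0.05


def blended_price(input_price, output_price, input_share=0.75):
    """Weighted average price per 1M tokens.

    input_share is an ASSUMED fraction of traffic that is input tokens;
    the listing does not say which mix it uses.
    """
    return input_share * input_price + (1 - input_share) * output_price


def estimate_cost(tokens_per_day, days=30, input_share=0.75):
    """Return (daily, monthly) cost in USD for a given daily token volume.

    days=30 is an assumption about how the page defines a month.
    """
    per_million = blended_price(INPUT_PRICE, OUTPUT_PRICE, input_share)
    daily = tokens_per_day / 1_000_000 * per_million
    return daily, daily * days


daily, monthly = estimate_cost(1_000_000)
print(f"daily ${daily:.4f}, monthly ${monthly:.3f}")
```

Changing `input_share` shows how sensitive the estimate is to the traffic mix: an output-heavy workload moves the blended price toward $0.05.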

vs. Similar Models

Gemma 3n E4B Instruct: $0.03 (-9%)
Llama 3.1 8B: $0.02 (-18%)
Mistral: Mistral Nemo: $0.02 (-18%)
Qwen3.5 0.8B: $0.02 (-27%)

Performance

Context Window

16K

tokens

Larger than 5% of models

Max Output

16K

tokens

100% of context

Context Window Comparison

phi 4: 16K (same)
Reka Edge: 16K (same)
OpenAI: GPT-3.5 Turbo: 16K (1.0x)
llama3.1 · 8B · GGUF / GPTQ / AWQ

Downloads

10.1M

Likes

6.2K

VRAM (FP16)

8-16 GB

GPU

RTX 4070 / M2 Pro
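
As a rough check on the VRAM range above, the memory needed for the weights alone is about the parameter count times the bytes per parameter. The sketch below assumes a nominal 8 billion parameters and ignores the KV cache, activations, and framework overhead, so real usage will be higher. `weight_memory_gb` is a hypothetical helper, not part of any library.

```python
def weight_memory_gb(params_billions, bytes_per_param):
    """Approximate memory for model weights only, in decimal GB.

    Ignores KV cache, activations, and runtime overhead (assumption).
    """
    return params_billions * 1e9 * bytes_per_param / 1e9


# Nominal 8B parameters (assumed round figure from the model name).
fp16_gb = weight_memory_gb(8, 2)  # 2 bytes per parameter at FP16
int8_gb = weight_memory_gb(8, 1)  # 1 byte per parameter at 8-bit quantization

print(f"FP16 weights: {fp16_gb:.0f} GB, 8-bit weights: {int8_gb:.0f} GB")
```

Under these assumptions the FP16 weights sit at the top of the listed range and an 8-bit quantization at the bottom.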
