About
Meta's latest class of models (Llama 3.1) launched in a variety of sizes and flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...
Pricing
Input: $0.02 per 1M tokens
Output: $0.03 per 1M tokens
Blended: $0.02 per 1M tokens
Cheaper than 91% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day: 1M (range 100K to 100M)
Daily: $0.02
Monthly: $0.67
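The calculator's arithmetic can be sketched in a few lines of Python. This is a minimal sketch, not the page's actual implementation: the function names, the 30-day month, and the 750K/250K input/output split are all assumptions made for illustration. Only the two per-token prices come from the listing.

```python
# Prices taken from the listing, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.02
OUTPUT_PRICE_PER_M = 0.03


def daily_cost(input_tokens: int, output_tokens: int) -> float:
    """USD cost of one day's traffic, pricing input and output separately."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def monthly_cost(input_tokens: int, output_tokens: int, days: int = 30) -> float:
    """Daily cost scaled to a month. The 30-day month is an assumption."""
    return daily_cost(input_tokens, output_tokens) * days


if __name__ == "__main__":
    # Hypothetical split of 1M tokens per day: 750K input, 250K output.
    print(f"daily:   ${daily_cost(750_000, 250_000):.4f}")
    print(f"monthly: ${monthly_cost(750_000, 250_000):.4f}")
```

With that assumed split, the daily cost is $0.0225 and the 30-day cost is $0.675, in line with the rounded figures shown above. The listing does not state which input/output mix it uses, so a different split will shift the result slightly.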
vs. Similar Models
Mistral: Mistral Nemo ($0.02, 0%)
Qwen3.5 0.8B ($0.02, -11%)
Qwen3.5 0.8B (Non-reasoning) ($0.02, -11%)
Gemma 3n E4B Instruct ($0.03, +11%)
Performance
Context Window: 131K tokens (larger than 27% of models)
Max Output: 16K tokens (13% of context)
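The relationship between the two limits can be checked with a short Python sketch. It assumes that "131K" and "16K" mean 131,072 and 16,384 tokens, and that prompt and completion share a single context window. Both are assumptions, since the listing gives only rounded figures and does not describe how the provider enforces the limits. The helper names are hypothetical.

```python
CONTEXT_WINDOW = 131_072  # assumption: "131K" means 128 * 1024 tokens
MAX_OUTPUT = 16_384       # assumption: "16K" means 16 * 1024 tokens


def output_share(context_window: int, max_output: int) -> float:
    """Fraction of the context window that the maximum output can occupy."""
    return max_output / context_window


def max_prompt_tokens(context_window: int, reserved_output: int) -> int:
    """Tokens left for the prompt after reserving room for the completion."""
    if reserved_output > context_window:
        raise ValueError("reserved output exceeds the context window")
    return context_window - reserved_output


if __name__ == "__main__":
    print(f"output share: {output_share(CONTEXT_WINDOW, MAX_OUTPUT):.1%}")
    print(f"prompt budget: {max_prompt_tokens(CONTEXT_WINDOW, MAX_OUTPUT)} tokens")
```

Under these assumptions the output limit is 12.5% of the window, which the listing rounds to 13%, and a request that reserves the full output allowance leaves 114,688 tokens for the prompt.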
Context Window Comparison
DeepSeek: DeepSeek V3.2 (131K, same)
OpenAI: gpt-oss-120b (131K, same)
MoonshotAI: Kimi K2 0711 (131K, same)
Open Source
llama3.1, 8B, GGUF / GPTQ / AWQ
Downloads: 1.5M
Likes: 2.3K
VRAM (FP16): 8-16 GB
GPU: RTX 4070 / M2 Pro
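A rough way to sanity-check the VRAM figure is to estimate the memory needed for the weights alone at different precisions. The sketch below is a back-of-the-envelope estimate, not the method the listing uses: it assumes a nominal 8 billion parameters, counts 1 GB as 10^9 bytes, and ignores the KV cache, activations, and runtime overhead. The function name is hypothetical.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Memory for the model weights alone, in GB (10**9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


N_PARAMS = 8e9  # nominal "8B"; the exact parameter count is an assumption

if __name__ == "__main__":
    # Bits per weight for FP16 and for typical 8-bit and 4-bit quantization.
    for label, bits in [("FP16", 16), ("8-bit", 8), ("4-bit", 4)]:
        print(f"{label}: {weight_memory_gb(N_PARAMS, bits):.1f} GB")
```

By this estimate the weights take about 16 GB at FP16 and about 8 GB at 8-bit, which brackets the 8-16 GB range above. Real usage will be higher once the KV cache for long contexts is included, so quantized formats such as GGUF, GPTQ, or AWQ are the likely route on GPUs with less memory.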