Pricing
Input: $0.02 per 1M tokens
Output: $0.05 per 1M tokens
Blended: $0.03 per 1M tokens
Cheaper than 91% of models. Median price is $0.54/1M tokens.
Cost Calculator
Tokens per day: 1M (adjustable from 100K to 100M)
Daily: $0.03
Monthly: $0.82
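The calculator's arithmetic can be sketched as below. This is a minimal sketch, not the page's actual formula: the function names, the 75/25 input/output split used for the blended price, and the 30-day month are all assumptions. The page does not state how it weights input against output tokens or how many days it counts per month, so the monthly figure printed here may differ from the $0.82 shown above by a cent or so.

```python
def token_cost(tokens: float, price_per_million: float) -> float:
    """Dollar cost of `tokens` tokens at a price quoted per 1M tokens."""
    return tokens / 1_000_000 * price_per_million


def blended_price(input_price: float, output_price: float, input_share: float) -> float:
    """Weighted average of input and output prices (both per 1M tokens)."""
    return input_share * input_price + (1 - input_share) * output_price


INPUT_PRICE = 0.02   # from the Pricing section, $ per 1M tokens
OUTPUT_PRICE = 0.05  # from the Pricing section, $ per 1M tokens

INPUT_SHARE = 0.75     # assumption: 3 input tokens per output token
DAYS_PER_MONTH = 30    # assumption: the page does not say what it uses
TOKENS_PER_DAY = 1_000_000

blended = blended_price(INPUT_PRICE, OUTPUT_PRICE, INPUT_SHARE)
daily = token_cost(TOKENS_PER_DAY, blended)
monthly = daily * DAYS_PER_MONTH

print(f"blended: ${blended:.4f} per 1M tokens")
print(f"daily:   ${daily:.4f}")
print(f"monthly: ${monthly:.4f}")
```

Changing TOKENS_PER_DAY to any value in the 100K to 100M range reproduces the slider; the cost scales linearly with the token count.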
vs. Similar Models
Gemma 3n E4B Instruct    $0.03    -9%
Llama 3.1 8B             $0.02    -18%
Mistral: Mistral Nemo    $0.02    -18%
Qwen3.5 0.8B             $0.02    -27%
Performance
Context Window: 16K tokens (larger than 5% of models)
Max Output: 16K tokens (100% of context)
Context Window Comparison
phi 4                    16K    Same
Reka Edge                16K    Same
OpenAI: GPT-3.5 Turbo    16K    1.0x
Open Source
llama3.1 | 8B | GGUF / GPTQ / AWQ
Downloads: 10.1M
Likes: 6.2K
VRAM (FP16): 8-16 GB
GPU: RTX 4070 / M2 Pro
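A rough way to sanity-check the VRAM figure is to multiply the parameter count by the bytes stored per parameter. The sketch below assumes the "8B" tag means roughly 8 billion parameters and counts weights only; the KV cache, activations, and framework overhead are ignored, so real usage will be higher. The function name is hypothetical.

```python
def weights_memory_gb(params_billions: float, bytes_per_param: float) -> float:
    """Approximate memory for model weights only, in GB (1 GB = 10^9 bytes).

    params_billions * 1e9 parameters * bytes_per_param / 1e9 bytes per GB
    simplifies to params_billions * bytes_per_param.
    """
    return params_billions * bytes_per_param


PARAMS_BILLIONS = 8  # assumption: read from the "8B" tag

fp16_gb = weights_memory_gb(PARAMS_BILLIONS, 2)    # 16-bit weights, 2 bytes each
int8_gb = weights_memory_gb(PARAMS_BILLIONS, 1)    # 8-bit quantized weights
int4_gb = weights_memory_gb(PARAMS_BILLIONS, 0.5)  # 4-bit quantized weights

print(f"FP16:  {fp16_gb:.1f} GB")
print(f"8-bit: {int8_gb:.1f} GB")
print(f"4-bit: {int4_gb:.1f} GB")
```

Under these assumptions, FP16 weights alone land at the top of the listed 8-16 GB range, and the lower end corresponds to 8-bit weights. That is one possible reading of the range, not something the page states.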