Skip to main content
Back to Explore

Apertus 8B Instruct

Swiss AI Initiative·Released 2025-09-02

Pricing

Input

$0.10

per 1M tokens

Output

$0.20

per 1M tokens

Blended

$0.13

per 1M tokens

Cheaper than 80% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M
100K100M

Daily

$0.13

Monthly

$3.75

vs. Similar Models

Qwen3 0.6B (Non-reasoning)Q:0.0
$0.19+50%
Gemma 3 4B InstructQ:+0.1
$0.05-60%
Llama 3.2 Instruct 1BQ:+0.1
$0.05-60%
Gemma 3n E4B InstructQ:+0.2
$0.03-80%

Performance

148

tokens/sec

Faster than 73% of models

1.88

seconds

Faster than 24% of models

1.88

seconds

Faster than 51% of models

Market Median

94 tok/s

58% faster

Median TTFT

1.10s

70% slower

Throughput/Dollar

1186

tok/s per $/1M

Speed Comparison

Qwen3.5 122B A10B
148 tok/s+0%
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)
148 tok/s-0%
GPT-5 nano (minimal)
149 tok/s+0%

Benchmarks

MMLU-ProNot evaluated
GPQA Diamond
25.6%
HLE
5.0%
LiveCodeBenchNot evaluated
SciCode
4.1%
TerminalBench Hard
0.0%
MATH-500Not evaluated
AIMENot evaluated
AIME 2025Not evaluated
IFBench
22.4%
Long Context Recall
0.0%
Tau2
11.4%
Market AverageTop Score

Quick Compare

Similar Models

Compare all 7 models