Apertus 8B Instruct

Swiss AI Initiative·Released 2025-09-02

Compare

Related Models

Apertus 70B Instruct2025-09-02

Quality Index

1.0

530th of 537

Coding Index

1.4

429th of 447

Top 97%

Price/1M

$0.13

130th cheapest

77% below median

Top 20%

Speed

148 tok/s

Top 27%

TTFT

1.88s

Market Position

Apertus 8B InstructMarket Average

Pricing

Input

$0.10

per 1M tokens

Output

$0.20

per 1M tokens

Blended

$0.13

per 1M tokens

Cheaper than 80% of models. Median price is $0.54/1M tokens.

Cost Calculator

Tokens per day1M

100K100M

Daily

$0.13

Monthly

$3.75

vs. Similar Models

Qwen3 0.6B (Non-reasoning)Q:0.0

$0.19+50%

Gemma 3 4B InstructQ:+0.1

$0.05-60%

Llama 3.2 Instruct 1BQ:+0.1

$0.05-60%

Gemma 3n E4B InstructQ:+0.2

$0.03-80%

Performance

148

tokens/sec

Faster than 73% of models

1.88

seconds

Faster than 24% of models

1.88

seconds

Faster than 51% of models

Market Median

94 tok/s

57% faster

Median TTFT

1.11s

68% slower

Throughput/Dollar

1186

tok/s per $/1M

Speed Comparison

Mistral Small 4 (Non-reasoning)

149 tok/s+0%

GLM-4.7-Flash (Non-reasoning)

148 tok/s-0%

Qwen3.6 35B A3B (Non-reasoning)

149 tok/s+0%

Benchmarks

MMLU-ProNot evaluated

GPQA Diamond

25.6%

HLE

5.0%

LiveCodeBenchNot evaluated

SciCode

4.1%

TerminalBench Hard

0.0%

MATH-500Not evaluated

AIMENot evaluated

AIME 2025Not evaluated

IFBench

22.4%

Long Context Recall

0.0%

Tau2

11.4%

Market AverageTop Score

Quick Compare

Similar Models

Gemma 3 1B Instruct

Google

Q: 1.0N/A/1M

Slower: 63%

Gemma 3n E2B Instruct

Google

Q: 1.0N/A/1M

Slower: 57%

LFM2.5-VL-1.6B

Liquid AI

Q: 1.0N/A/1M

Faster: 243%

Granite 4.0 350M

IBM

Q: 1.0N/A/1M

Granite 4.0 H 350M

IBM

Q: 1.0N/A/1M

Tiny Aya Global

Cohere

Q: 1.0N/A/1M

Compare all 7 models