Claude 2.1

Anthropic · Released 2023-11-21
Multimodal

Benchmarks

MMLU-Pro: 49.5%
GPQA Diamond: 31.9%
HLE: 4.2%
LiveCodeBench: 19.5%
SciCode: 18.4%
TerminalBench Hard: Not evaluated
MATH-500: 37.4%
AIME: 3.3%
AIME 2025: Not evaluated
IFBench: Not evaluated
Long Context Recall: Not evaluated
Tau2: Not evaluated