benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
articles
head to head
Model comparisons
190 matchups across 51 models. Click any to see the full breakdown.
Claude Opus 4.6
Anthropic
0
-
10
10 benchmarks
Claude Mythos Preview
Anthropic
Claude Opus 4.6
Anthropic
4
-
4
9 benchmarks
GPT-5.4
OpenAI
Claude Opus 4.6
Anthropic
6
-
3
9 benchmarks
Gemini 3.1 Pro
Google
Claude Opus 4.6
Anthropic
7
-
2
9 benchmarks
Kimi K2.5
Moonshot AI
GPT-5.4
OpenAI
9
-
0
9 benchmarks
Kimi K2.5
Moonshot AI
Gemini 3.1 Pro
Google
9
-
0
9 benchmarks
Kimi K2.5
Moonshot AI
Kimi K2.5
Moonshot AI
8
-
1
9 benchmarks
Qwen3.5 27B
Alibaba
Claude Opus 4.6
Anthropic
6
-
2
8 benchmarks
Qwen3.5 27B
Alibaba
GPT-5.4
OpenAI
3
-
5
8 benchmarks
Gemini 3.1 Pro
Google
GPT-5.4
OpenAI
0
-
8
8 benchmarks
Claude Mythos Preview
Anthropic
Claude Opus 4.6
Anthropic
7
-
0
7 benchmarks
GLM-5
Zhipu AI
Claude Opus 4.6
Anthropic
5
-
2
7 benchmarks
Step-3.5-Flash
StepFun
Claude Opus 4.6
Anthropic
5
-
2
7 benchmarks
Qwen 3.6 Plus
Alibaba
Claude Opus 4.6
Anthropic
7
-
0
7 benchmarks
Claude Sonnet 4.6
Anthropic
Gemini 3.1 Pro
Google
7
-
0
7 benchmarks
Qwen3.5 27B
Alibaba
Gemini 3.1 Pro
Google
5
-
1
7 benchmarks
Muse Spark
Meta
Gemini 3.1 Pro
Google
7
-
0
7 benchmarks
GLM-5
Zhipu AI
Gemini 3.1 Pro
Google
7
-
0
7 benchmarks
Gemma 4 31B
Google
Kimi K2.5
Moonshot AI
0
-
7
7 benchmarks
Claude Mythos Preview
Anthropic
Kimi K2.5
Moonshot AI
3
-
4
7 benchmarks
Step-3.5-Flash
StepFun
Kimi K2.5
Moonshot AI
1
-
6
7 benchmarks
Qwen 3.6 Plus
Alibaba
Kimi K2.5
Moonshot AI
2
-
5
7 benchmarks
Claude Sonnet 4.6
Anthropic
Kimi K2.5
Moonshot AI
3
-
4
7 benchmarks
Qwen 3.5 397B
Alibaba
Qwen3.5 27B
Alibaba
0
-
7
7 benchmarks
Qwen 3.5 397B
Alibaba
Claude Opus 4.6
Anthropic
4
-
2
6 benchmarks
Gemma 4 31B
Google
Claude Opus 4.6
Anthropic
5
-
1
6 benchmarks
GLM-5.1
Zhipu AI
Claude Opus 4.6
Anthropic
4
-
2
6 benchmarks
Qwen 3.5 397B
Alibaba
Claude Opus 4.6
Anthropic
4
-
2
6 benchmarks
Grok 4
xAI
Claude Opus 4.6
Anthropic
6
-
0
6 benchmarks
GLM-4.7-Flash
Zhipu AI
Claude Opus 4.6
Anthropic
6
-
0
6 benchmarks
Sarvam 105B
Sarvam AI
GPT-5.4
OpenAI
6
-
0
6 benchmarks
Qwen3.5 27B
Alibaba
GPT-5.4
OpenAI
4
-
2
6 benchmarks
Muse Spark
Meta
Gemini 3.1 Pro
Google
0
-
6
6 benchmarks
Claude Mythos Preview
Anthropic
Gemini 3.1 Pro
Google
6
-
0
6 benchmarks
Step-3.5-Flash
StepFun
Gemini 3.1 Pro
Google
5
-
1
6 benchmarks
Qwen 3.6 Plus
Alibaba
Gemini 3.1 Pro
Google
5
-
1
6 benchmarks
GLM-5.1
Zhipu AI
Gemini 3.1 Pro
Google
6
-
0
6 benchmarks
Claude Sonnet 4.6
Anthropic
Gemini 3.1 Pro
Google
6
-
0
6 benchmarks
Qwen 3.5 397B
Alibaba
Kimi K2.5
Moonshot AI
2
-
4
6 benchmarks
GLM-5
Zhipu AI
Kimi K2.5
Moonshot AI
6
-
0
6 benchmarks
Gemma 4 31B
Google
Kimi K2.5
Moonshot AI
2
-
4
6 benchmarks
GLM-5.1
Zhipu AI
Kimi K2.5
Moonshot AI
3
-
3
6 benchmarks
Seed 2.0 Pro
ByteDance
Kimi K2.5
Moonshot AI
5
-
1
6 benchmarks
Seed 2.0 Lite
ByteDance
Kimi K2.5
Moonshot AI
6
-
0
6 benchmarks
GLM-4.7-Flash
Zhipu AI
Kimi K2.5
Moonshot AI
6
-
0
6 benchmarks
Sarvam 105B
Sarvam AI
Claude Mythos Preview
Anthropic
6
-
0
6 benchmarks
GLM-5.1
Zhipu AI
Qwen3.5 27B
Alibaba
5
-
1
6 benchmarks
Gemma 4 31B
Google
Qwen3.5 27B
Alibaba
2
-
4
6 benchmarks
Step-3.5-Flash
StepFun
Qwen3.5 27B
Alibaba
0
-
6
6 benchmarks
Qwen 3.6 Plus
Alibaba
Qwen3.5 27B
Alibaba
1
-
5
6 benchmarks
Claude Sonnet 4.6
Anthropic
Qwen3.5 27B
Alibaba
5
-
1
6 benchmarks
GLM-4.7-Flash
Zhipu AI
Qwen3.5 27B
Alibaba
6
-
0
6 benchmarks
Sarvam 105B
Sarvam AI
Step-3.5-Flash
StepFun
1
-
5
6 benchmarks
Qwen 3.6 Plus
Alibaba
Step-3.5-Flash
StepFun
2
-
4
6 benchmarks
Claude Sonnet 4.6
Anthropic
Step-3.5-Flash
StepFun
2
-
4
6 benchmarks
Qwen 3.5 397B
Alibaba
Step-3.5-Flash
StepFun
6
-
0
6 benchmarks
Sarvam 105B
Sarvam AI
Qwen 3.5 397B
Alibaba
6
-
0
6 benchmarks
Sarvam 105B
Sarvam AI
Seed 2.0 Pro
ByteDance
5
-
1
6 benchmarks
Seed 2.0 Lite
ByteDance
Claude Opus 4.6
Anthropic
3
-
2
5 benchmarks
Seed 2.0 Pro
ByteDance
Claude Opus 4.6
Anthropic
3
-
2
5 benchmarks
Arcee Trinity
Arcee AI
Claude Opus 4.6
Anthropic
3
-
2
5 benchmarks
Seed 2.0 Lite
ByteDance
GPT-5.4
OpenAI
4
-
1
5 benchmarks
GLM-5
Zhipu AI
GPT-5.4
OpenAI
3
-
2
5 benchmarks
GLM-5.1
Zhipu AI
GPT-5.4
OpenAI
4
-
1
5 benchmarks
Claude Sonnet 4.6
Anthropic
Gemini 3.1 Pro
Google
4
-
1
5 benchmarks
Seed 2.0 Pro
ByteDance
Gemini 3.1 Pro
Google
4
-
1
5 benchmarks
Arcee Trinity
Arcee AI
Gemini 3.1 Pro
Google
5
-
0
5 benchmarks
Grok 4
xAI
Gemini 3.1 Pro
Google
5
-
0
5 benchmarks
Seed 2.0 Lite
ByteDance
Gemini 3.1 Pro
Google
5
-
0
5 benchmarks
GLM-4.7-Flash
Zhipu AI
Gemini 3.1 Pro
Google
5
-
0
5 benchmarks
Sarvam 105B
Sarvam AI
Kimi K2.5
Moonshot AI
0
-
5
5 benchmarks
Muse Spark
Meta
Kimi K2.5
Moonshot AI
3
-
2
5 benchmarks
Arcee Trinity
Arcee AI
Kimi K2.5
Moonshot AI
4
-
1
5 benchmarks
Grok 4
xAI
Claude Mythos Preview
Anthropic
5
-
0
5 benchmarks
Qwen3.5 27B
Alibaba
Claude Mythos Preview
Anthropic
5
-
0
5 benchmarks
GLM-5
Zhipu AI
Claude Mythos Preview
Anthropic
5
-
0
5 benchmarks
Claude Sonnet 4.6
Anthropic
Qwen3.5 27B
Alibaba
0
-
5
5 benchmarks
GLM-5
Zhipu AI
Qwen3.5 27B
Alibaba
0
-
5
5 benchmarks
Seed 2.0 Pro
ByteDance
Qwen3.5 27B
Alibaba
3
-
2
5 benchmarks
Arcee Trinity
Arcee AI
Qwen3.5 27B
Alibaba
2
-
3
5 benchmarks
Grok 4
xAI
Qwen3.5 27B
Alibaba
1
-
4
5 benchmarks
Seed 2.0 Lite
ByteDance
GLM-5
Zhipu AI
4
-
1
5 benchmarks
Step-3.5-Flash
StepFun
GLM-5
Zhipu AI
0
-
5
5 benchmarks
Qwen 3.6 Plus
Alibaba
GLM-5
Zhipu AI
1
-
3
5 benchmarks
GLM-5.1
Zhipu AI
GLM-5
Zhipu AI
1
-
4
5 benchmarks
Claude Sonnet 4.6
Anthropic
GLM-5
Zhipu AI
5
-
0
5 benchmarks
GLM-4.7-Flash
Zhipu AI
Gemma 4 31B
Google
0
-
5
5 benchmarks
Qwen 3.6 Plus
Alibaba
Gemma 4 31B
Google
0
-
5
5 benchmarks
Qwen 3.5 397B
Alibaba
Gemma 4 31B
Google
2
-
3
5 benchmarks
Arcee Trinity
Arcee AI
Step-3.5-Flash
StepFun
2
-
3
5 benchmarks
GLM-5.1
Zhipu AI
Step-3.5-Flash
StepFun
0
-
5
5 benchmarks
Seed 2.0 Pro
ByteDance
Step-3.5-Flash
StepFun
4
-
1
5 benchmarks
Arcee Trinity
Arcee AI
Step-3.5-Flash
StepFun
3
-
2
5 benchmarks
Seed 2.0 Lite
ByteDance
Step-3.5-Flash
StepFun
5
-
0
5 benchmarks
GLM-4.7-Flash
Zhipu AI
Qwen 3.6 Plus
Alibaba
4
-
1
5 benchmarks
Claude Sonnet 4.6
Anthropic
Qwen 3.6 Plus
Alibaba
5
-
0
5 benchmarks
Qwen 3.5 397B
Alibaba
Qwen 3.6 Plus
Alibaba
3
-
2
5 benchmarks
Seed 2.0 Pro
ByteDance
Qwen 3.6 Plus
Alibaba
3
-
2
5 benchmarks
Arcee Trinity
Arcee AI
Qwen 3.6 Plus
Alibaba
5
-
0
5 benchmarks
Grok 4
xAI
Qwen 3.6 Plus
Alibaba
5
-
0
5 benchmarks
Seed 2.0 Lite
ByteDance