benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
articles
head to head
Claude Opus 4.6
vs
Qwen3.5 27B
8 shared benchmarks
6
wins
0
ties
2
wins
100.0%
AIME
90.83%
84.0%
BrowseComp
61.0%
91.3%
GPQA Diamond
85.5%
53.1%
Humanity's Last Exam
24.3%
76.0%
LiveCodeBench
80.7%
82.0%
MMLU Pro
86.1%
72.7%
OSWorld
56.2%
81.42%
SWE-bench Verified
72.4%