benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
articles
head to head
Qwen3.5 27B
vs
Gemma 4 31B
6 shared benchmarks
5
wins
0
ties
1
wins
90.83%
AIME
89.2%
85.5%
GPQA Diamond
84.3%
24.3%
Humanity's Last Exam
19.5%
80.7%
LiveCodeBench
80.0%
86.1%
MMLU Pro
85.2%
75.0%
MMMU Pro
76.9%