Gemma 4 31B
8 benchmarks
Benchmark               Rank        Score
AIME                    #30 of 39   89.2%
τ2-bench                #5 of 5     86.4%
MMLU Pro                #11 of 29   85.2%
GPQA Diamond            #30 of 49   84.3%
LiveCodeBench           #14 of 28   80.0%
MMMU Pro                #7 of 15    76.9%
MRCR v2                 #4 of 5     66.4%
Humanity's Last Exam    #22 of 24   19.5%