benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
articles
head to head
Gemini 3.1 Pro
vs
Grok 4
5 shared benchmarks
5
wins
0
ties
0
wins
98.13%
AIME
94.0%
94.3%
GPQA Diamond
88.0%
44.4%
Humanity's Last Exam
24.0%
90.99%
MMLU Pro
87.0%
80.6%
SWE-bench Verified
58.6%