benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
articles
head to head
Claude Opus 4.6
vs
Gemini 3.1 Pro
9 shared benchmarks
6
wins
0
ties
3
wins
100.0%
AIME
98.13%
85.0%
ARC-AGI 2
77.1%
84.0%
BrowseComp
85.9%
91.3%
GPQA Diamond
94.3%
53.1%
Humanity's Last Exam
44.4%
82.0%
MMLU Pro
90.99%
93.0%
MRCR v2
84.9%
81.42%
SWE-bench Verified
80.6%
81.8%
Terminal-Bench 2.0
80.2%