benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
articles
head to head
GPT-5.2
vs
GPT-5.4
3 shared benchmarks
1
wins
0
ties
2
wins
92.4%
GPQA Diamond
92.8%
27.8%
Humanity's Last Exam
36.24%
80.0%
SWE-bench Verified
77.2%