benchmark.space
Head to head: GLM-4.7 vs GLM-5
4 shared benchmarks: GLM-4.7 wins 2, ties 0, GLM-5 wins 2
Benchmark              GLM-4.7   GLM-5
AIME                   95.7%     92.7%
GPQA Diamond           85.7%     86.0%
Humanity's Last Exam   42.8%     30.5%
SWE-bench Verified     73.8%     77.8%
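
The win and tie counts can be checked by comparing the two scores on each shared benchmark. The following is a minimal sketch, assuming that higher is better on every benchmark and that the first score in each pair belongs to GLM-4.7 and the second to GLM-5. The names `SCORES` and `tally` are illustrative and not part of the site.

```python
# Scores as listed on the page, in percent: (GLM-4.7, GLM-5).
# Assumption: the first value of each pair is GLM-4.7, the second GLM-5.
SCORES = {
    "AIME": (95.7, 92.7),
    "GPQA Diamond": (85.7, 86.0),
    "Humanity's Last Exam": (42.8, 30.5),
    "SWE-bench Verified": (73.8, 77.8),
}


def tally(scores):
    """Return (first_wins, ties, second_wins), assuming higher is better."""
    first_wins = ties = second_wins = 0
    for first, second in scores.values():
        if first > second:
            first_wins += 1
        elif second > first:
            second_wins += 1
        else:
            ties += 1
    return first_wins, ties, second_wins


print(tally(SCORES))  # (2, 0, 2)
```

A plain count like this weights every benchmark equally and ignores the size of each gap, so a 0.3-point lead on GPQA Diamond counts the same as a 12.3-point lead on Humanity's Last Exam.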