benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
articles
head to head
GLM-4.7-Flash
vs
GPT-5.3 Codex
3 shared benchmarks
0
wins
0
ties
3
wins
75.2%
GPQA Diamond
91.5%
14.4%
Humanity's Last Exam
39.9%
59.2%
SWE-bench Verified
80.0%