benchmark.space
Head to head: Claude Mythos Preview vs GLM-4.7

3 shared benchmarks
Claude Mythos Preview: 3 wins
Ties: 0
GLM-4.7: 0 wins

Benchmark               Claude Mythos Preview   GLM-4.7
GPQA Diamond            94.6%                   85.7%
Humanity's Last Exam    56.8%                   42.8%
SWE-bench Verified      93.9%                   73.8%
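
The win/tie tally can be reproduced from the three score pairs. The following is a minimal sketch, assuming that the higher score on a benchmark counts as a win and equal scores count as a tie; the page does not state its rule. The names `scores`, `tally`, and `margins` are hypothetical and used only for illustration.

```python
# Scores in percent, copied from the comparison:
# (Claude Mythos Preview, GLM-4.7) for each shared benchmark.
scores = {
    "GPQA Diamond": (94.6, 85.7),
    "Humanity's Last Exam": (56.8, 42.8),
    "SWE-bench Verified": (93.9, 73.8),
}


def tally(scores):
    """Return (first-model wins, ties, second-model wins).

    Assumption: the higher score wins and equal scores tie.
    """
    first = ties = second = 0
    for a, b in scores.values():
        if a > b:
            first += 1
        elif b > a:
            second += 1
        else:
            ties += 1
    return first, ties, second


def margins(scores):
    """Return the per-benchmark gap in percentage points (first minus second)."""
    return {name: round(a - b, 1) for name, (a, b) in scores.items()}


print(tally(scores))
for name, gap in margins(scores).items():
    print(f"{name}: {gap:+.1f} points")
```

Under that assumption the tally agrees with the 3 wins, 0 ties, 0 wins shown on the page, and the gaps are given in percentage points rather than relative percent.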