benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
papers
articles
head to head
DeepSeek V4 Flash
vs
GLM-5
7 shared benchmarks
6
wins
0
ties
1
wins
94.8%
AIME
92.7%
73.2%
BrowseComp
75.9%
88.1%
GPQA Diamond
86.0%
34.8%
Humanity's Last Exam
30.5%
78.7%
MRCR v2
26.3%
79.0%
SWE-bench Verified
77.8%
56.9%
Terminal-Bench 2.0
56.2%