Head to head: DASD-4B-Thinking vs GLM-5.1
2 shared benchmarks
DASD-4B-Thinking: 0 wins
Ties: 0
GLM-5.1: 2 wins
Benchmark       DASD-4B-Thinking   GLM-5.1
AIME            83.3%              95.3%
GPQA Diamond    68.4%              86.2%
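
The win and tie counts can be reproduced from the two benchmark scores. The following is a minimal sketch, assuming that a higher percentage is better on both benchmarks and that a "win" means the strictly higher score on a shared benchmark; the `tally` function and the `scores` layout are hypothetical names used for illustration, not anything the page itself defines.

```python
# Scores copied from the comparison above, in percent.
scores = {
    "AIME": {"DASD-4B-Thinking": 83.3, "GLM-5.1": 95.3},
    "GPQA Diamond": {"DASD-4B-Thinking": 68.4, "GLM-5.1": 86.2},
}


def tally(scores, a, b):
    """Return (wins for a, ties, wins for b) over the shared benchmarks.

    Assumption: the higher score wins; equal scores count as a tie.
    """
    wins_a = ties = wins_b = 0
    for result in scores.values():
        if result[a] > result[b]:
            wins_a += 1
        elif result[a] < result[b]:
            wins_b += 1
        else:
            ties += 1
    return wins_a, ties, wins_b


print(tally(scores, "DASD-4B-Thinking", "GLM-5.1"))  # prints (0, 0, 2)
```

Under these assumptions the result matches the summary: 0 wins for DASD-4B-Thinking, 0 ties, and 2 wins for GLM-5.1.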