benchmark.space
Head to head: DASD-4B-Thinking vs GPT-5.2
2 shared benchmarks: DASD-4B-Thinking 0 wins, 0 ties, GPT-5.2 2 wins
AIME: DASD-4B-Thinking 83.3%, GPT-5.2 100.0%
GPQA Diamond: DASD-4B-Thinking 68.4%, GPT-5.2 92.4%
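
The win/tie tally can be reproduced from the listed scores. The following is a minimal sketch, assuming that a higher percentage is better and that a "win" means a strictly higher score on a shared benchmark; the `scores` dictionary and the `tally` function are hypothetical names used only for illustration, not part of the site.

```python
# Scores in percent as listed on the page: (DASD-4B-Thinking, GPT-5.2).
scores = {
    "AIME": (83.3, 100.0),
    "GPQA Diamond": (68.4, 92.4),
}


def tally(scores):
    """Return (left wins, ties, right wins) across shared benchmarks.

    Assumption: higher is better, and a win is a strictly higher score.
    """
    left = ties = right = 0
    for left_score, right_score in scores.values():
        if left_score > right_score:
            left += 1
        elif left_score < right_score:
            right += 1
        else:
            ties += 1
    return left, ties, right


left_wins, ties, right_wins = tally(scores)
print(f"DASD-4B-Thinking wins: {left_wins}, ties: {ties}, GPT-5.2 wins: {right_wins}")
```

Under these assumptions the result matches the summary line: GPT-5.2 scores higher on both shared benchmarks.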