benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
papers
articles
head to head
Claude Sonnet 4.6
vs
DASD-4B-Thinking
2 shared benchmarks
2
wins
0
ties
0
wins
94.0%
AIME
83.3%
89.9%
GPQA Diamond
68.4%