benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
papers
articles
head to head
DeepSeek V3.2
vs
DeepSeek V4 Flash
5 shared benchmarks
0
wins
0
ties
5
wins
93.1%
AIME
94.8%
82.4%
GPQA Diamond
88.1%
74.1%
LiveCodeBench
91.6%
85.0%
MMLU Pro
86.2%
73.1%
SWE-bench Verified
79.0%