Claude Mythos Preview vs DeepSeek V4 Pro
7 shared benchmarks
Claude Mythos Preview: 7 wins, ties: 0, DeepSeek V4 Pro: 0 wins
Benchmark                 Claude Mythos Preview   DeepSeek V4 Pro
BrowseComp                86.9%                   83.4%
GPQA Diamond              94.6%                   90.1%
Humanity's Last Exam      64.7%                   37.7%
SWE-bench Multilingual    87.3%                   76.2%
SWE-bench Pro             77.8%                   55.4%
SWE-bench Verified        93.9%                   80.6%
Terminal-Bench 2.0        82.0%                   67.9%