benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
articles
head to head
Claude Mythos Preview
vs
Claude Opus 4.6
10 shared benchmarks
10
wins
0
ties
0
wins
86.9%
BrowseComp
84.0%
83.1%
CyberGym
66.6%
94.6%
GPQA Diamond
91.3%
56.8%
Humanity's Last Exam
53.1%
79.6%
OSWorld
72.7%
87.3%
SWE-bench Multilingual
77.8%
59.0%
SWE-bench Multimodal
27.1%
93.9%
SWE-bench Verified
81.42%
82.0%
Terminal-Bench 2.0
81.8%
97.6%
USAMO 2026
42.3%