benchmark.space
Head to head: Claude Mythos Preview vs GPT-5.4 Mini
4 shared benchmarks: Claude Mythos Preview 4 wins, 0 ties, GPT-5.4 Mini 0 wins
Benchmark            Claude Mythos Preview   GPT-5.4 Mini
GPQA Diamond         94.6%                   88.0%
OSWorld              79.6%                   72.1%
SWE-bench Pro        77.8%                   54.4%
Terminal-Bench 2.0   82.0%                   60.0%
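
The win and tie counts can be reproduced from the four score pairs listed here. The following is a minimal Python sketch, assuming that a higher percentage is better on every benchmark and that the first score in each pair belongs to Claude Mythos Preview and the second to GPT-5.4 Mini. The names `SCORES`, `tally`, and `margins` are hypothetical and are not part of the site.

```python
# Score pairs copied from the comparison, in percent.
# Assumption: (Claude Mythos Preview, GPT-5.4 Mini), higher is better.
SCORES = {
    "GPQA Diamond": (94.6, 88.0),
    "OSWorld": (79.6, 72.1),
    "SWE-bench Pro": (77.8, 54.4),
    "Terminal-Bench 2.0": (82.0, 60.0),
}


def tally(scores):
    """Return (wins for first model, ties, wins for second model)."""
    wins_first = sum(1 for a, b in scores.values() if a > b)
    wins_second = sum(1 for a, b in scores.values() if b > a)
    ties = len(scores) - wins_first - wins_second
    return wins_first, ties, wins_second


def margins(scores):
    """Return the lead of the first model per benchmark, in percentage points."""
    return {name: round(a - b, 1) for name, (a, b) in scores.items()}


if __name__ == "__main__":
    print(tally(SCORES))
    for name, gap in margins(SCORES).items():
        print(f"{name}: {gap:+.1f} points")
```

Under these assumptions, `tally(SCORES)` returns `(4, 0, 0)`, which matches the summary of 4 wins, 0 ties, and 0 wins. The per-benchmark margins are percentage-point differences, not relative improvements.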