benchmark.space
Head to head
Claude Opus 4.6 vs GPT-5.4 Mini
3 shared benchmarks
Claude Opus 4.6: 3 wins, ties: 0, GPT-5.4 Mini: 0 wins
Benchmark            Claude Opus 4.6   GPT-5.4 Mini
GPQA Diamond         91.3%             88.0%
OSWorld              72.7%             72.1%
Terminal-Bench 2.0   81.8%             60.0%
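
The win and tie counts reported above can be re-derived from the three score pairs. The following is a minimal sketch, not the site's own method: the `tally` helper and the `scores` layout are hypothetical, and it assumes that a higher score is better and that a tie means exactly equal scores, since the page does not state its tie rule.

```python
# Scores copied from the comparison above, in percent:
# (Claude Opus 4.6, GPT-5.4 Mini)
scores = {
    "GPQA Diamond": (91.3, 88.0),
    "OSWorld": (72.7, 72.1),
    "Terminal-Bench 2.0": (81.8, 60.0),
}


def tally(scores):
    """Return (first-model wins, ties, second-model wins).

    Assumption: higher is better, and only exactly equal scores tie.
    """
    first = sum(1 for a, b in scores.values() if a > b)
    ties = sum(1 for a, b in scores.values() if a == b)
    second = sum(1 for a, b in scores.values() if a < b)
    return first, ties, second


result = tally(scores)
print(result)  # (3, 0, 0)
```

Under those assumptions the result matches the reported 3 wins, 0 ties, 0 wins. Note that the OSWorld gap is well under one percentage point, so a looser tie rule would change the tally.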