benchmark.space
OpenAI
GPT-5
4 benchmarks
Benchmark            Rank        Score
AIME                 #18 of 39   94.6%
Aider Polyglot       #1 of 7     88.0%
MMMU                 #1 of 2     84.2%
SWE-bench Verified   #19 of 40   74.9%