GPT-5
4 benchmarks
AIME
#18 of 39
94.6%
Aider Polyglot
#1 of 7
88.0%
MMMU
#1 of 2
84.2%
SWE-bench Verified
#19 of 40
74.9%