benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
articles
FrontierMath leaderboard
FrontierMath
2 models tested · Updated 2026-03-05 · Verified sources only
GPT-5.4 Pro
leads at
50.0%
1
GPT-5.4 Pro
OpenAI ·
Epoch AI Blog
· 2026-03-05
New FrontierMath record on Tiers 1-3 (undergrad to postdoc math). Also scored 38% on Tier 4 (research-grade). Solved 2 previously unsolved Tier 4 problems.
50.0%
2
Muse Spark
Meta ·
X/@EpochAIResearch
· 2026-04-08
Independent evaluation by Epoch AI on Tiers 1-3 (undergrad to early postdoc). Behind GPT-5.4 Pro (50%) but competitive with other frontier models.
39.0%