FrontierMath
2 models tested · Updated 2026-03-05 · Verified sources only
GPT-5.4 Pro leads at 50.0%
1
OpenAI · Epoch AI Blog · 2026-03-05
New FrontierMath record on Tiers 1-3 (undergrad to postdoc math). Also scored 38% on Tier 4 (research-grade). Solved 2 previously unsolved Tier 4 problems.
50.0%
2
Meta · X/@EpochAIResearch · 2026-04-08
Independent evaluation by Epoch AI on Tiers 1-3 (undergrad to early postdoc). Behind GPT-5.4 Pro (50%) but competitive with other frontier models.
39.0%