GDPval
1 models tested · Updated 2026-03-05 · Verified sources only
GPT-5.4 leads at 83.0%
1
OpenAI · Blog/OpenAI · 2026-03-05
Real-world knowledge work benchmark: 1,320 tasks across 44 occupations in 9 GDP sectors. Frontier models approach expert quality at 100x speed/cost.
83.0%