benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
articles
GDPval leaderboard
GDPval
1 models tested · Updated 2026-03-05 · Verified sources only
GPT-5.4
leads at
83.0%
1
GPT-5.4
OpenAI ·
Blog/OpenAI
· 2026-03-05
Real-world knowledge work benchmark: 1,320 tasks across 44 occupations in 9 GDP sectors. Frontier models approach expert quality at 100x speed/cost.
83.0%