benchmark.space
MMMU leaderboard
2 models tested · Updated 2025-08-07 · Verified sources only
GPT-5 leads at 84.2%
1. GPT-5 · OpenAI · Blog/OpenAI · 2025-08-07 · 84.2%
College-level visual reasoning. Led prior OpenAI models at launch.
2. Llama 4 Maverick · Meta · HuggingFace/Meta · 2025-04-05 · 73.4%
Instruction-tuned multimodal score.