TheAIGRID on Muse Spark — benchmark.space

YouTube · 2026-04-09

"Humanity's Last Exam, this currently looks like it's state-of-the-art, just three points behind GPT 5.4 Pro, and it's actually currently better than other models when it doesn't use tools."

TheAIGRID

AI YouTube channel host

Humanity's Last Exam · Muse Spark
