benchmark.space
YouTube · 2026-04-10
"We're quite upfront that our model does not perform well on ARC-AGI 2, for example, and publish those results for the community to understand."
Alexandr Wang
Meta AI Leader, Scale AI Founder
ARC-AGI 2
Muse Spark