fchollet on AI benchmarks
Two quotes from François Chollet about benchmarks, models, and evaluation
"The new model from Meta is already looking like a disappointment: overoptimized for public benchmark numbers at the detriment of everything else. Knowing how to evaluate models in a way that correlates with actual usefulness is a core competency for AI labs, and any new lab is"
François Chollet @fchollet · 2026-04-08 · 640 likes
"Join the ARC Prize team -- help us build ARC-AGI-4 and ARC-AGI-5"
François Chollet @fchollet · 2026-04-07 · 128 likes