voices
alexandr_wang on AI benchmarks
1 quote from AI researchers about benchmarks, models, and evaluation
"We're quite upfront that our model does not perform well on ARC-AGI 2, for example, and publish those results for the community to understand."