lu_zai on AI benchmarks
1 quote from AI researchers about benchmarks, models, and evaluation
"Agents could do about 20 steps by the end of last year. GLM 5.1 can do 1,700 right now. Autonomous work time may be the most important curve after scaling laws."
Lu @lu_zai · 2026-04-10