leeerob on AI benchmarks
1 quote from AI researchers about benchmarks, models, and evaluation
"GPT 5.4 is currently the leader on our internal benchmarks. Our engineers find it to be more natural and assertive than previous models. It works through ambiguous problems without second-guessing itself and it is proactive in parallelizing work to keep things moving."
Lee Rob @leeerob · 2026-03-06