1 quote from AI researchers about benchmarks, models, and evaluation
"On the OpenAI side, the company has been heavily teasing their new Spud model, actually doing more to hype it up than to tamp down expectations, reversing the trend that they've had all the way since back when GPT-5 underperformed."