1 quotes from AI researchers about benchmarks, models, and evaluation
"Claude Mythos is arguably the biggest step change in AI capabilities since the GPT-4 jump. When Mythos is allowed to think longer, act deeper, and better explore the solution space, it passes 92% of Terminal Bench task attempts."