YouTube · 2026-04-08
"So we have SWE-bench Pro. This is the creme de la creme of coding benchmarks. Opus 4.6, the best coding model on the planet. On SWE-bench Pro, it scored a 53.4. Mythos preview 77.8. That is not a minor version bump. This is significant."
Matthew Berman
AI YouTube commentator