Matthew Berman on Claude Mythos Preview

YouTube · 2026-04-08

"So we have SWE-bench Pro. This is the creme de la creme of coding benchmarks. Opus 4.6, the best coding model on the planet. On SWE-bench Pro, it scored a 53.4. Mythos preview 77.8. That is not a minor version bump. This is significant."

Matthew Berman

AI YouTube commentator

SWE-bench Pro · Claude Mythos Preview
