Wes Roth on Claude Mythos Preview

YouTube · 2026-04-08

"SWE-bench Verified 93.9%, way above Claude Opus 4.6, Gemini 3.1 Pro. SWE-bench Pro, same thing, above GPT-5.4. For each one of these software engineering tasks, the leap is pretty massive."

Wes Roth

AI YouTube commentator

SWE-bench Verified Claude Mythos Preview

view original source → all researcher takes →