josh on Claude Mythos Preview

YouTube · 2026-04-08

"Mythos scores 93% on this benchmark, and the current best public model, Opus, scores 80.8%. Nothing from OpenAI, Google, or any of the open source models are coming within 13 points of this."

josh

YouTuber, AI commentary channel

SWE-bench Verified Claude Mythos Preview

view original source → all researcher takes →