"On SWE-bench Pro, Mythos got a 78% when previously Opus only got a 53. And if you are curious, I found the numbers for GPT 5.4 it was a 57.7. A 24 point jump is a 50% improvement on one of the hardest software benches we have."
Theo
Tech YouTuber (t3.gg)
SWE-bench Pro
Claude Mythos Preview