"The SWE-bench multimodal implementation is nearly double as well. It is actually a bit over."
Theo
Tech YouTuber (t3.gg)
SWE-bench MultimodalClaude Mythos Preview