"Four is being built primarily for coding. Internal benchmarks, not yet independently verified, suggest it's targeting over 80% on SWE bench, the benchmark for solving real-world software engineering problems."
Julian Goldie
AI tools YouTuber / digital avatar
SWE-bench Verified
DeepSeek V4