YouTube · 2026-02-20
"Five months ago, Claude Sonnet 4.5, which is their smaller model compared to Opus, scored 12%. But just last week, Claude Opus 4.6, five months further on, scored just 10%. You could say chess is a fairly pure measure of a general kind of forward-thinking reasoning prowess."
AI Explained
AI YouTube channel