"This issue affected 8% of reinforcement learning episodes and was isolated to three specific subdomains... GUI computer use, office related tasks, and a small set of STEM environments. We are uncertain about the extent to which this issue has affected the reasoning behavior of the final model."
Wes Roth
AI news YouTuber
Claude Mythos Preview