"Meta found that if you penalize the model for thinking too long, something weird happens. The model actually learns to compress its reasoning and it solves the same problem using fewer tokens."
TheAIGRID
AI YouTube channel host
Muse Spark