YouTube · 2026-04-09
"The most forbidden technique is training an AI using interpretability techniques. You train on the final output only. Never the method. If you train on the method, you are training the AI to obfuscate its thinking."
Z
AI safety researcher and blogger