YouTube · 2026-03-18
"When you add clamping within SAE you have no overhead, you can just do it in real time. In this case you want to do clamping with a diffusion model, you have to do 200 steps of diffusion per token generated. So it is not very real time but it does give us a better deeper look into what is happening in the model."
Vibhu Sapra
Paper Club Presenter