"Scientists solved this by adding back some random noise in the system. This noise is carefully crafted in a way that it averages to zero. They call this stochastic rounding and it is a genius idea."
Dr. Karoly Zsolnai-Feher
AI researcher and YouTube educator (Two Minute Papers)
Nemotron 3 Super 120B