"Even though it has 26 billion total parameters, only about 3.8 billion are active during inference, which gives it way better latency and efficiency."
AI Revolution
AI Revolution YouTube channel host
Gemma 4 26B A4B
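
To put the quoted figures in proportion, here is a minimal sketch that computes the share of parameters active per token. It uses only the two numbers from the quote (26 billion total, about 3.8 billion active). The function name `active_fraction` is hypothetical. The sketch assumes the common rule of thumb that per-token compute scales roughly with the active parameter count. It does not measure latency, and it says nothing about memory, since all weights typically still need to be loaded.

```python
def active_fraction(total_params: float, active_params: float) -> float:
    """Return the share of parameters used per token (hypothetical helper)."""
    if total_params <= 0 or not 0 <= active_params <= total_params:
        raise ValueError("expected 0 <= active_params <= total_params, total_params > 0")
    return active_params / total_params


# Figures taken from the quote; "about 3.8 billion" is approximate.
TOTAL_PARAMS = 26e9
ACTIVE_PARAMS = 3.8e9

fraction = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
print(f"{fraction:.1%} of parameters active per token")  # prints "14.6% ..."
```

By this estimate, roughly one in seven parameters takes part in each forward pass, which is the basis for the latency and efficiency claim in the quote.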