"Some of the Gemma models have an E in the model name like E2B and E4B. That stands for effective parameters because these models incorporate per layer embeddings, which is like giving every layer in the neural network its own mini cheat sheet for each token."
Jeff Delaney
Fireship YouTube channel host
Gemma 4 E4B