"People in policy don't understand the algorithmic improvement rate for post-training in the era of agents with really long contexts. V4 *almost certainly* has the latent capability to do this shit. Maybe with more tokens, maybe even with more total compute."
Teortaxes
DeepSeek V3.2