"On my private Simple Bench, a test of trick questions or common sense reasoning, it beat its own previous record from Gemini 3 Pro, and got 79.6%. That essentially brings it within the margin of error for the human average baseline."
AI Explained
AI YouTube channel
Simple Bench, Gemini 3.1 Pro