AI Explained on GPT-5.4 — benchmark.space

YouTube · 2026-03-06

"GPT 5.4 on questions that probe for hallucinations, according to Artificial Analysis, does well. Not quite as well as GPT 5.3 codex, but measured by overall accuracy it is close to state-of-the-art. But when GPT 5.4 gets things wrong, it is more likely than other models to BS an answer, up here at 89%."

AI Explained

AI commentary YouTube channel

GPT-5.4

view original source → all researcher takes →