"GPT 5.4 on questions that probe for hallucinations, according to Artificial Analysis, does well. Not quite as well as GPT 5.3 codex, but measured by overall accuracy it is close to state-of-the-art. But when GPT 5.4 gets things wrong, it is more likely than other models to BS an answer, up here at 89%."
AI Explained
AI commentary YouTube channel
GPT-5.4