Dylan Patel on GPT-5.4 — benchmark.space

YouTube · 2026-03-13

"An H100 can serve more tokens per GPU of GPT-5.4 than if you had run GPT-4 on it. So it is producing more tokens of a model that is of higher quality."

Dylan Patel

Founder, SemiAnalysis

GPT-5.4

view original source → all researcher takes →