IFBench
1 model tested · Updated 2026-03-09 · Verified sources only
Grok 4.20 leads at 83.0%
1
xAI · Artificial Analysis · 2026-03-09
First place on the IFBench instruction-following benchmark. Source: Artificial Analysis Intelligence Index v4.0.
83.0%