benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
articles
xAI
Grok 4.20
3 benchmarks
τ2-bench
#2 of 5
97.0%
IFBench
#1 of 1
83.0%
ARC-AGI 3
#4 of 4
0.0%