benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
articles
X / Twitter · 2026-04-06
"A bigger problem: many third-party harnesses compress tool responses every 3 steps when approaching the context limit, leading to very low cache hit rates."
Fuli Luo
Xiaomi MiMo lead, ex-DeepSeek
view original source →
all researcher takes →