benchmark.space
nicholas_caridi on AI benchmarks
1 quote from AI researchers about benchmarks, models, and evaluation
"I found more bugs in the last few weeks with Mythos than in the rest of my entire life combined."
Nicholas Caridi (@nicholas_caridi) · 2026-04-09
Claude Mythos Preview