benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
papers
articles
arxiv
Llama-3-Instruct-7B (base)
1 benchmarks
AIME
#73 of 74
0.0%