benchmark.space
SFT (Qwen2.5-Math-1.5B)
1 benchmark
AIME: #97 of 103, 11.7%