benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
papers
articles
arxiv
SFT (Qwen2.5-Math-7B)
1 benchmarks
AIME
#92 of 103
22.2%