benchmark.space
Qwen3-1.7B + SeLaR
2 benchmarks
AIME: rank #65 of 103, score 53.33%
GPQA Diamond: rank #84 of 92, score 35.35%