benchmark.space
DeepSeek-R1-Distill-Qwen-32B (W16A16)
2 benchmarks
HumanEval: 81.71% (#9 of 17)
AIME: 76.67% (#51 of 103)