benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
papers
articles
arxiv
DeepSeek-R1-Distill-Qwen-14B (W16A16)
2 benchmarks
AIME
#54 of 103
73.33%
HumanEval
#13 of 17
73.17%