benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
papers
articles
arxiv
DeepSeek-R1-Distill-Qwen-14B SliderQuant W4A16
2 benchmarks
HumanEval
#15 of 17
72.56%
AIME
#57 of 103
70.0%