benchmark
.
space
benchmarks
rankings
compare
voices
transcripts
papers
articles
arxiv
Qwen 3 VL GRPO
2 benchmarks
DocVQA
#4 of 15
95.9%
MMMU
#16 of 26
69.7%