benchmark.space
Phi-4-reasoning-vision-15B
3 benchmarks
DocVQA: 76.0% (#11 of 11)
MathVista: 75.2% (#7 of 9)
MMMU: 54.3% (#13 of 16)