YouTube · 2026-04-04
"We introduced a benchmark at NeurIPS called RF100VL, Roboflow 100 Vision Language. We evaluated Gemini and SAM 3 and OpenAI and a number of multimodal LLMs. The best model at the time we published the work was Gemini 2, but that is 12.5% across all domains. The gap of how far these models have to go on segmentation is enormous."
Joseph Nelson
CEO of Roboflow