YouTube · 2026-04-09
"Most models are not actually natively built to be multimodal. Most of them are just simply text-based. Which is why when you do have companies like Google and Meta that train their models natively to be multimodal, you do get some very effective models that have multimodal reasoning capability."
TheAIGRID
AI YouTube channel host