MedXPertQA MM Leaderboard 2026 — Results Across 4 Real AI Models

MedXPertQA MM leaderboard

MedXPertQA MM

4 models tested · Updated 2026-04-02 · Verified sources only

      Gemma 4 31B leads at 61.3%
    

Medical multimodal QA benchmark.

61.3%

MoE 25.2B total, 3.8B active.

58.1%

Medical expert QA multimodal.

48.7%

LG AI Research · HuggingFace/LGAI-EXAONE · 2026-04-14

Beats GPT-5 mini (34.4). Comparable to Qwen3-VL 32B (41.6).

42.1%