Multimodal Medical Reasoning Benchmark
A comparative benchmarking platform for systematically evaluating multimodal AI models on real-world clinical reasoning tasks and diagnostic accuracy.
Rank
Model
Overall Score
Overall Pass@1
No results found
A comparative benchmarking platform for systematically evaluating multimodal AI models on real-world clinical reasoning tasks and diagnostic accuracy.
We acknowledge the support of UTHealth Houston and the contributions of the research team involved in data curation and evaluation design. We also recognize the use of publicly available tools, frameworks, and open-source technologies that enabled the development of this platform.