Dataset Registry
Curated benchmark datasets designed for evaluating multimodal medical reasoning across varying levels of complexity.
DatasetComplexitySizeDescriptionAction
Curated benchmark datasets designed for evaluating multimodal medical reasoning across varying levels of complexity.
We acknowledge the support of UTHealth Houston and the contributions of the research team involved in data curation and evaluation design. We also recognize the use of publicly available tools, frameworks, and open-source technologies that enabled the development of this platform.