BiomedXPro: Prompt Optimization for Explainable Diagnosis with Biomedical Vision Language Models

AI in healthcare
Published: arXiv: 2510.15866v1
Authors

Kaushitha Silva Mansitha Eashwara Sanduni Ubayasiri Ruwan Tennakoon Damayanthi Herath

Abstract

The clinical adoption of biomedical vision-language models is hindered by prompt optimization techniques that produce either uninterpretable latent vectors or single textual prompts. This lack of transparency and failure to capture the multi-faceted nature of clinical diagnosis, which relies on integrating diverse observations, limits their trustworthiness in high-stakes settings. To address this, we introduce BiomedXPro, an evolutionary framework that leverages a large language model as both a biomedical knowledge extractor and an adaptive optimizer to automatically generate a diverse ensemble of interpretable, natural-language prompt pairs for disease diagnosis. Experiments on multiple biomedical benchmarks show that BiomedXPro consistently outperforms state-of-the-art prompt-tuning methods, particularly in data-scarce few-shot settings. Furthermore, our analysis demonstrates a strong semantic alignment between the discovered prompts and statistically significant clinical features, grounding the model's performance in verifiable concepts. By producing a diverse ensemble of interpretable prompts, BiomedXPro provides a verifiable basis for model predictions, representing a critical step toward the development of more trustworthy and clinically-aligned AI systems.

Paper Summary

Problem
Accurate and transparent disease diagnosis is crucial in clinical practice. However, current computer vision systems struggle to provide interpretable outputs that align with established clinical reasoning processes. This limits their trustworthiness in high-stakes settings.
Key Innovation
Researchers introduce BiomedXPro, an evolutionary framework that leverages a large language model as both a biomedical knowledge extractor and an adaptive optimizer. This framework automatically generates a diverse ensemble of interpretable, natural-language prompt pairs for disease diagnosis. BiomedXPro addresses the limitations of uninterpretable soft prompts and single-prompt systems.
Practical Impact
BiomedXPro has the potential to improve the accuracy and trustworthiness of disease diagnosis in clinical practice. By providing interpretable and diverse prompt pairs, clinicians can better understand the underlying diagnostic rationale. This can lead to more accurate diagnoses and improved patient outcomes.
Analogy / Intuitive Explanation
Imagine you're trying to diagnose a patient's illness based on their symptoms. A doctor would typically ask a series of questions to gather more information, such as "Do you have a fever?" or "Have you recently traveled abroad?" BiomedXPro is like a sophisticated question-asking system that generates multiple, interpretable questions (or prompts) to help the computer vision system accurately diagnose the patient's illness. This approach allows the system to capture the multifaceted nature of clinical observations and provides a verifiable basis for model predictions.
Paper Information
Categories:
cs.CV cs.NE
Published Date:

arXiv ID:

2510.15866v1

Quick Actions