Envisioning MedCLIP: A Deep Dive into Explainability for Medical Vision-Language Models

Hashmi, Anees Ur Rehman; Mahapatra, Dwarikanath; Yaqub, Mohammad

Authors: Anees Ur Rehman Hashmi, Dwarikanath Mahapatra, Mohammad Yaqub

(Submitted on 27 Mar 2024)

Abstract: Explaining deep learning models is becoming increasingly important as new multimodal models emerge almost daily, particularly in safety-critical domains such as medical imaging. However, the lack of detailed investigations into how explainability methods perform on these models is widening the gap between their development and safe deployment. In this work, we analyze the performance of various explainable AI methods on a vision-language model (VLM), MedCLIP, to demystify its inner workings. We also provide a simple methodology to overcome the shortcomings of these methods. Our work offers a new perspective on the explainability of a recent, well-known VLM in the medical domain, and our assessment method is generalizable to other current and possible future VLMs.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.18996 [cs.CV]
	(or arXiv:2403.18996v1 [cs.CV] for this version)

Submission history

From: Anees Ur Rehman Hashmi [view email]
[v1] Wed, 27 Mar 2024 20:30:01 GMT (11638kb,D)
