We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: ChatCAD+: Towards a Universal and Reliable Interactive CAD using LLMs

Abstract: The integration of Computer-Aided Diagnosis (CAD) with Large Language Models (LLMs) presents a promising frontier in clinical applications, notably in automating diagnostic processes akin to those performed by radiologists and providing consultations similar to a virtual family doctor. Despite the promising potential of this integration, current works face at least two limitations: (1) From the perspective of a radiologist, existing studies typically have a restricted scope of applicable imaging domains, failing to meet the diagnostic needs of different patients. Also, the insufficient diagnostic capability of LLMs further undermine the quality and reliability of the generated medical reports. (2) Current LLMs lack the requisite depth in medical expertise, rendering them less effective as virtual family doctors due to the potential unreliability of the advice provided during patient consultations. To address these limitations, we introduce ChatCAD+, to be universal and reliable. Specifically, it is featured by two main modules: (1) Reliable Report Generation and (2) Reliable Interaction. The Reliable Report Generation module is capable of interpreting medical images from diverse domains and generate high-quality medical reports via our proposed hierarchical in-context learning. Concurrently, the interaction module leverages up-to-date information from reputable medical websites to provide reliable medical advice. Together, these designed modules synergize to closely align with the expertise of human medical professionals, offering enhanced consistency and reliability for interpretation and advice. The source code is available at this https URL
Comments: Authors Zihao Zhao, Sheng Wang, Jinchen Gu, Yitao Zhu contributed equally to this work and should be considered co-first authors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2305.15964 [cs.CV]
  (or arXiv:2305.15964v5 [cs.CV] for this version)

Submission history

From: Zihao Zhao [view email]
[v1] Thu, 25 May 2023 12:03:31 GMT (2481kb,D)
[v2] Fri, 26 May 2023 02:53:58 GMT (2481kb,D)
[v3] Thu, 29 Jun 2023 02:57:48 GMT (2932kb,D)
[v4] Fri, 7 Jul 2023 16:16:12 GMT (2984kb,D)
[v5] Wed, 17 Apr 2024 15:01:39 GMT (2932kb,D)

Link back to: arXiv, form interface, contact.