Enhancing chatbot performance for imaging recommendations: Leveraging GPT-4 and context-awareness for trustworthy clinical guidance.

ACR Guidelines Auditability Chatbot Clinical Decision Support Large Language Model Trust

Journal

European journal of radiology
ISSN: 1872-7727
Titre abrégé: Eur J Radiol
Pays: Ireland
ID NLM: 8106411

Informations de publication

Date de publication:
24 Sep 2024
Historique:
received: 25 04 2024
revised: 10 08 2024
accepted: 23 09 2024
medline: 27 9 2024
pubmed: 27 9 2024
entrez: 26 9 2024
Statut: aheadofprint

Résumé

To investigate if GPT-4 improves the accuracy, consistency, and trustworthiness of a context-aware chatbot to provide personalized imaging recommendations from American College of Radiology (ACR) appropriateness criteria documents using semantic similarity processing: In addition, we sought to enable auditability of the output by revealing the information source the decision relies on. We refined an existing chatbot that incorporated specialized knowledge of the ACR guidelines by upgrading GPT-3.5-Turbo to its successor GPT-4 by OpenAI, using the latest version of LlamaIndex, and improving the prompting strategy. This chatbot was compared to the previous version, generic GPT-3.5-Turbo and GPT-4, and general radiologists regarding the performance in applying the ACR appropriateness guidelines. The refined context-aware chatbot performed superior to the previous version using GPT-3.5-Turbo, generic chatbots GPT-3.5-Turbo and GPT-4, and general radiologists in providing "usually or may be appropriate" recommendations according to the ACR guidelines (all p < 0.001). It also outperformed GPT-3.5-Turbo and general radiologists in respect to "usually appropriate" recommendations (both p < 0.001). Moreover, the consistency in correct answers was higher with 78 % consistent correct "usually appropriate" answers and 94 % for "usually or may be appropriate" recommendations. In all cases, the same source documents were chosen, ensuring transparency. Our study demonstrates the significance of context awareness in ensuring the use of appropriate knowledge and proposes a strategy to enhance trust in chatbot-based outputs to provide transparency. The improvements in accuracy, consistency, and source transparency address trust issues and enhance the clinical decision support process. ACR, American College of Radiology; accGPT, appropriateness criteria context aware GPT; accGPT-4, appropriateness criteria context aware GPT using GPT-4; GPT, generative pre-trained transformer; LLM, Large Language Model.

Identifiants

pubmed: 39326236
pii: S0720-048X(24)00472-8
doi: 10.1016/j.ejrad.2024.111756
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

111756

Informations de copyright

Copyright © 2024 The Author(s). Published by Elsevier B.V. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of competing interest The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: AR received grants from Berta-Ottenstein-Programme for Clinician Scientists, Faculty of Medicine, University of Freiburg.

Auteurs

Alexander Rau (A)

Department of Diagnostic and Interventional Radiology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany; Department of Neuroradiology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany. Electronic address: alexander.rau@uniklinik-freiburg.de.

Fabian Bamberg (F)

Department of Diagnostic and Interventional Radiology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany.

Anna Fink (A)

Department of Diagnostic and Interventional Radiology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany.

Phuong Hien Tran (P)

Department of Diagnostic and Interventional Radiology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany.

Marco Reisert (M)

Medical Physics, Department of Diagnostic and Interventional Radiology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany; Department of Stereotactic and Functional Neurosurgery, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany.

Maximilian F Russe (MF)

Department of Diagnostic and Interventional Radiology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany.

Classifications MeSH