Large Language Model-Based Neurosurgical Evaluation Matrix: A Novel Scoring Criteria to Assess the Efficacy of ChatGPT as an Educational Tool for Neurosurgery Board Preparation.

AI evaluation matrix Artificial intelligence ChatGPT Medical education technology Neurosurgical education

Journal

World neurosurgery
ISSN: 1878-8769
Titre abrégé: World Neurosurg
Pays: United States
ID NLM: 101528275

Informations de publication

Date de publication:
14 Oct 2023
Historique:
received: 03 10 2023
accepted: 07 10 2023
pubmed: 16 10 2023
medline: 16 10 2023
entrez: 15 10 2023
Statut: aheadofprint

Résumé

Technological advancements are reshaping medical education, with digital tools becoming essential in all levels of training. Amidst this transformation, the study explores the potential of ChatGPT, an artificial intelligence model by OpenAI, in enhancing neurosurgical board education. The focus extends beyond technology adoption to its effective utilization, with ChatGPT's proficiency evaluated against practice questions from the Primary Neurosurgery Written Board Exam. Using the Congress of Neurologic Surgeons (CNS) Self-Assessment Neurosurgery (SANS) Exam Board Review Prep questions, we conducted 3 rounds of analysis with ChatGPT. We developed a novel ChatGPT Neurosurgical Evaluation Matrix (CNEM) to assess the output quality, accuracy, concordance, and clarity of ChatGPT's answers. ChatGPT achieved spot-on accuracy for 66.7% of prompted questions, 59.4% of unprompted questions, and 63.9% of unprompted questions with a leading phrase. Stratified by topic, accuracy ranged from 50.0% (Vascular) to 78.8% (Neuropathology). In comparison to SANS explanations, ChatGPT output was considered better in 19.1% of questions, equal in 51.6%, and worse in 29.3%. Concordance analysis showed that 95.5% of unprompted ChatGPT outputs and 97.4% of unprompted outputs with a leading phrase were aligned. Our study evaluated the performance of ChatGPT in neurosurgical board education by assessing its accuracy, clarity, and concordance. The findings highlight the potential and challenges of integrating AI technologies like ChatGPT into medical and neurosurgical board education. Further research is needed to refine these tools and optimize their performance for enhanced medical education and patient care.

Identifiants

pubmed: 37839567
pii: S1878-8750(23)01448-1
doi: 10.1016/j.wneu.2023.10.043
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

Copyright © 2023 Elsevier Inc. All rights reserved.

Auteurs

Sneha Sai Mannam (SS)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Robert Subtirelu (R)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Daksh Chauhan (D)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Hasan S Ahmad (HS)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Irina Mihaela Matache (IM)

Department of Physiology, Faculty of Medicine, Carol Davila University of Medicine and Pharmacy, Bucharest, Romania.

Kevin Bryan (K)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Siddharth V K Chitta (SVK)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Shreya C Bathula (SC)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Ryan Turlip (R)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Connor Wathen (C)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Yohannes Ghenbot (Y)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Sonia Ajmera (S)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Rachel Blue (R)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

H Isaac Chen (HI)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Zarina S Ali (ZS)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Neil Malhotra (N)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Visish Srinivasan (V)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Ali K Ozturk (AK)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Jang W Yoon (JW)

Department of Neurosurgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, USA. Electronic address: jang.yoon@pennmedicine.upenn.edu.

Classifications MeSH