ChatGPT's quiz skills in different otolaryngology subspecialties: an analysis of 2576 single-choice and multiple-choice board certification preparation questions.
AI
Artificial intelligence
ChatGPT
Multiple-choice
Otolaryngology quiz
Single-choice
Journal
European archives of oto-rhino-laryngology : official journal of the European Federation of Oto-Rhino-Laryngological Societies (EUFOS) : affiliated with the German Society for Oto-Rhino-Laryngology - Head and Neck Surgery
ISSN: 1434-4726
Titre abrégé: Eur Arch Otorhinolaryngol
Pays: Germany
ID NLM: 9002937
Informations de publication
Date de publication:
Sep 2023
Sep 2023
Historique:
received:
23
05
2023
accepted:
26
05
2023
medline:
31
7
2023
pubmed:
7
6
2023
entrez:
7
6
2023
Statut:
ppublish
Résumé
With the increasing adoption of artificial intelligence (AI) in various domains, including healthcare, there is growing acceptance and interest in consulting AI models to provide medical information and advice. This study aimed to evaluate the accuracy of ChatGPT's responses to practice quiz questions designed for otolaryngology board certification and decipher potential performance disparities across different otolaryngology subspecialties. A dataset covering 15 otolaryngology subspecialties was collected from an online learning platform funded by the German Society of Oto-Rhino-Laryngology, Head and Neck Surgery, designed for board certification examination preparation. These questions were entered into ChatGPT, with its responses being analyzed for accuracy and variance in performance. The dataset included 2576 questions (479 multiple-choice and 2097 single-choice), of which 57% (n = 1475) were answered correctly by ChatGPT. An in-depth analysis of question style revealed that single-choice questions were associated with a significantly higher rate (p < 0.001) of correct responses (n = 1313; 63%) compared to multiple-choice questions (n = 162; 34%). Stratified by question categories, ChatGPT yielded the highest rate of correct responses (n = 151; 72%) in the field of allergology, whereas 7 out of 10 questions (n = 65; 71%) on legal otolaryngology aspects were answered incorrectly. The study reveals ChatGPT's potential as a supplementary tool for otolaryngology board certification preparation. However, its propensity for errors in certain otolaryngology areas calls for further refinement. Future research should address these limitations to improve ChatGPT's educational use. An approach, with expert collaboration, is recommended for the reliable and accurate integration of such AI models.
Identifiants
pubmed: 37285018
doi: 10.1007/s00405-023-08051-4
pii: 10.1007/s00405-023-08051-4
pmc: PMC10382366
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
4271-4278Commentaires et corrections
Type : CommentIn
Informations de copyright
© 2023. The Author(s).
Références
Laryngoscope. 2022 Dec;132(12):2516-2523
pubmed: 35638245
J Card Surg. 2022 Dec;37(12):4612-4620
pubmed: 36345692
Int J Biol Sci. 2021 Jan 1;17(2):475-486
pubmed: 33613106
J Pers Med. 2022 Oct 19;12(10):
pubmed: 36294878
Eur J Health Econ. 2022 Mar;23(2):211-223
pubmed: 34373958
Am J Obstet Gynecol. 2023 Jun;228(6):696-705
pubmed: 36924907
JMIR Med Educ. 2023 Mar 8;9:e46876
pubmed: 36867743
J Educ Eval Health Prof. 2023;20:1
pubmed: 36627845
JAMA Ophthalmol. 2023 Jun 1;141(6):589-597
pubmed: 37103928
Laryngorhinootologie. 2023 Oct;102(10):762-769
pubmed: 36977468
Nat Med. 2019 Jan;25(1):24-29
pubmed: 30617335
J Voice. 2022 Jan;36(1):2-3
pubmed: 33941396
JMIR Med Educ. 2023 Feb 8;9:e45312
pubmed: 36753318
J Clin Med. 2022 Aug 25;11(17):
pubmed: 36078928
PLOS Digit Health. 2023 Feb 9;2(2):e0000198
pubmed: 36812645