ChatGPT's quiz skills in different otolaryngology subspecialties: an analysis of 2576 single-choice and multiple-choice board certification preparation questions.

MeSH terms: Humans; Artificial Intelligence; Certification; Educational Status; Otolaryngology; Referral and Consultation

Keywords: AI; Artificial intelligence; ChatGPT; Multiple-choice; Otolaryngology quiz; Single-choice

Journal

European archives of oto-rhino-laryngology : official journal of the European Federation of Oto-Rhino-Laryngological Societies (EUFOS) : affiliated with the German Society for Oto-Rhino-Laryngology - Head and Neck Surgery

ISSN: 1434-4726

Abbreviated title: Eur Arch Otorhinolaryngol

Country: Germany

NLM ID: 9002937

Publication information

Publication date:
Sep 2023

History:

received: 23 May 2023

accepted: 26 May 2023

medline: 31 Jul 2023

pubmed: 7 Jun 2023

entrez: 7 Jun 2023

Status: ppublish

Abstract

With the increasing adoption of artificial intelligence (AI) in various domains, including healthcare, there is growing acceptance of, and interest in, consulting AI models for medical information and advice. This study aimed to evaluate the accuracy of ChatGPT's responses to practice quiz questions designed for otolaryngology board certification and to identify potential performance disparities across otolaryngology subspecialties. A dataset covering 15 otolaryngology subspecialties was collected from an online learning platform funded by the German Society of Oto-Rhino-Laryngology, Head and Neck Surgery, designed for board certification examination preparation. These questions were entered into ChatGPT, and its responses were analyzed for accuracy and variance in performance. The dataset included 2576 questions (479 multiple-choice and 2097 single-choice), of which 57% (n = 1475) were answered correctly by ChatGPT. An in-depth analysis of question style revealed that single-choice questions were associated with a significantly higher rate (p < 0.001) of correct responses (n = 1313; 63%) than multiple-choice questions (n = 162; 34%). Stratified by question category, ChatGPT yielded the highest rate of correct responses (n = 151; 72%) in allergology, whereas 7 out of 10 questions (n = 65; 71%) on legal aspects of otolaryngology were answered incorrectly. The study reveals ChatGPT's potential as a supplementary tool for otolaryngology board certification preparation. However, its propensity for errors in certain otolaryngology areas calls for further refinement. Future research should address these limitations to improve ChatGPT's educational use. An approach that includes expert collaboration is recommended for the reliable and accurate integration of such AI models.
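The proportions reported in the abstract can be re-derived from the stated counts. The following is a minimal sketch, not the authors' analysis: the abstract does not say which statistical test produced p < 0.001, so the Pearson chi-square test on a 2x2 table (without continuity correction) is an assumption here, and all variable names are illustrative.

```python
import math

# Counts taken directly from the abstract.
single_correct, single_total = 1313, 2097
multi_correct, multi_total = 162, 479

total = single_total + multi_total      # all questions
correct = single_correct + multi_correct  # all correct answers


def pct(part, whole):
    """Percentage rounded to the nearest integer, as reported in the abstract."""
    return round(100 * part / whole)


overall_pct = pct(correct, total)
single_pct = pct(single_correct, single_total)
multi_pct = pct(multi_correct, multi_total)

# 2x2 contingency table: rows = question style, columns = correct / incorrect.
a, b = single_correct, single_total - single_correct
c, d = multi_correct, multi_total - multi_correct
n = a + b + c + d

# Pearson chi-square statistic for a 2x2 table (assumed test, see lead-in).
chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

# For 1 degree of freedom, the upper-tail probability is erfc(sqrt(chi2 / 2)).
p_value = math.erfc(math.sqrt(chi2 / 2))

print(f"overall: {correct}/{total} = {overall_pct}%")
print(f"single-choice: {single_pct}%, multiple-choice: {multi_pct}%")
print(f"chi-square = {chi2:.1f}, p < 0.001: {p_value < 0.001}")
```

Under these assumptions the counts add up to the reported totals and the difference between question styles stays well below the 0.001 threshold, which is consistent with the abstract.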

Identifiers

DOI: 10.1007/s00405-023-08051-4

PMID: 37285018

PMC: PMC10382366

pii: 10.1007/s00405-023-08051-4

Publication types

Journal Article

Languages

eng

Pagination

4271-4278

Comments and corrections

Type: CommentIn

Authors

Cosima C Hoch (CC)

Barbara Wollenberg (B)

Jan-Christoffer Lüers (JC)

Samuel Knoedler (S)

Leonard Knoedler (L)

Konstantin Frank (K)

Sebastian Cotofana (S)

Michael Alfertshofer (M)
