An evaluation of AI generated literature reviews in musculoskeletal radiology.

Artificial intelligence Literature review Sarcoma

Journal

The surgeon : journal of the Royal Colleges of Surgeons of Edinburgh and Ireland
ISSN: 1479-666X
Titre abrégé: Surgeon
Pays: Scotland
ID NLM: 101168329

Informations de publication

Date de publication:
12 Jan 2024
Historique:
received: 11 09 2023
revised: 20 12 2023
accepted: 27 12 2023
medline: 14 1 2024
pubmed: 14 1 2024
entrez: 13 1 2024
Statut: aheadofprint

Résumé

The use of artificial intelligence (AI) tools to aid in summarizing information in medicine and research has recently garnered a huge amount of interest. While tools such as ChatGPT produce convincing and naturally sounding output, the answers are sometimes incorrect. Some of these drawbacks, it is hoped, can be avoided by using programmes trained for a more specific scope. In this study we compared the performance of a new AI tool (the-literature.com) to the latest version OpenAI's ChatGPT (GPT-4) in summarizing topics that the authors have significantly contributed to. The AI tools were asked to produce a literature review on 7 topics. These were selected based on the research topics that the authors were intimately familiar with and have contributed to through their own publications. The output produced by the AI tools were graded on a 1-5 Likert scale for accuracy, comprehensiveness, and relevance by two fellowship trained consultant radiologists. The-literature.com produced 3 excellent summaries, 3 very poor summaries not relevant to the prompt, and one summary, which was relevant but did not include all relevant papers. All of the summaries produced by GPT-4 were relevant, but fewer relevant papers were identified. The average Likert rating was for the-literature was 2.88 and 3.86 for GPT-4. There was good agreement between the ratings of both radiologists (ICC = 0.883). Summaries produced by AI in its current state require careful human validation. GPT-4 on average provides higher quality summaries. Neither tool can reliably identify all relevant publications.

Identifiants

pubmed: 38218659
pii: S1479-666X(24)00008-8
doi: 10.1016/j.surge.2023.12.005
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

Copyright © 2024 Royal College of Surgeons of Edinburgh (Scottish charity number SC005317) and Royal College of Surgeons in Ireland. Published by Elsevier Ltd. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of competing interest None.

Auteurs

N Jenko (N)

Radiology, Royal Orthopaedic Hospital NHS Foundation Trust, Birmingham, UK. Electronic address: Nathan.jenko@nhs.net.

S Ariyaratne (S)

Radiology, Royal Orthopaedic Hospital NHS Foundation Trust, Birmingham, UK.

L Jeys (L)

Orthopaedic Surgery, Royal Orthopaedic Hospital NHS Foundation Trust, Birmingham, UK.

S Evans (S)

Orthopaedic Surgery, Royal Orthopaedic Hospital NHS Foundation Trust, Birmingham, UK.

K P Iyengar (KP)

Orthopaedic Surgery, Mersey and West Lancashire Teaching Hospitals NHS Trust, Southport, UK.

R Botchu (R)

Radiology, Royal Orthopaedic Hospital NHS Foundation Trust, Birmingham, UK.

Classifications MeSH