Validity of ChatGPT-generated musculoskeletal images.

Anatomical illustrations ChatGPT Figure generation Large language models Musculoskeletal studies Radiology

Journal

Skeletal radiology

ISSN: 1432-2161

Titre abrégé: Skeletal Radiol

Pays: Germany

ID NLM: 7701953

Informations de publication

Date de publication:
04 Mar 2024

Historique:

received: 23 01 2024

accepted: 27 02 2024

revised: 26 02 2024

pubmed: 5 3 2024

medline: 5 3 2024

entrez: 4 3 2024

Statut: aheadofprint

Résumé

In the evolving landscape of medical research and radiology, effective communication of intricate ideas is imperative, with visualizations playing a crucial role. This study explores the transformative potential of ChatGPT4, a powerful Large Language Model (LLM), in automating the creation of schematics and figures for radiology research papers, specifically focusing on its implications for musculoskeletal studies. Deploying ChatGPT4, the study aimed to assess the model's ability to generate anatomical images of six large joints-shoulder, elbow, wrist, hip, knee, and ankle. Four variations of a text prompt were utilized, to generate a coronal illustration with annotations for each joint. Evaluation parameters included anatomical correctness, correctness of annotations, aesthetic nature of illustrations, usability of figures in research papers, and cost-effectiveness. Four panellists performed the assessment using a 5-point Likert Scale. Overall analysis of the 24 illustrations encompassing the six joints of interest (4 of each) revealed significant limitations in ChatGPT4's performance. The anatomical design ranged from poor to good, all of the illustrations received a below-average rating for annotation, with the majority assessed as poor. All of them ranked below average for usability in research papers. There was good agreement between raters across all domains (ICC = 0.61). While LLMs like ChatGPT4 present promising prospects for rapid figure generation, their current capabilities fall short of meeting the rigorous standards demanded by musculoskeletal radiology research. Future developments should focus on iterative refinement processes to enhance the realism of LLM-generated musculoskeletal schematics.

Identifiants

DOI: 10.1007/s00256-024-04638-y PMID: 38438538

pubmed: 38438538

doi: 10.1007/s00256-024-04638-y

pii: 10.1007/s00256-024-04638-y

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Commentaires et corrections

Type : CommentIn

Informations de copyright

Références

Divecha C, Tullu M, Karande S. Utilizing tables, figures, charts and graphs to enhance the readability of a research paper. Medknow; 2023. p. 125–31.

Singhal K, Azizi S, Tu T, Mahdavi SS, Wei J, Chung HW, et al. Large language models encode clinical knowledge. Nature. 2023;620(7972):172–80.

doi: 10.1038/s41586-023-06291-2 pubmed: 37438534 pmcid: 10396962

Clusmann J, Kolbinger FR, Muti HS, Carrero ZI, Eckardt JN, Laleh NG, et al. The future landscape of large language models in medicine. Commun Med (Lond). 2023;3(1):141.

doi: 10.1038/s43856-023-00370-1 pubmed: 37816837

Naveed H, Khan AU, Qiu S, Saqib M, Anwar S, Usman M, et al. A comprehensive overview of large language models. arXiv preprint arXiv:230706435.2023.

Ariyaratne S, Iyengar KP, Botchu R. Will collaborative publishing with ChatGPT drive academic writing in the future? Br J Surg. 2023;110(9):1213–4.

doi: 10.1093/bjs/znad198 pubmed: 37368994

Botchu R, Iyengar KP. Will ChatGPT drive radiology in the future? Indian J Radiol Imaging. 2023;33(4):436–7.

Sullivan GM, Artino AR Jr. Analyzing and interpreting data from Likert-type scales. J Grad Med Educ. 2013;5(4):541–2.

doi: 10.4300/JGME-5-4-18 pubmed: 24454995 pmcid: 3886444

Corl FM, Garland MR, Fishman EK. Role of computer technology in medical illustration. Am J Roentgenol. 2000;175(6):1519–24.

doi: 10.2214/ajr.175.6.1751519

Stokel-Walker C, Van Noorden R. What ChatGPT and generative AI mean for science. Nature. 2023;614(7947):214–6.

doi: 10.1038/d41586-023-00340-6 pubmed: 36747115

Sallam M. ChatGPT utility in healthcare education, research, and practice: systematic review on the promising perspectives and valid concerns. Healthcare 2023;11(6):887. MDPI.

Liebrenz M, Schleifer R, Buadze A, Bhugra D, Smith A. Generating scholarly content with ChatGPT: ethical challenges for medical publishing. Lancet Digital Health. 2023;5(3):e105–6.

doi: 10.1016/S2589-7500(23)00019-5 pubmed: 36754725

Li H, Moon JT, Purkayastha S, Celi LA, Trivedi H, Gichoya JW. Ethics of large language models in medicine and medical research. Lancet Digital Health. 2023;5(6):e333–5.

doi: 10.1016/S2589-7500(23)00083-3 pubmed: 37120418

https://www.theverge.com/2023/11/6/23948386/chatgpt-active-user-count-openai-developer-conference (accessed: 26th December 2023, at 6:00 PM).

Validity of ChatGPT-generated musculoskeletal images.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Commentaires et corrections

Informations de copyright

Références

Auteurs

P Ajmera (P)

N Nischal (N)

S Ariyaratne (S)

B Botchu (B)

K D P Bhamidipaty (KDP)

K P Iyengar (KP)

S R Ajmera (SR)

N Jenko (N)

R Botchu (R)

Classifications MeSH