Validity of ChatGPT-generated musculoskeletal images.
Anatomical illustrations
ChatGPT
Figure generation
Large language models
Musculoskeletal studies
Radiology
Journal
Skeletal radiology
ISSN: 1432-2161
Titre abrégé: Skeletal Radiol
Pays: Germany
ID NLM: 7701953
Informations de publication
Date de publication:
04 Mar 2024
04 Mar 2024
Historique:
received:
23
01
2024
accepted:
27
02
2024
revised:
26
02
2024
pubmed:
5
3
2024
medline:
5
3
2024
entrez:
4
3
2024
Statut:
aheadofprint
Résumé
In the evolving landscape of medical research and radiology, effective communication of intricate ideas is imperative, with visualizations playing a crucial role. This study explores the transformative potential of ChatGPT4, a powerful Large Language Model (LLM), in automating the creation of schematics and figures for radiology research papers, specifically focusing on its implications for musculoskeletal studies. Deploying ChatGPT4, the study aimed to assess the model's ability to generate anatomical images of six large joints-shoulder, elbow, wrist, hip, knee, and ankle. Four variations of a text prompt were utilized, to generate a coronal illustration with annotations for each joint. Evaluation parameters included anatomical correctness, correctness of annotations, aesthetic nature of illustrations, usability of figures in research papers, and cost-effectiveness. Four panellists performed the assessment using a 5-point Likert Scale. Overall analysis of the 24 illustrations encompassing the six joints of interest (4 of each) revealed significant limitations in ChatGPT4's performance. The anatomical design ranged from poor to good, all of the illustrations received a below-average rating for annotation, with the majority assessed as poor. All of them ranked below average for usability in research papers. There was good agreement between raters across all domains (ICC = 0.61). While LLMs like ChatGPT4 present promising prospects for rapid figure generation, their current capabilities fall short of meeting the rigorous standards demanded by musculoskeletal radiology research. Future developments should focus on iterative refinement processes to enhance the realism of LLM-generated musculoskeletal schematics.
Identifiants
pubmed: 38438538
doi: 10.1007/s00256-024-04638-y
pii: 10.1007/s00256-024-04638-y
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Commentaires et corrections
Type : CommentIn
Informations de copyright
© 2024. The Author(s), under exclusive licence to International Skeletal Society (ISS).
Références
Divecha C, Tullu M, Karande S. Utilizing tables, figures, charts and graphs to enhance the readability of a research paper. Medknow; 2023. p. 125–31.
Singhal K, Azizi S, Tu T, Mahdavi SS, Wei J, Chung HW, et al. Large language models encode clinical knowledge. Nature. 2023;620(7972):172–80.
doi: 10.1038/s41586-023-06291-2
pubmed: 37438534
pmcid: 10396962
Clusmann J, Kolbinger FR, Muti HS, Carrero ZI, Eckardt JN, Laleh NG, et al. The future landscape of large language models in medicine. Commun Med (Lond). 2023;3(1):141.
doi: 10.1038/s43856-023-00370-1
pubmed: 37816837
Naveed H, Khan AU, Qiu S, Saqib M, Anwar S, Usman M, et al. A comprehensive overview of large language models. arXiv preprint arXiv:230706435.2023.
Ariyaratne S, Iyengar KP, Botchu R. Will collaborative publishing with ChatGPT drive academic writing in the future? Br J Surg. 2023;110(9):1213–4.
doi: 10.1093/bjs/znad198
pubmed: 37368994
Botchu R, Iyengar KP. Will ChatGPT drive radiology in the future? Indian J Radiol Imaging. 2023;33(4):436–7.
Sullivan GM, Artino AR Jr. Analyzing and interpreting data from Likert-type scales. J Grad Med Educ. 2013;5(4):541–2.
doi: 10.4300/JGME-5-4-18
pubmed: 24454995
pmcid: 3886444
Corl FM, Garland MR, Fishman EK. Role of computer technology in medical illustration. Am J Roentgenol. 2000;175(6):1519–24.
doi: 10.2214/ajr.175.6.1751519
Stokel-Walker C, Van Noorden R. What ChatGPT and generative AI mean for science. Nature. 2023;614(7947):214–6.
doi: 10.1038/d41586-023-00340-6
pubmed: 36747115
Sallam M. ChatGPT utility in healthcare education, research, and practice: systematic review on the promising perspectives and valid concerns. Healthcare 2023;11(6):887. MDPI.
Liebrenz M, Schleifer R, Buadze A, Bhugra D, Smith A. Generating scholarly content with ChatGPT: ethical challenges for medical publishing. Lancet Digital Health. 2023;5(3):e105–6.
doi: 10.1016/S2589-7500(23)00019-5
pubmed: 36754725
Li H, Moon JT, Purkayastha S, Celi LA, Trivedi H, Gichoya JW. Ethics of large language models in medicine and medical research. Lancet Digital Health. 2023;5(6):e333–5.
doi: 10.1016/S2589-7500(23)00083-3
pubmed: 37120418
https://www.theverge.com/2023/11/6/23948386/chatgpt-active-user-count-openai-developer-conference (accessed: 26th December 2023, at 6:00 PM).