Artificial Intelligence for Detecting Acute Fractures in Patients Admitted to an Emergency Department: Real-Life Performance of Three Commercial Algorithms.
Artificial intelligence
Deep learning
Fracture
Medical imaging
Musculoskeletal imaging
Journal
Academic radiology
ISSN: 1878-4046
Titre abrégé: Acad Radiol
Pays: United States
ID NLM: 9440159
Informations de publication
Date de publication:
10 2023
10 2023
Historique:
received:
11
04
2023
revised:
08
06
2023
accepted:
20
06
2023
medline:
25
9
2023
pubmed:
20
7
2023
entrez:
19
7
2023
Statut:
ppublish
Résumé
Interpreting radiographs in emergency settings is stressful and a burden for radiologists. The main objective was to assess the performance of three commercially available artificial intelligence (AI) algorithms for detecting acute peripheral fractures on radiographs in daily emergency practice. Radiographs were collected from consecutive patients admitted for skeletal trauma at our emergency department over a period of 2 months. Three AI algorithms-SmartUrgence, Rayvolve, and BoneView-were used to analyze 13 body regions. Four musculoskeletal radiologists determined the ground truth from radiographs. The diagnostic performance of the three AI algorithms was calculated at the level of the radiography set. Accuracies, sensitivities, and specificities for each algorithm and two-by-two comparisons between algorithms were obtained. Analyses were performed for the whole population and for subgroups of interest (sex, age, body region). A total of 1210 patients were included (mean age 41.3 ± 18.5 years; 742 [61.3%] men), corresponding to 1500 radiography sets. The fracture prevalence among the radiography sets was 23.7% (356/1500). Accuracy was 90.1%, 71.0%, and 88.8% for SmartUrgence, Rayvolve, and BoneView, respectively; sensitivity 90.2%, 92.6%, and 91.3%, with specificity 92.5%, 70.4%, and 90.5%. Accuracy and specificity were significantly higher for SmartUrgence and BoneView than Rayvolve for the whole population (P < .0001) and for subgroups. The three algorithms did not differ in sensitivity (P = .27). For SmartUrgence, subgroups did not significantly differ in accuracy, specificity, or sensitivity. For Rayvolve, accuracy and specificity were significantly higher with age 27-36 than ≥53 years (P = .0029 and P = .0019). Specificity was higher for the subgroup knee than foot (P = .0149). For BoneView, accuracy was significantly higher for the subgroups knee than foot (P = .0006) and knee than wrist/hand (P = .0228). Specificity was significantly higher for the subgroups knee than foot (P = .0003) and ankle than foot (P = .0195). The performance of AI detection of acute peripheral fractures in daily radiological practice in an emergency department was good to high and was related to the AI algorithm, patient age, and body region examined.
Identifiants
pubmed: 37468377
pii: S1076-6332(23)00311-2
doi: 10.1016/j.acra.2023.06.016
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
2118-2139Informations de copyright
Copyright © 2023 The Association of University Radiologists. Published by Elsevier Inc. All rights reserved.
Déclaration de conflit d'intérêts
Declaration of Competing Interest Valérie Bousson is a paid consultant for Milvue starting July 1, 2022. Grégoire Attané was paid by Gleamer between March and August 2019 to label radiographs. The remaining authors declare none. The three companies, Milvue, AZmed, and Gleamer, provided the AI algorithms to our radiology department for free. None of the three companies had access to the study methodology or data during the course of the study.