AI-Based CXR First Reading: Current Limitations to Ensure Practical Value.
AI for chest X-ray first reading
external validation
local test set
prospective validation
Journal
Diagnostics (Basel, Switzerland)
ISSN: 2075-4418
Titre abrégé: Diagnostics (Basel)
Pays: Switzerland
ID NLM: 101658402
Informations de publication
Date de publication:
16 Apr 2023
16 Apr 2023
Historique:
received:
18
03
2023
revised:
04
04
2023
accepted:
13
04
2023
medline:
16
5
2023
pubmed:
16
5
2023
entrez:
16
5
2023
Statut:
epublish
Résumé
We performed a multicenter external evaluation of the practical and clinical efficacy of a commercial AI algorithm for chest X-ray (CXR) analysis (Lunit INSIGHT CXR). A retrospective evaluation was performed with a multi-reader study. For a prospective evaluation, the AI model was run on CXR studies; the results were compared to the reports of 226 radiologists. In the multi-reader study, the area under the curve (AUC), sensitivity, and specificity of the AI were 0.94 (CI95%: 0.87-1.0), 0.9 (CI95%: 0.79-1.0), and 0.89 (CI95%: 0.79-0.98); the AUC, sensitivity, and specificity of the radiologists were 0.97 (CI95%: 0.94-1.0), 0.9 (CI95%: 0.79-1.0), and 0.95 (CI95%: 0.89-1.0). In most regions of the ROC curve, the AI performed a little worse or at the same level as an average human reader. The McNemar test showed no statistically significant differences between AI and radiologists. In the prospective study with 4752 cases, the AUC, sensitivity, and specificity of the AI were 0.84 (CI95%: 0.82-0.86), 0.77 (CI95%: 0.73-0.80), and 0.81 (CI95%: 0.80-0.82). Lower accuracy values obtained during the prospective validation were mainly associated with false-positive findings considered by experts to be clinically insignificant and the false-negative omission of human-reported "opacity", "nodule", and calcification. In a large-scale prospective validation of the commercial AI algorithm in clinical practice, lower sensitivity and specificity values were obtained compared to the prior retrospective evaluation of the data of the same population.
Identifiants
pubmed: 37189531
pii: diagnostics13081430
doi: 10.3390/diagnostics13081430
pmc: PMC10138086
pii:
doi:
Types de publication
Journal Article
Langues
eng
Subventions
Organisme : "Moscow Center for Innovative Technologies in Healthcare"
ID : 122112400040-1
Références
Sci Rep. 2021 Dec 13;11(1):23895
pubmed: 34903808
Insights Imaging. 2018 Jun;9(3):337-341
pubmed: 29777451
J Digit Imaging. 2019 Aug;32(4):625-637
pubmed: 31011956
Cancer. 1950 Jan;3(1):32-5
pubmed: 15405679
Insights Imaging. 2021 Feb 10;12(1):13
pubmed: 33564955
Biom J. 2008 Jun;50(3):419-30
pubmed: 18435502
JAMA Netw Open. 2019 Mar 1;2(3):e191095
pubmed: 30901052
Eur Respir J. 2021 May 20;57(5):
pubmed: 33243843
PLoS One. 2019 Sep 3;14(9):e0221339
pubmed: 31479448
JMIR Res Protoc. 2022 Nov 16;11(11):e39536
pubmed: 36383419
Nat Med. 2020 Jun;26(6):807-808
pubmed: 32514173
Radiol Cardiothorac Imaging. 2020 Dec 10;2(6):e190222
pubmed: 33778635
JAMA Netw Open. 2022 Aug 1;5(8):e2229289
pubmed: 36044215
Eur Respir J. 2021 Sep 9;58(3):
pubmed: 33574070
Biometrics. 1977 Mar;33(1):159-74
pubmed: 843571
Diagnostics (Basel). 2023 Feb 15;13(4):
pubmed: 36832231
Sci Rep. 2021 Jan 13;11(1):858
pubmed: 33441578
JAMA Netw Open. 2020 Sep 1;3(9):e2017135
pubmed: 32970157
J Med Imaging Radiat Sci. 2021 Mar;52(1):112-120
pubmed: 33082122
Stat Med. 1991 Dec;10(12):1981-91
pubmed: 1805322
J Am Coll Radiol. 2021 Mar;18(3 Pt A):413-424
pubmed: 33096088
Diagnostics (Basel). 2022 Nov 07;12(11):
pubmed: 36359565
Radiology. 2019 Apr;291(1):196-202
pubmed: 30667333
Eur Radiol. 2016 Sep;26(9):3262-71
pubmed: 26747264
JAMA Netw Open. 2020 Oct 1;3(10):e2022779
pubmed: 33034642
Radiology. 2008 Mar;246(3):697-722
pubmed: 18195376
Diagnostics (Basel). 2023 Jan 06;13(2):
pubmed: 36673027
Radiology. 2019 Aug;292(2):331-342
pubmed: 31210611
Can Assoc Radiol J. 2021 Feb;72(1):60-72
pubmed: 32757950
Nat Med. 2020 Sep;26(9):1364-1374
pubmed: 32908283