Performance evaluation methods for improvements at post-market of artificial intelligence/machine learning-based computer-aided detection/diagnosis/triage in the United States.
Journal
PLOS digital health
ISSN: 2767-3170
Titre abrégé: PLOS Digit Health
Pays: United States
ID NLM: 9918335064206676
Informations de publication
Date de publication:
Mar 2023
Mar 2023
Historique:
received:
23
09
2022
accepted:
07
02
2023
entrez:
8
3
2023
pubmed:
9
3
2023
medline:
9
3
2023
Statut:
epublish
Résumé
Computer-aided detection (CADe), computer-aided diagnosis (CADx), and computer-aided simple triage (CAST), which incorporate artificial intelligence (AI) and machine learning (ML), are continually undergoing post-market improvement. Therefore, understanding the evaluation and approval process of improved products is important. This study intended to conduct a comprehensive survey of AI/ML-based CAD products approved by the U.S. Food and Drug Administration (FDA) that had been improved post-market to gain insights into the efficacy and safety required for market approval. A survey of the product code database published by the FDA identified eight products that were improved post-market. The methods used to evaluate the performance of improvements were analysed, and post-market improvements were approved with retrospective data. Reader study testing (RT) or software standalone testing (SA) procedures were conducted retrospectively. Six RT procedures were conducted because of modifications to the intended use. An average of 17.3 readers (minimum 14, maximum 24) participated, and the area under the curve (AUC) was considered the primary endpoint. The addition of study learning data that did not change the intended use and changes in the analysis algorithm were evaluated by SA. The average sensitivity, specificity, and AUC were 93% (minimum 91.1, maximum 97), 89.6% (minimum 85.9, maximum 96), and 0.96 (minimum 0.96, maximum 0.97), respectively. The average interval between applications was 348 days (minimum -18, maximum 975), which showed that the improvements were implemented within approximately one year. This is the first comprehensive study on AI/ML-based CAD products that have been improved post-market to elucidate evaluation points for post-market improvements. The findings will be informative for the industry and academia in developing and improving AI/ML-based CAD.
Identifiants
pubmed: 36888573
doi: 10.1371/journal.pdig.0000209
pii: PDIG-D-22-00275
pmc: PMC9994700
doi:
Types de publication
Journal Article
Langues
eng
Pagination
e0000209Informations de copyright
Copyright: © 2023 Yuba, Iwasaki. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Déclaration de conflit d'intérêts
The authors have declared that no competing interests exist.
Références
NPJ Digit Med. 2020 Sep 11;3:118
pubmed: 32984550
Radiol Artif Intell. 2019 Jul 31;1(4):e180096
pubmed: 32076660
Eur Radiol. 2019 Sep;29(9):4825-4832
pubmed: 30993432
Lancet Digit Health. 2021 Mar;3(3):e195-e203
pubmed: 33478929
Gastroenterology. 2020 Aug;159(2):512-520.e7
pubmed: 32371116
Sci Rep. 2022 Oct 7;12(1):16874
pubmed: 36207474
Eur Radiol. 2021 Nov;31(11):8682-8691
pubmed: 33948701
Eur Radiol. 2021 Mar;31(3):1687-1692
pubmed: 32876835
J Natl Cancer Inst. 2019 Sep 1;111(9):916-922
pubmed: 30834436
Clin Imaging. 2022 Sep;89:61-67
pubmed: 35716432
Radiology. 2019 Feb;290(2):305-314
pubmed: 30457482
Radiology. 2022 Mar;302(3):535-542
pubmed: 34904872
Breast Cancer. 2020 Jul;27(4):642-651
pubmed: 32052311