Clinical application of radiological AI for pulmonary nodule evaluation: Replicability and susceptibility to the population shift caused by the COVID-19 pandemic.

Humans Pandemics COVID-19 / diagnostic imaging Radiography Radiology Tomography, X-Ray Computed

Artificial Intelligence COVID-19 Fine-tuning Lung Cancer Medical Image Analysis Replicability of medical machine learning

Journal

International journal of medical informatics

ISSN: 1872-8243

Titre abrégé: Int J Med Inform

Pays: Ireland

ID NLM: 9711057

Informations de publication

Date de publication:
10 2023

Historique:

received: 19 03 2023

revised: 04 08 2023

accepted: 07 08 2023

medline: 23 10 2023

pubmed: 22 8 2023

entrez: 21 8 2023

Statut: ppublish

Résumé

replicability and generalizability of medical AI are the recognized challenges that hinder a broad AI deployment in clinical practice. Pulmonary nodes detection and characterization based on chest CT images is one of the demanded use cases for automatization by means of AI, and multiple AI solutions addressing this task are becoming available. Here, we evaluated and compared the performance of several commercially available radiological AI with the same clinical task on the same external datasets acquired before and during the pandemic of COVID-19. 5 commercially available AI models for pulmonary nodule detection were tested on two external datasets labelled by experts according to the intended clinical task. Dataset1 was acquired before the pandemic and did not contain radiological signs of COVID-19; dataset2 was collected during the pandemic and did contain radiological signs of COVID-19. ROC-analysis was applied separately for the dataset1 and dataset2 to select probability thresholds for each dataset separately. AUROC, sensitivity and specificity metrics were used to assess and compare the results of AI performance. Statistically significant differences in AUROC values were observed between the AI models for the dataset1. Whereas for the dataset2 the differences of AUROC values became statistically insignificant. Sensitivity and specificity differed statistically significantly between the AI models for the dataset1. This difference was insignificant for the dataset2 when we applied the probability threshold initially selected for the dataset1. An update of the probability threshold based on the dataset2 created statistically significant differences of sensitivity and specificity between AI models for the dataset2. For 3 out of 5 AI models, the update of the probability threshold was valuable to compensate for the degradation of AI model performances with the population shift caused by the pandemic. Population shift in the data is able to deteriorate differences of AI models performance. Update of the probability threshold together with the population shift seems to be valuable to preserve AI models performance without retraining them.

Identifiants

DOI: 10.1016/j.ijmedinf.2023.105190 PMID: 37603940

pubmed: 37603940

pii: S1386-5056(23)00208-3

doi: 10.1016/j.ijmedinf.2023.105190

pii:

doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

Pagination

105190

Informations de copyright

Déclaration de conflit d'intérêts

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Clinical application of radiological AI for pulmonary nodule evaluation: Replicability and susceptibility to the population shift caused by the COVID-19 pandemic.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Informations de copyright

Déclaration de conflit d'intérêts

Auteurs

Yuriy Vasilev (Y)

Anton Vladzymyrskyy (A)

Kirill Arzamasov (K)

Olga Omelyanskaya (O)

Igor Shulkin (I)

Darya Kozikhina (D)

Inna Goncharova (I)

Roman Reshetnikov (R)

Sergey Chetverikov (S)

Ivan Blokhin (I)

Tatiana Bobrovskaya (T)

Anna Andreychenko (A)

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Smoking Cessation and Incident Cardiovascular Disease.

Evaluation of Low-Value Services Across Major Medicare Advantage Insurers and Traditional Medicare.

Effectiveness of Virtual Yoga for Chronic Low Back Pain: A Randomized Clinical Trial.

Classifications MeSH