Pseudoprospective Paraclinical Interaction of Radiology Residents With a Deep Learning System for Prostate Cancer Detection: Experience, Performance, and Identification of the Need for Intermittent Recalibration.
Journal
Investigative radiology
ISSN: 1536-0210
Titre abrégé: Invest Radiol
Pays: United States
ID NLM: 0045377
Informations de publication
Date de publication:
01 Sep 2022
01 Sep 2022
Historique:
pubmed:
26
4
2022
medline:
9
8
2022
entrez:
25
4
2022
Statut:
ppublish
Résumé
The aim of this study was to estimate the prospective utility of a previously retrospectively validated convolutional neural network (CNN) for prostate cancer (PC) detection on prostate magnetic resonance imaging (MRI). The biparametric (T2-weighted and diffusion-weighted) portion of clinical multiparametric prostate MRI from consecutive men included between November 2019 and September 2020 was fully automatically and individually analyzed by a CNN briefly after image acquisition (pseudoprospective design). Radiology residents performed 2 research Prostate Imaging Reporting and Data System (PI-RADS) assessments of the multiparametric dataset independent from clinical reporting (paraclinical design) before and after review of the CNN results and completed a survey. Presence of clinically significant PC was determined by the presence of an International Society of Urological Pathology grade 2 or higher PC on combined targeted and extended systematic transperineal MRI/transrectal ultrasound fusion biopsy. Sensitivities and specificities on a patient and prostate sextant basis were compared using the McNemar test and compared with the receiver operating characteristic (ROC) curve of CNN. Survey results were summarized as absolute counts and percentages. A total of 201 men were included. The CNN achieved an ROC area under the curve of 0.77 on a patient basis. Using PI-RADS ≥3-emulating probability threshold (c3), CNN had a patient-based sensitivity of 81.8% and specificity of 54.8%, not statistically different from the current clinical routine PI-RADS ≥4 assessment at 90.9% and 54.8%, respectively ( P = 0.30/ P = 1.0). In general, residents achieved similar sensitivity and specificity before and after CNN review. On a prostate sextant basis, clinical assessment possessed the highest ROC area under the curve of 0.82, higher than CNN (AUC = 0.76, P = 0.21) and significantly higher than resident performance before and after CNN review (AUC = 0.76 / 0.76, P ≤ 0.03). The resident survey indicated CNN to be helpful and clinically useful. Pseudoprospective paraclinical integration of fully automated CNN-based detection of suspicious lesions on prostate multiparametric MRI was demonstrated and showed good acceptance among residents, whereas no significant improvement in resident performance was found. General CNN performance was preserved despite an observed shift in CNN calibration, identifying the requirement for continuous quality control and recalibration.
Identifiants
pubmed: 35467572
doi: 10.1097/RLI.0000000000000878
pii: 00004424-202209000-00006
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
601-612Informations de copyright
Copyright © 2022 Wolters Kluwer Health, Inc. All rights reserved.
Déclaration de conflit d'intérêts
Conflicts of interest and sources of funding: This research received research support from the Bundesministerium für Wirtschaft und Klimaschutz (BMWK): 01MT21004B. M.K. reported receiving funding from the Bundesministerium für Bildung und Forschung. A.S. declares being part of advisory board/speaker's bureau of AstraZeneca, Bayer, Bristol Myers Squibb, Eli Lilly, Illumina, Janssen, MSD, Novartis, Pfizer, Roche, Seattle Genetics, Thermo Fisher Scientific; and declares receiving grants from Bayer, Bristol Myers Squibb, and Chugai. H.-P.S. declares receiving consulting fee or honorarium from Siemens, Curagita, Profound, and Bayer; declares receiving travel support from Siemens, Curagita, Profound, and Bayer; is a board member of Curagita; provides consultancy for Curagita and Bayer; declares receiving grants/grants pending from BMBF, Deutsche Krebshilfe, Dietmar Hopp Stiftung, and Roland Ernst Stiftung; and declares receiving payment for lectures from Siemens, Curagita, Profound, and Bayer. D.B. reported receiving payment for lectures from Bayer Vital in the past. For the remaining authors none were declared.
Références
Johnson LM, Turkbey B, Figg WD, et al. Multiparametric MRI in prostate cancer management. Nat Rev Clin Oncol . 2014;11:346–353.
Siegel R, Naishadham D, Jemal A. Cancer statistics, 2013. CA Cancer J Clin . 2013;63:11–30.
Kasivisvanathan V, Rannikko AS, Borghi M, et al. MRI-targeted or standard biopsy for prostate-cancer diagnosis. N Engl J Med . 2018;378:1767–1777.
Ahdoot M, Wilbur AR, Reese SE, et al. MRI-targeted, systematic, and combined biopsy for prostate cancer diagnosis. N Engl J Med . 2020;382:917–928.
Ahmed HU, El-Shater Bosaily A, Brown LC, et al. Diagnostic accuracy of multi-parametric MRI and TRUS biopsy in prostate cancer (PROMIS): a paired validating confirmatory study. Lancet . 2017;389:815–822.
Elkhoury FF, Felker ER, Kwan L, et al. Comparison of targeted vs systematic prostate biopsy in men who are biopsy naive: the Prospective Assessment of Image Registration in the Diagnosis of Prostate Cancer (PAIREDCAP) Study. JAMA Surg . 2019;154:811–818.
Schelb P, Kohl S, Radtke JP, et al. Classification of Cancer at Prostate MRI: Deep Learning versus Clinical PI-RADS Assessment. Radiology . 2019;293:607–617.
Yoo S, Gujrathi I, Haider MA, et al. Prostate cancer detection using deep convolutional neural networks. Sci Rep . 2019;9:19518.
Hiremath A, Shiradkar R, Merisaari H, et al. Test-retest repeatability of a deep learning architecture in detecting and segmenting clinically significant prostate cancer on apparent diffusion coefficient (ADC) maps. Eur Radiol . 2021;31:379–391.
Winkel DJ, Wetterauer C, Matthias MO, et al. Autonomous detection and classification of PI-RADS lesions in an MRI screening population incorporating multicenter-labeled deep learning and biparametric imaging: proof of concept. Diagnostics (Basel) . 2020;10:951.
Winkel DJ, Tong A, Lou B, et al. A novel deep learning based computer-aided diagnosis system improves the accuracy and efficiency of radiologists in reading biparametric magnetic resonance images of the prostate: results of a multireader, multicase study. Invest Radiol . 2021;56:605–613.
Castillo T. JM, Arif M, Niessen WJ, et al. Automated classification of significant prostate cancer on MRI: a systematic review on the performance of machine learning applications. Cancers (Basel) . 2020;12:1606.
Egevad L, Delahunt B, Srigley JR, et al. International Society of Urological Pathology (ISUP) grading of prostate cancer—an ISUP consensus on contemporary grading. APMIS . 2016;124:433–435.
Turkbey B, Rosenkrantz AB, Haider MA, et al. Prostate Imaging Reporting and Data System version 2.1: 2019 Update of Prostate Imaging Reporting and Data System version 2. Eur Urol . 2019;76:340–351.
Ronneberger O, Fischer P, Brox T. U-Net: Convolutional Networks for Biomedical Image Segmentation . Cham, Switzerland: Springer International Publishing; 2015:234–241.
Schelb P, Wang X, Radtke JP, et al. Simulated clinical deployment of fully automatic deep learning for clinical prostate MRI assessment. Eur Radiol . 2021;31:302–313.
Kuru TH, Wadhwa K, Chang RT, et al. Definitions of terms, processes and a minimum dataset for transperineal prostate biopsies: a standardization approach of the Ginsburg Study Group for Enhanced Prostate Diagnostics. BJU Int . 2013;112:568–577.
Bonekamp D, Schelb P, Wiesenfarth M, et al. Histopathological to multiparametric MRI spatial mapping of extended systematic sextant and MR/TRUS-fusion-targeted biopsy of the prostate. Eur Radiol . 2019;29:1820–1830.
McNemar Q. Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika . 1947;12:153–157.
Durkalski VL, Palesch YY, Lipsitz SR, et al. Analysis of clustered matched-pair data. Stat Med . 2003;22:2417–2428.
DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics . 1988;44:837–845.
Obuchowski NA. Nonparametric analysis of clustered ROC curve data. Biometrics . 1997;53:567–578.
Holm S. A simple sequentially rejective multiple test procedure. Scand J Stat . 1979;6:65–70.
Bonekamp D, Kohl S, Wiesenfarth M, et al. Radiomic Machine Learning for Characterization of Prostate Lesions with MRI: Comparison to ADC Values. Radiology . 2018;289:128–137.
Makowski MR, Bressem KK, Franz L, et al. De novo radiomics approach using image augmentation and features from T1 mapping to predict Gleason scores in prostate cancer. Invest Radiol . 2021;56:661–668.
Fehr D, Veeraraghavan H, Wibmer A, et al. Automatic classification of prostate cancer Gleason scores from multiparametric magnetic resonance images. Proc Natl Acad Sci U S A . 2015;112:E6265–E6273.
Mai J, Abubrig M, Lehmann T, et al. T2 mapping in prostate cancer. Invest Radiol . 2019;54:146–152.
Panda A, Obmann VC, Lo WC, et al. MR fingerprinting and ADC mapping for characterization of lesions in the transition zone of the prostate gland. Radiology . 2019;292:685–694.
Panda A, O'Connor G, Lo WC, et al. Targeted biopsy validation of peripheral zone prostate cancer characterization with magnetic resonance fingerprinting and diffusion mapping. Invest Radiol . 2019;54:485–493.
Zhang KS, Schelb P, Kohl S, et al. Improvement of PI-RADS-dependent prostate cancer classification by quantitative image assessment using radiomics or mean ADC. Magn Reson Imaging . 2021;82:9–17.
Langbein BJ, Szczepankiewicz F, Westin CF, et al. A pilot study of multidimensional diffusion MRI for assessment of tissue heterogeneity in prostate cancer. Invest Radiol . 2021;56:845–853.
Tavakoli AA, Kuder TA, Tichy D, et al. Measured Multipoint Ultra-High b-Value Diffusion MRI in the Assessment of MRI-Detected Prostate Lesions. Invest Radiol . 2021;56:94–102.
Breit HC, Block TK, Winkel DJ, et al. Revisiting DCE-MRI: classification of prostate tissue using descriptive signal enhancement features derived from DCE-MRI acquisition with high spatiotemporal resolution. Invest Radiol . 2021;56:553–562.
Hagiwara A, Fujita S, Ohno Y, et al. Variability and standardization of quantitative imaging: monoparametric to multiparametric quantification, radiomics, and artificial intelligence. Invest Radiol . 2020;55:601–616.
Zhong X, Cao R, Shakeri S, et al. Deep transfer learning–based prostate cancer classification using 3 Tesla multi-parametric MRI. Abdom Radiol (NY) . 2019;44:2030–2039.
Yuan Y, Qin W, Buyyounouski M, et al. Prostate cancer classification with multiparametric MRI transfer learning model. Med Phys . 2019;46:756–765.
Aldoj N, Lukas S, Dewey M, et al. Semi-automatic classification of prostate cancer on multi-parametric MR imaging using a multi-channel 3D convolutional neural network. Eur Radiol . 2020;30:1243–1253.
Sanford T, Harmon SA, Turkbey EB, et al. Deep-learning-based artificial intelligence for PI-RADS classification to assist multiparametric prostate MRI interpretation: a development study. J Magn Reson Imaging . 2020;52:1499–1507.
Wildeboer RR, van Sloun RJG, Wijkstra H, et al. Artificial intelligence in multiparametric prostate cancer imaging with focus on deep-learning methods. Comput Methods Programs Biomed . 2020;189:105316.
Li D, Han X, Gao J, et al. Deep learning in prostate cancer diagnosis using multiparametric magnetic resonance imaging with whole-mount histopathology referenced delineations. Front Med (Lausanne) . 2021;8:810995.
Bhattacharya I, Seetharaman A, Kunder C, et al. Selective identification and localization of indolent and aggressive prostate cancers via CorrSigNIA: an MRI-pathology correlation and deep learning framework. Med Image Anal . 2022;75:102288.
Cao R, Zhong X, Afshari S, et al. Performance of deep learning and genitourinary radiologists in detection of prostate cancer using 3-T multiparametric magnetic resonance imaging. J Magn Reson Imaging . 2021;54:474–483.
Arif M, Schoots IG, Castillo Tovar J, et al. Clinically significant prostate cancer detection and segmentation in low-risk patients using a convolutional neural network on multi-parametric MRI. Eur Radiol . 2020;30:6582–6592.
Netzer N, Weisser C, Schelb P, et al. Fully Automatic Deep Learning in Bi-institutional Prostate Magnetic Resonance Imaging: Effects of Cohort Size and Heterogeneity. Invest Radiol . 2021.
Davis SE, Lasko TA, Chen G, et al. Calibration drift in regression and machine learning models for acute kidney injury. J Am Med Inform Assoc . 2017;24:1052–1061.
Cuocolo R, Cipullo MB, Stanzione A, et al. Machine learning for the identification of clinically significant prostate cancer on MRI: a meta-analysis. Eur Radiol . 2020;30:6877–6887.
Gaur S, Lay N, Harmon SA, et al. Can computer-aided diagnosis assist in the identification of prostate cancer on prostate MRI? a multi-center, multi-reader investigation. Oncotarget . 2018;9:33804–33817.
Radtke JP, Schwab C, Wolf MB, et al. Multiparametric magnetic resonance imaging (MRI) and MRI-transrectal ultrasound fusion biopsy for index tumor detection: correlation with radical prostatectomy specimen. Eur Urol . 2016;70:846–853.