Are deep learning classification results obtained on CT scans fair and interpretable?

Chest CT DNNs Interpretability and reliability Malignancy classification

Journal

Physical and engineering sciences in medicine
ISSN: 2662-4737
Titre abrégé: Phys Eng Sci Med
Pays: Switzerland
ID NLM: 101760671

Informations de publication

Date de publication:
04 Apr 2024
Historique:
received: 18 11 2023
accepted: 12 03 2024
medline: 4 4 2024
pubmed: 4 4 2024
entrez: 4 4 2024
Statut: aheadofprint

Résumé

Following the great success of various deep learning methods in image and object classification, the biomedical image processing society is also overwhelmed with their applications to various automatic diagnosis cases. Unfortunately, most of the deep learning-based classification attempts in the literature solely focus on the aim of extreme accuracy scores, without considering interpretability, or patient-wise separation of training and test data. For example, most lung nodule classification papers using deep learning randomly shuffle data and split it into training, validation, and test sets, causing certain images from the Computed Tomography (CT) scan of a person to be in the training set, while other images of the same person to be in the validation or testing image sets. This can result in reporting misleading accuracy rates and the learning of irrelevant features, ultimately reducing the real-life usability of these models. When the deep neural networks trained on the traditional, unfair data shuffling method are challenged with new patient images, it is observed that the trained models perform poorly. In contrast, deep neural networks trained with strict patient-level separation maintain their accuracy rates even when new patient images are tested. Heat map visualizations of the activations of the deep neural networks trained with strict patient-level separation indicate a higher degree of focus on the relevant nodules. We argue that the research question posed in the title has a positive answer only if the deep neural networks are trained with images of patients that are strictly isolated from the validation and testing patient sets.

Identifiants

pubmed: 38573489
doi: 10.1007/s13246-024-01419-8
pii: 10.1007/s13246-024-01419-8
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© 2024. The Author(s).

Références

Abbas Q (2017) Nodular-deep: classification of pulmonary nodules using deep neural network. Int J Med Res Heal Sci 6(8):111–118
Aggarwal T., Furqan A., Kalra K (2015) Feature extraction and lda based classification of lung nodules in chest ct scan images. In: 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pages 1189–1193. IEEE
Agushaka JO, Ezugwu AE, Abualigah L (2022) Dwarf mongoose optimization algorithm. Comput Methods Appl Mech Eng 391:114570
doi: 10.1016/j.cma.2022.114570
Agushaka JO, Ezugwu AE, Abualigah L (2023) Gazelle optimization algorithm: a novel nature-inspired metaheuristic optimizer. Neural Comput Appl 35(5):4099–4131
doi: 10.1007/s00521-022-07854-6
Armato SG III, McLennan G, Bidaut L, McNitt-Gray MF, Meyer CR, Reeves AP, Zhao B, Aberle DR, Henschke CI, Hoffman EA et al (2011) The lung image database consortium (lidc) and image database resource initiative (idri): a completed reference database of lung nodules on ct scans. Med Phys 38(2):915–931
doi: 10.1118/1.3528204 pubmed: 21452728 pmcid: 3041807
Bisong E (2019) Google Colaboratory. Apress, Berkeley, pp 59–64
Boscolo Galazzo I, Cruciani F, Brusini L, Salih A, Radeva P, Storti SF, Menegaz G (2022) Explainable artificial intelligence for magnetic resonance imaging aging brainprints: Grounds and challenges. IEEE Signal Process Magaz 39(2):99–116
doi: 10.1109/MSP.2021.3126573
Ezugwu AE, Agushaka JO, Abualigah L, Mirjalili S, Gandomi AH (2022) Prairie dog optimization algorithm. Neural Comput Appl 34(22):20017–20065
doi: 10.1007/s00521-022-07530-9
Ghasemi M, Zare M, Zahedi A, Akbari M-A, Mirjalili S, Abualigah L (2023) Geyser inspired algorithm: a new geological-inspired meta-heuristic for real-parameter and constrained engineering optimization. J Bionic Eng 21(1):374–408
doi: 10.1007/s42235-023-00437-8
Goodfellow I, Bengio Y (2016) Courville A Deep learning. MIT press, Cambridge
Gruetzemacher R, Gupta A (2016) Using deep learning for pulmonary nodule detection & diagnosis
Gu Y, Chi J, Liu J, Yang L, Zhang B, Yu D, Zhao Y, Lu X (2021) A survey of computer-aided diagnosis of lung nodules from ct scans using deep learning. Comput Biol Med 137:104806
doi: 10.1016/j.compbiomed.2021.104806 pubmed: 34461501
Gürsoy Çoruh A, Yenigün B, Uzun Ç, Kahya Y, Büyükceran EU, Elhan A, Orhan K, Kayı Cangır A (2021) A comparison of the fusion model of deep learning neural networks with human observation for lung nodule detection and classification. British J Radiol 94(1123):20210222
doi: 10.1259/bjr.20210222
Hu G, Guo Y, Wei G, Abualigah L (2023) Genghis khan shark optimizer: a novel nature-inspired algorithm for engineering optimization. Adv Eng Inform 58:102210
doi: 10.1016/j.aei.2023.102210
Hussein S, Gillies R, Cao K, Song Q, Bagci U (2017) Tumornet: Lung nodule characterization using multi-view convolutional neural network with gaussian process. In 2017 IEEE 14th international symposium on biomedical imaging (ISBI 2017), pages 1007–1010. IEEE
Kohoutová L, Heo J, Cha S, Lee S, Moon T, Wager TD, Woo C-W (2020) Toward a unified framework for interpreting machine-learning models in neuroimaging. Nat Protocols 15(4):1399–1435
doi: 10.1038/s41596-019-0289-5 pubmed: 32203486
Kumar D, Wong A, Clausi DA (2015) Lung nodule classification using deep features in ct images. In 2015 12th conference on computer and robot vision, pages 133–138. IEEE
Lakshmanaprabu S, Mohanty SN, Shankar K, Arunkumar N, Ramirez G (2019) Optimal deep learning model for classification of lung cancer on ct images. Future Gen Comput Syst 92:374–382
doi: 10.1016/j.future.2018.10.009
Liao F, Liang M, Li Z, Hu X, Song S (2019) Evaluate the malignancy of pulmonary nodules using the 3-d deep leaky noisy-or network. IEEE Trans Neural Netw Learn Syst 30(11):3484–3495
doi: 10.1109/TNNLS.2019.2892409 pubmed: 30794190
Liu K, Kang G (2017) Multiview convolutional neural networks for lung nodule classification. Int J Imag Syst Technol 27(1):12–22
doi: 10.1002/ima.22206
Nibali A, He Z, Wollersheim D (2017) Pulmonary nodule classification with deep residual networks. Int J Comput Assist Radiol Surg 12:1799–1808
doi: 10.1007/s11548-017-1605-6 pubmed: 28501942
Pandit BR, Alsadoon A, Prasad P, Al Aloussi S, Rashid TA, Alsadoon OH, Jerew OD (2023) Deep learning neural network for lung cancer classification: enhanced optimization function. Multimed Tools Appl 82(5):6605–6624
doi: 10.1007/s11042-022-13566-9
Paul R, Hall L, Goldgof D, Schabath M, Gillies R (2018) Predicting nodule malignancy using a cnn ensemble approach. In 2018 international joint conference on neural networks (IJCNN), pages 1–8. IEEE
Potghan S, Rajamenakshi R, Bhise A (2018) Multi-layer perceptron based lung tumor classification. In 2018 Second International Conference on Electronics, Communication and Aerospace Technology (ICECA), pages 499–502. IEEE
Ratti E, Graves M (2022) Explainable machine learning practices: opening another black box for reliable medical ai. AI Ethics 2, page 801-814
Sahu P, Yu D, Dasari M, Hou F, Qin H (2018) A lightweight multi-section cnn for lung nodule classification and malignancy estimation. IEEE J Biomed Health Inform 23(3):960–968
doi: 10.1109/JBHI.2018.2879834 pubmed: 30418891
Sandler M, Howard A, Zhu M, Zhmoginov A, Chen L-C (2018) Mobilenetv2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4510–4520
Sedgwick P (2012) Pearson’s correlation coefficient. Bmj, 345
Selvaraju RR, Das A, Vedantam R, Cogswell M, Parikh D, Batra D (2017) Grad-cam: Why did you say that?
Shaffie A, Soliman A, Khalifeh HA, Taher F, Ghazal M, Dunlap N, Elmaghraby A, Keynton R, El-Baz A (2019) A novel ct-based descriptors for precise diagnosis of pulmonary nodules. In: 2019 IEEE International conference on image processing (ICIP), pages 1400–1404. IEEE
Shafi I, Din S, Khan A, Díez IDLT, Casanova RdJP, Pifarre KT, Ashraf I (2022) An effective method for lung cancer diagnosis from ct scan using deep learning-based support vector network. Cancers 14(21):5457
doi: 10.3390/cancers14215457 pubmed: 36358875 pmcid: 9657078
Shah AA, Malik HAM, Muhammad A, Alourani A, Butt ZA (2023) Deep learning ensemble 2d cnn approach towards the detection of lung cancer. Sci Rep 13(1):2987
doi: 10.1038/s41598-023-29656-z pubmed: 36807576 pmcid: 9941084
Shen W, Zhou M, Yang F, Yu D, Dong D, Yang C, Zang Y, Tian J (2017) Multi-crop convolutional neural networks for lung nodule malignancy suspiciousness classification. Pattern Recognit 61:663–673
doi: 10.1016/j.patcog.2016.05.029
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556
Song Q, Zhao L, Luo X, Dou X et al (2017) Using deep learning for classification of lung nodules on computed tomography images. J Healthc Eng. https://doi.org/10.1155/2017/8314740
doi: 10.1155/2017/8314740 pubmed: 29077370 pmcid: 5592388
Tan M, Le Q (2019) Efficientnet: Rethinking model scaling for convolutional neural networks. In International conference on machine learning, pages 6105–6114. PMLR
Teramoto A, Tsukamoto T, Kiriyama Y, Fujita H et al (2017) Automated classification of lung cancer types from cytological images using deep convolutional neural networks. BioMed research international, 2017
Toğaçar M, Ergen B, Cömert Z (2020) Detection of lung cancer on chest ct images using minimum redundancy maximum relevance feature selection method with convolutional neural networks. Biocybern Biomed Eng 40(1):23–39
doi: 10.1016/j.bbe.2019.11.004
Tran GS, Nghiem TP, Nguyen VT, Luong CM, Burie J-C et al (2019) Improving accuracy of lung nodule classification using deep learning with focal loss. Journal of healthcare engineering, 2019
Utkin L, Meldo A, Kovalev M, Kasimov E (2019) An ensemble of triplet neural networks for differential diagnostics of lung cancer. In 2019 25th Conference of Open Innovations Association (FRUCT), pages 346–352. IEEE
Wang L (2022) Deep learning techniques to diagnose lung cancer. Cancers 14(22):5569
doi: 10.3390/cancers14225569 pubmed: 36428662 pmcid: 9688236
Wu B, Zhou Z, Wang J, Wang Y (2018) Joint learning for pulmonary nodule segmentation, attributes and malignancy prediction. In:: 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018), pages 1109–1113. IEEE
Xie Y, Xia Y, Zhang J, Feng DD, Fulham M, Cai W (2017) Transferable multi-model ensemble for benign-malignant lung nodule classification on chest ct. In Medical Image Computing and Computer Assisted Intervention- MICCAI 2017: 20th International Conference, Quebec City, QC, Canada, September 11-13, 2017, Proceedings, Part III 20, pages 656–664. Springer
Xie Y, Xia Y, Zhang J, Song Y, Feng D, Fulham M, Cai W (2018) Knowledge-based collaborative deep learning for benign-malignant lung nodule classification on chest ct. IEEE Trans Med Imag 38(4):991–1004
doi: 10.1109/TMI.2018.2876510
Xie Y, Zhang J, Xia Y, Fulham M, Zhang Y (2018) Fusing texture, shape and deep model-learned information at decision level for automated classification of lung nodules on chest ct. Inform Fusion 42:102–110
doi: 10.1016/j.inffus.2017.10.005
Xu Q-S, Liang Y-Z (2001) Monte carlo cross validation. Chemom Intell Lab Syst 56(1):1–11
doi: 10.1016/S0169-7439(00)00122-2
Yetgin ÖE, Benligiray B, Gerek ÖN (2019) Power line recognition from aerial images with deep learning. IEEE Trans Aerosp Electron Syst 55(5):2241–2252
doi: 10.1109/TAES.2018.2883879
Zare M, Ghasemi M, Zahedi A, Golalipour K, Mohammadi SK, Mirjalili S, Abualigah L (2023) A global best-guided firefly algorithm for engineering problems. J Bionic Eng 20(5):2359–2388
doi: 10.1007/s42235-023-00386-2
Zhao D, Zhu D, Lu J, Luo Y, Zhang G (2018) Synthetic medical images using f &bgan for improved lung nodules classification by multi-scale vgg16. Symmetry 10(10):519
doi: 10.3390/sym10100519

Auteurs

Mohamad M A Ashames (MMA)

Department of Electrical and Electronics Engineering, Eskisehir Osmangazi University, Eskisehir, Turkey.

Ahmet Demir (A)

Department of Electrical and Electronics Engineering, Eskisehir Osmangazi University, Eskisehir, Turkey.

Omer N Gerek (ON)

Department of Electrical and Electronics Engineering, Eskisehir Technical University, Eskisehir, Turkey.

Mehmet Fidan (M)

Vocational School of Transportation, Eskisehir Technical University, Eskisehir, Turkey.

M Bilginer Gulmezoglu (MB)

Department of Electrical and Electronics Engineering, Eskisehir Osmangazi University, Eskisehir, Turkey.

Semih Ergin (S)

Department of Electrical and Electronics Engineering, Eskisehir Osmangazi University, Eskisehir, Turkey.

Rifat Edizkan (R)

Department of Electrical and Electronics Engineering, Eskisehir Osmangazi University, Eskisehir, Turkey.

Mehmet Koc (M)

Department of Computer Engineering, Eskisehir Technical University, Eskisehir, Turkey. mehmetkoc@eskisehir.edu.tr.

Atalay Barkana (A)

Department of Electrical and Electronics Engineering, Eskisehir Technical University, Eskisehir, Turkey.

Cuneyt Calisir (C)

Department of Radiology, Manisa Celal Bayar University, Manisa, Turkey.

Classifications MeSH