BreastNet18: A High Accuracy Fine-Tuned VGG16 Model Evaluated Using Ablation Study for Diagnosing Breast Cancer from Enhanced Mammography Images.

CBIS-DDSM ablation study breast cancer classification data augmentation deep learning feature map analysis fine-tuned VGG16 image preprocessing mammograms transfer learning models

Journal

Biology
ISSN: 2079-7737
Titre abrégé: Biology (Basel)
Pays: Switzerland
ID NLM: 101587988

Informations de publication

Date de publication:
17 Dec 2021
Historique:
received: 13 11 2021
revised: 12 12 2021
accepted: 14 12 2021
entrez: 24 12 2021
pubmed: 25 12 2021
medline: 25 12 2021
Statut: epublish

Résumé

Identification and treatment of breast cancer at an early stage can reduce mortality. Currently, mammography is the most widely used effective imaging technique in breast cancer detection. However, an erroneous mammogram based interpretation may result in false diagnosis rate, as distinguishing cancerous masses from adjacent tissue is often complex and error-prone. Six pre-trained and fine-tuned deep CNN architectures: VGG16, VGG19, MobileNetV2, ResNet50, DenseNet201, and InceptionV3 are evaluated to determine which model yields the best performance. We propose a BreastNet18 model using VGG16 as foundational base, since VGG16 performs with the highest accuracy. An ablation study is performed on BreastNet18, to evaluate its robustness and achieve the highest possible accuracy. Various image processing techniques with suitable parameter values are employed to remove artefacts and increase the image quality. A total dataset of 1442 preprocessed mammograms was augmented using seven augmentation techniques, resulting in a dataset of 11,536 images. To investigate possible overfitting issues, a k-fold cross validation is carried out. The model was then tested on noisy mammograms to evaluate its robustness. Results were compared with previous studies. Proposed BreastNet18 model performed best with a training accuracy of 96.72%, a validating accuracy of 97.91%, and a test accuracy of 98.02%. In contrast to this, VGGNet19 yielded test accuracy of 96.24%, MobileNetV2 77.84%, ResNet50 79.98%, DenseNet201 86.92%, and InceptionV3 76.87%. Our proposed approach based on image processing, transfer learning, fine-tuning, and ablation study has demonstrated a high correct breast cancer classification while dealing with a limited number of complex medical images.

Sections du résumé

BACKGROUND BACKGROUND
Identification and treatment of breast cancer at an early stage can reduce mortality. Currently, mammography is the most widely used effective imaging technique in breast cancer detection. However, an erroneous mammogram based interpretation may result in false diagnosis rate, as distinguishing cancerous masses from adjacent tissue is often complex and error-prone.
METHODS METHODS
Six pre-trained and fine-tuned deep CNN architectures: VGG16, VGG19, MobileNetV2, ResNet50, DenseNet201, and InceptionV3 are evaluated to determine which model yields the best performance. We propose a BreastNet18 model using VGG16 as foundational base, since VGG16 performs with the highest accuracy. An ablation study is performed on BreastNet18, to evaluate its robustness and achieve the highest possible accuracy. Various image processing techniques with suitable parameter values are employed to remove artefacts and increase the image quality. A total dataset of 1442 preprocessed mammograms was augmented using seven augmentation techniques, resulting in a dataset of 11,536 images. To investigate possible overfitting issues, a k-fold cross validation is carried out. The model was then tested on noisy mammograms to evaluate its robustness. Results were compared with previous studies.
RESULTS RESULTS
Proposed BreastNet18 model performed best with a training accuracy of 96.72%, a validating accuracy of 97.91%, and a test accuracy of 98.02%. In contrast to this, VGGNet19 yielded test accuracy of 96.24%, MobileNetV2 77.84%, ResNet50 79.98%, DenseNet201 86.92%, and InceptionV3 76.87%.
CONCLUSIONS CONCLUSIONS
Our proposed approach based on image processing, transfer learning, fine-tuning, and ablation study has demonstrated a high correct breast cancer classification while dealing with a limited number of complex medical images.

Identifiants

pubmed: 34943262
pii: biology10121347
doi: 10.3390/biology10121347
pmc: PMC8698892
pii:
doi:

Types de publication

Journal Article

Langues

eng

Références

Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:1132-1135
pubmed: 33018186
IEEE Trans Med Imaging. 2004 Feb;23(2):232-45
pubmed: 14964567
Med Image Anal. 2017 Jan;35:303-312
pubmed: 27497072
Diagnostics (Basel). 2019 Aug 25;9(3):
pubmed: 31450720
J Am Coll Radiol. 2019 Feb;16(2):185-193
pubmed: 30545710
Biomed Res Int. 2017;2017:3640901
pubmed: 28191461
J Am Coll Radiol. 2020 Jun;17(6):796-803
pubmed: 32068005
Front Neurosci. 2018 Nov 08;12:818
pubmed: 30467462
IEEE Trans Med Imaging. 2016 May;35(5):1285-98
pubmed: 26886976
CA Cancer J Clin. 2015 Mar;65(2):87-108
pubmed: 25651787
Int J Med Inform. 2018 Sep;117:44-54
pubmed: 30032964
J Natl Cancer Inst Monogr. 1997;(22):99-104
pubmed: 9709284
Sensors (Basel). 2020 Aug 05;20(16):
pubmed: 32764398
Biology (Basel). 2021 Sep 02;10(9):
pubmed: 34571736
J Med Signals Sens. 2020 Jul 03;10(3):158-173
pubmed: 33062608
J Thorac Dis. 2020 Sep;12(9):4690-4701
pubmed: 33145042
IEEE Trans Inf Technol Biomed. 2009 Mar;13(2):236-51
pubmed: 19171527
Nature. 2017 Feb 2;542(7639):115-118
pubmed: 28117445
Sci Rep. 2018 Mar 15;8(1):4165
pubmed: 29545529
IEEE Trans Pattern Anal Mach Intell. 2010 Mar;32(3):569-75
pubmed: 20075479
PeerJ. 2019 Jan 28;7:e6201
pubmed: 30713814
Multidimens Syst Signal Process. 2021 Jan 11;:1-19
pubmed: 33456204
J Digit Imaging. 2010 Oct;23(5):611-31
pubmed: 20127270
CA Cancer J Clin. 2017 Jan;67(1):7-30
pubmed: 28055103
Br J Radiol. 2018 Sep;91(1089):20170545
pubmed: 29565644
Biology (Basel). 2021 Feb 26;10(3):
pubmed: 33652727

Auteurs

Sidratul Montaha (S)

Department of Computer Science and Engineering, Daffodil International University, Dhaka 1207, Bangladesh.

Sami Azam (S)

College of Engineering, IT and Environment, Charles Darwin University, Darwin, NT 0909, Australia.

Abul Kalam Muhammad Rakibul Haque Rafid (AKMRH)

Department of Computer Science and Engineering, Daffodil International University, Dhaka 1207, Bangladesh.

Pronab Ghosh (P)

Department of Computer Science (CS), Lakehead University, 955 Oliver Rd, Thunder Bay, ON P7B 5E1, Canada.

Md Zahid Hasan (MZ)

Department of Computer Science and Engineering, Daffodil International University, Dhaka 1207, Bangladesh.

Mirjam Jonkman (M)

College of Engineering, IT and Environment, Charles Darwin University, Darwin, NT 0909, Australia.

Friso De Boer (F)

College of Engineering, IT and Environment, Charles Darwin University, Darwin, NT 0909, Australia.

Classifications MeSH