The Effectiveness of Image Augmentation in Deep Learning Networks for Detecting COVID-19: A Geometric Transformation Perspective.
artificial intelligence
chest X-ray
corona virus
data augmentation
digital health
machine learning
radiology
transfer learning
Journal
Frontiers in medicine
ISSN: 2296-858X
Titre abrégé: Front Med (Lausanne)
Pays: Switzerland
ID NLM: 101648047
Informations de publication
Date de publication:
2021
2021
Historique:
received:
13
11
2020
accepted:
29
01
2021
entrez:
18
3
2021
pubmed:
19
3
2021
medline:
19
3
2021
Statut:
epublish
Résumé
Chest X-ray imaging technology used for the early detection and screening of COVID-19 pneumonia is both accessible worldwide and affordable compared to other non-invasive technologies. Additionally, deep learning methods have recently shown remarkable results in detecting COVID-19 on chest X-rays, making it a promising screening technology for COVID-19. Deep learning relies on a large amount of data to avoid overfitting. While overfitting can result in perfect modeling on the original training dataset, on a new testing dataset it can fail to achieve high accuracy. In the image processing field, an image augmentation step (i.e., adding more training data) is often used to reduce overfitting on the training dataset, and improve prediction accuracy on the testing dataset. In this paper, we examined the impact of geometric augmentations as implemented in several recent publications for detecting COVID-19. We compared the performance of 17 deep learning algorithms with and without different geometric augmentations. We empirically examined the influence of augmentation with respect to detection accuracy, dataset diversity, augmentation methodology, and network size. Contrary to expectation, our results show that the removal of recently used geometrical augmentation steps actually improved the Matthews correlation coefficient (MCC) of 17 models. The MCC without augmentation (MCC = 0.51) outperformed four recent geometrical augmentations (MCC = 0.47 for Data Augmentation 1, MCC = 0.44 for Data Augmentation 2, MCC = 0.48 for Data Augmentation 3, and MCC = 0.49 for Data Augmentation 4). When we retrained a recently published deep learning without augmentation on the same dataset, the detection accuracy significantly increased, with a
Identifiants
pubmed: 33732718
doi: 10.3389/fmed.2021.629134
pmc: PMC7956964
doi:
Types de publication
Journal Article
Langues
eng
Pagination
629134Informations de copyright
Copyright © 2021 Elgendi, Nasir, Tang, Smith, Grenier, Batte, Spieler, Leslie, Menon, Fletcher, Howard, Ward, Parker and Nicolaou.
Déclaration de conflit d'intérêts
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Références
Front Med (Lausanne). 2020 Jul 14;7:427
pubmed: 32760732
Comput Methods Programs Biomed. 2020 Oct;194:105532
pubmed: 32446037
Phys Eng Sci Med. 2020 Jun;43(2):635-640
pubmed: 32524445
Radiology. 2021 Feb;298(2):E88-E97
pubmed: 32969761
BMC Genomics. 2020 Jan 2;21(1):6
pubmed: 31898477
Appl Intell (Dordr). 2021;51(1):571-585
pubmed: 34764547
Sci Rep. 2020 Oct 16;10(1):17532
pubmed: 33067538
Nat Commun. 2020 Jul 15;11(1):3543
pubmed: 32669540
J Big Data. 2021;8(1):101
pubmed: 34306963
Med Hypotheses. 2020 Jul;140:109761
pubmed: 32344309
IEEE Trans Neural Netw Learn Syst. 2015 May;26(5):1019-34
pubmed: 25014970
Radiol Cardiothorac Imaging. 2020 Sep 03;2(5):e200280
pubmed: 33778626
Psychometrika. 1948 Sep;13(3):185-7
pubmed: 18885738
Comput Methods Programs Biomed. 2020 Nov;196:105581
pubmed: 32534344
Front Med (Lausanne). 2020 Aug 18;7:550
pubmed: 33015100
Lancet. 2020 Oct 31;396(10260):e71-e72
pubmed: 33069277
Chaos Solitons Fractals. 2020 Sep;138:109944
pubmed: 32536759
SLAS Technol. 2020 Dec;25(6):553-565
pubmed: 32948098
Comput Biol Med. 2020 Jun;121:103792
pubmed: 32568675
J Clin Microbiol. 2009 Sep;47(9):2812-7
pubmed: 19571025
Psychometrika. 1947 Jun;12(2):153-7
pubmed: 20254758
J Med Biol Eng. 2020;40(3):462-469
pubmed: 32412551
Comput Methods Programs Biomed. 2020 Nov;196:105608
pubmed: 32599338
Comput Biol Med. 2020 Jul;122:103869
pubmed: 32658740
Chaos Solitons Fractals. 2020 Nov;140:110071
pubmed: 32834627
Inform Med Unlocked. 2020;20:100412
pubmed: 32835084