Data augmentation for medical imaging: A systematic literature review.

Data augmentation Deep learning Generative adversarial networks MRI Medical imaging

Journal

Computers in biology and medicine
ISSN: 1879-0534
Titre abrégé: Comput Biol Med
Pays: United States
ID NLM: 1250250

Informations de publication

Date de publication:
01 2023
Historique:
received: 01 07 2022
revised: 22 11 2022
accepted: 29 11 2022
pubmed: 23 12 2022
medline: 6 1 2023
entrez: 22 12 2022
Statut: ppublish

Résumé

Recent advances in Deep Learning have largely benefited from larger and more diverse training sets. However, collecting large datasets for medical imaging is still a challenge due to privacy concerns and labeling costs. Data augmentation makes it possible to greatly expand the amount and variety of data available for training without actually collecting new samples. Data augmentation techniques range from simple yet surprisingly effective transformations such as cropping, padding, and flipping, to complex generative models. Depending on the nature of the input and the visual task, different data augmentation strategies are likely to perform differently. For this reason, it is conceivable that medical imaging requires specific augmentation strategies that generate plausible data samples and enable effective regularization of deep neural networks. Data augmentation can also be used to augment specific classes that are underrepresented in the training set, e.g., to generate artificial lesions. The goal of this systematic literature review is to investigate which data augmentation strategies are used in the medical domain and how they affect the performance of clinical tasks such as classification, segmentation, and lesion detection. To this end, a comprehensive analysis of more than 300 articles published in recent years (2018-2022) was conducted. The results highlight the effectiveness of data augmentation across organs, modalities, tasks, and dataset sizes, and suggest potential avenues for future research.

Identifiants

pubmed: 36549032
pii: S0010-4825(22)01099-X
doi: 10.1016/j.compbiomed.2022.106391
pii:
doi:

Types de publication

Systematic Review Journal Article Review

Langues

eng

Sous-ensembles de citation

IM

Pagination

106391

Informations de copyright

Copyright © 2022 Elsevier Ltd. All rights reserved.

Auteurs

Fabio Garcea (F)

Dipartimento di Automatica e Informatica, Politecnico di Torino, C.so Duca degli Abruzzi, 24, Torino, 10129, Italy.

Alessio Serra (A)

Dipartimento di Automatica e Informatica, Politecnico di Torino, C.so Duca degli Abruzzi, 24, Torino, 10129, Italy.

Fabrizio Lamberti (F)

Dipartimento di Automatica e Informatica, Politecnico di Torino, C.so Duca degli Abruzzi, 24, Torino, 10129, Italy.

Lia Morra (L)

Dipartimento di Automatica e Informatica, Politecnico di Torino, C.so Duca degli Abruzzi, 24, Torino, 10129, Italy. Electronic address: lia.morra@polito.it.

Articles similaires

Databases, Protein Protein Domains Protein Folding Proteins Deep Learning

Unsupervised learning for real-time and continuous gait phase detection.

Dollaporn Anopas, Yodchanan Wongsawat, Jetsada Arnin
1.00
Humans Gait Neural Networks, Computer Unsupervised Machine Learning Walking
Humans Shoulder Fractures Tomography, X-Ray Computed Neural Networks, Computer Female
Humans Artificial Intelligence Neoplasms Prognosis Image Processing, Computer-Assisted

Classifications MeSH