A Data Set and Deep Learning Algorithm for the Detection of Masses and Architectural Distortions in Digital Breast Tomosynthesis Images.
Journal
JAMA network open
ISSN: 2574-3805
Titre abrégé: JAMA Netw Open
Pays: United States
ID NLM: 101729235
Informations de publication
Date de publication:
02 08 2021
02 08 2021
Historique:
entrez:
16
8
2021
pubmed:
17
8
2021
medline:
6
1
2022
Statut:
epublish
Résumé
Breast cancer screening is among the most common radiological tasks, with more than 39 million examinations performed each year. While it has been among the most studied medical imaging applications of artificial intelligence, the development and evaluation of algorithms are hindered by the lack of well-annotated, large-scale publicly available data sets. To curate, annotate, and make publicly available a large-scale data set of digital breast tomosynthesis (DBT) images to facilitate the development and evaluation of artificial intelligence algorithms for breast cancer screening; to develop a baseline deep learning model for breast cancer detection; and to test this model using the data set to serve as a baseline for future research. In this diagnostic study, 16 802 DBT examinations with at least 1 reconstruction view available, performed between August 26, 2014, and January 29, 2018, were obtained from Duke Health System and analyzed. From the initial cohort, examinations were divided into 4 groups and split into training and test sets for the development and evaluation of a deep learning model. Images with foreign objects or spot compression views were excluded. Data analysis was conducted from January 2018 to October 2020. Screening DBT. The detection algorithm was evaluated with breast-based free-response receiver operating characteristic curve and sensitivity at 2 false positives per volume. The curated data set contained 22 032 reconstructed DBT volumes that belonged to 5610 studies from 5060 patients with a mean (SD) age of 55 (11) years and 5059 (100.0%) women. This included 4 groups of studies: (1) 5129 (91.4%) normal studies; (2) 280 (5.0%) actionable studies, for which where additional imaging was needed but no biopsy was performed; (3) 112 (2.0%) benign biopsied studies; and (4) 89 studies (1.6%) with cancer. Our data set included masses and architectural distortions that were annotated by 2 experienced radiologists. Our deep learning model reached breast-based sensitivity of 65% (39 of 60; 95% CI, 56%-74%) at 2 false positives per DBT volume on a test set of 460 examinations from 418 patients. The large, diverse, and curated data set presented in this study could facilitate the development and evaluation of artificial intelligence algorithms for breast cancer screening by providing data for training as well as a common set of cases for model validation. The performance of the model developed in this study showed that the task remains challenging; its performance could serve as a baseline for future model development.
Identifiants
pubmed: 34398205
pii: 2783046
doi: 10.1001/jamanetworkopen.2021.19100
pmc: PMC8369362
doi:
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Langues
eng
Sous-ensembles de citation
IM
Pagination
e2119100Subventions
Organisme : NIBIB NIH HHS
ID : R01 EB021360
Pays : United States
Références
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149
pubmed: 27295650
JAMA Netw Open. 2020 Mar 2;3(3):e200265
pubmed: 32119094
Med Phys. 2016 Dec;43(12):6654
pubmed: 27908154
Phys Med Biol. 2021 Jan 30;66(3):035028
pubmed: 32485700
Radiology. 2019 Nov;293(2):246-259
pubmed: 31549948
Radiology. 2015 Dec;277(3):663-84
pubmed: 26599926
Nature. 2020 Jan;577(7788):89-94
pubmed: 31894144
Lancet Digit Health. 2020 Mar;2(3):e138-e148
pubmed: 33334578
Nat Med. 2021 Feb;27(2):244-249
pubmed: 33432172
Med Image Anal. 2017 Dec;42:60-88
pubmed: 28778026
Acad Radiol. 2019 Jun;26(6):735-743
pubmed: 30076083
Med Phys. 2018 Mar;45(3):1150-1158
pubmed: 29356028
Front Mol Biosci. 2020 Nov 11;7:599333
pubmed: 33263004
Clin Radiol. 2019 May;74(5):357-366
pubmed: 30898381
Neural Netw. 2018 Oct;106:249-259
pubmed: 30092410