A Data Set and Deep Learning Algorithm for the Detection of Masses and Architectural Distortions in Digital Breast Tomosynthesis Images.

Aged Breast / diagnostic imaging Breast Neoplasms / diagnosis Datasets as Topic Deep Learning Early Detection of Cancer / methods False Positive Reactions Female Humans Mammography Middle Aged ROC Curve Reproducibility of Results

Journal

JAMA network open

ISSN: 2574-3805

Titre abrégé: JAMA Netw Open

Pays: United States

ID NLM: 101729235

Informations de publication

Date de publication:
02 08 2021

Historique:

entrez: 16 8 2021

pubmed: 17 8 2021

medline: 6 1 2022

Statut: epublish

Résumé

Breast cancer screening is among the most common radiological tasks, with more than 39 million examinations performed each year. While it has been among the most studied medical imaging applications of artificial intelligence, the development and evaluation of algorithms are hindered by the lack of well-annotated, large-scale publicly available data sets. To curate, annotate, and make publicly available a large-scale data set of digital breast tomosynthesis (DBT) images to facilitate the development and evaluation of artificial intelligence algorithms for breast cancer screening; to develop a baseline deep learning model for breast cancer detection; and to test this model using the data set to serve as a baseline for future research. In this diagnostic study, 16 802 DBT examinations with at least 1 reconstruction view available, performed between August 26, 2014, and January 29, 2018, were obtained from Duke Health System and analyzed. From the initial cohort, examinations were divided into 4 groups and split into training and test sets for the development and evaluation of a deep learning model. Images with foreign objects or spot compression views were excluded. Data analysis was conducted from January 2018 to October 2020. Screening DBT. The detection algorithm was evaluated with breast-based free-response receiver operating characteristic curve and sensitivity at 2 false positives per volume. The curated data set contained 22 032 reconstructed DBT volumes that belonged to 5610 studies from 5060 patients with a mean (SD) age of 55 (11) years and 5059 (100.0%) women. This included 4 groups of studies: (1) 5129 (91.4%) normal studies; (2) 280 (5.0%) actionable studies, for which where additional imaging was needed but no biopsy was performed; (3) 112 (2.0%) benign biopsied studies; and (4) 89 studies (1.6%) with cancer. Our data set included masses and architectural distortions that were annotated by 2 experienced radiologists. Our deep learning model reached breast-based sensitivity of 65% (39 of 60; 95% CI, 56%-74%) at 2 false positives per DBT volume on a test set of 460 examinations from 418 patients. The large, diverse, and curated data set presented in this study could facilitate the development and evaluation of artificial intelligence algorithms for breast cancer screening by providing data for training as well as a common set of cases for model validation. The performance of the model developed in this study showed that the task remains challenging; its performance could serve as a baseline for future model development.

Identifiants

DOI: 10.1001/jamanetworkopen.2021.19100 PMID: 34398205 PMC: PMC8369362

pubmed: 34398205

pii: 2783046

doi: 10.1001/jamanetworkopen.2021.19100

pmc: PMC8369362

doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

Pagination

e2119100

Subventions

Organisme : NIBIB NIH HHS

ID : R01 EB021360

Pays : United States

Références

IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149

pubmed: 27295650

JAMA Netw Open. 2020 Mar 2;3(3):e200265

pubmed: 32119094

Med Phys. 2016 Dec;43(12):6654

pubmed: 27908154

Phys Med Biol. 2021 Jan 30;66(3):035028

pubmed: 32485700

Radiology. 2019 Nov;293(2):246-259

pubmed: 31549948

Radiology. 2015 Dec;277(3):663-84

pubmed: 26599926

Nature. 2020 Jan;577(7788):89-94

pubmed: 31894144

Lancet Digit Health. 2020 Mar;2(3):e138-e148

pubmed: 33334578

Nat Med. 2021 Feb;27(2):244-249

pubmed: 33432172

Med Image Anal. 2017 Dec;42:60-88

pubmed: 28778026

Acad Radiol. 2019 Jun;26(6):735-743

pubmed: 30076083

Med Phys. 2018 Mar;45(3):1150-1158

pubmed: 29356028

Front Mol Biosci. 2020 Nov 11;7:599333

pubmed: 33263004

Clin Radiol. 2019 May;74(5):357-366

pubmed: 30898381

Neural Netw. 2018 Oct;106:249-259

pubmed: 30092410

A Data Set and Deep Learning Algorithm for the Detection of Masses and Architectural Distortions in Digital Breast Tomosynthesis Images.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Subventions

Références

Auteurs

Mateusz Buda (M)

Ashirbani Saha (A)

Ruth Walsh (R)

Sujata Ghate (S)

Nianyi Li (N)

Albert Swiecicki (A)

Joseph Y Lo (JY)

Maciej A Mazurowski (MA)

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Smoking Cessation and Incident Cardiovascular Disease.

Evaluation of Low-Value Services Across Major Medicare Advantage Insurers and Traditional Medicare.

Effectiveness of Virtual Yoga for Chronic Low Back Pain: A Randomized Clinical Trial.

Classifications MeSH