Machine Learning to Discern Interactive Clusters of Risk Factors for Late Recurrence of Metastatic Breast Cancer.

Markov Blanket and Interactive Risk Factor Learner (MBIL) causal learning machine learning metastasis metastatic breast cancer risk factors

Journal

Cancers
ISSN: 2072-6694
Titre abrégé: Cancers (Basel)
Pays: Switzerland
ID NLM: 101526829

Informations de publication

Date de publication:
05 Jan 2022
Historique:
received: 29 11 2021
revised: 22 12 2021
accepted: 30 12 2021
entrez: 11 1 2022
pubmed: 12 1 2022
medline: 12 1 2022
Statut: epublish

Résumé

Risk of metastatic recurrence of breast cancer after initial diagnosis and treatment depends on the presence of a number of risk factors. Although most univariate risk factors have been identified using classical methods, machine-learning methods are also being used to tease out non-obvious contributors to a patient's individual risk of developing late distant metastasis. Bayesian-network algorithms can identify not only risk factors but also interactions among these risks, which consequently may increase the risk of developing metastatic breast cancer. We proposed to apply a previously developed machine-learning method to discern risk factors of 5-, 10- and 15-year metastases. We applied a previously validated algorithm named the Markov Blanket and Interactive Risk Factor Learner (MBIL) to the electronic health record (EHR)-based Lynn Sage Database (LSDB) from the Lynn Sage Comprehensive Breast Center at Northwestern Memorial Hospital. This algorithm provided an output of both single and interactive risk factors of 5-, 10-, and 15-year metastases from the LSDB. We individually examined and interpreted the clinical relevance of these interactions based on years to metastasis and reliance on interactivity between risk factors. We found that, with lower alpha values (low interactivity score), the prevalence of variables with an independent influence on long-term metastasis was higher (i.e., HER2, TNEG). As the value of alpha increased to 480, stronger interactions were needed to define clusters of factors that increased the risk of metastasis (i.e., ER, smoking, race, alcohol usage). MBIL identified single and interacting risk factors of metastatic breast cancer, many of which were supported by clinical evidence. These results strongly recommend the development of further large data studies with different databases to validate the degree to which some of these variables impact metastatic breast cancer in the long term.

Sections du résumé

BACKGROUND BACKGROUND
Risk of metastatic recurrence of breast cancer after initial diagnosis and treatment depends on the presence of a number of risk factors. Although most univariate risk factors have been identified using classical methods, machine-learning methods are also being used to tease out non-obvious contributors to a patient's individual risk of developing late distant metastasis. Bayesian-network algorithms can identify not only risk factors but also interactions among these risks, which consequently may increase the risk of developing metastatic breast cancer. We proposed to apply a previously developed machine-learning method to discern risk factors of 5-, 10- and 15-year metastases.
METHODS METHODS
We applied a previously validated algorithm named the Markov Blanket and Interactive Risk Factor Learner (MBIL) to the electronic health record (EHR)-based Lynn Sage Database (LSDB) from the Lynn Sage Comprehensive Breast Center at Northwestern Memorial Hospital. This algorithm provided an output of both single and interactive risk factors of 5-, 10-, and 15-year metastases from the LSDB. We individually examined and interpreted the clinical relevance of these interactions based on years to metastasis and reliance on interactivity between risk factors.
RESULTS RESULTS
We found that, with lower alpha values (low interactivity score), the prevalence of variables with an independent influence on long-term metastasis was higher (i.e., HER2, TNEG). As the value of alpha increased to 480, stronger interactions were needed to define clusters of factors that increased the risk of metastasis (i.e., ER, smoking, race, alcohol usage).
CONCLUSION CONCLUSIONS
MBIL identified single and interacting risk factors of metastatic breast cancer, many of which were supported by clinical evidence. These results strongly recommend the development of further large data studies with different databases to validate the degree to which some of these variables impact metastatic breast cancer in the long term.

Identifiants

pubmed: 35008417
pii: cancers14010253
doi: 10.3390/cancers14010253
pmc: PMC8750735
pii:
doi:

Types de publication

Journal Article

Langues

eng

Subventions

Organisme : BLRD VA
ID : I01 BX003368
Pays : United States
Organisme : United States Department of Defense
ID : W81XWH1910495

Références

Nature. 2021 Dec 7;:
pubmed: 34875674
Breast Cancer Res Treat. 2019 Jan;173(2):465-474
pubmed: 30328050
J Surg Oncol. 2016 May;113(6):609-15
pubmed: 26991020
PLoS One. 2019 Mar 8;14(3):e0213292
pubmed: 30849111
World J Surg Oncol. 2016 Aug 24;14(1):223
pubmed: 27557635
J Clin Oncol. 2010 Oct 10;28(29):4410-6
pubmed: 20805458
Breast. 2015 Oct;24(5):594-600
pubmed: 26144637
J Am Coll Surg. 2018 Apr;226(4):406-412.e1
pubmed: 29366844
JAMA Oncol. 2021 Mar 1;7(3):370-378
pubmed: 33475714
BMC Bioinformatics. 2016 May 26;17(1):221
pubmed: 27230078
Clin Cancer Res. 2010 Dec 15;16(24):6100-10
pubmed: 21169259
Int J Cancer. 2016 May 1;138(9):2088-97
pubmed: 26606746
N Engl J Med. 2017 Nov 9;377(19):1836-1846
pubmed: 29117498
Breast Cancer Res. 2015 Jan 27;17:10
pubmed: 25848913
Lancet. 2013 Mar 9;381(9869):805-16
pubmed: 23219286
Breast Cancer Res Treat. 2018 Jan;167(1):171-181
pubmed: 28861753
Eur J Cancer. 2019 Mar;110:53-61
pubmed: 30769227
BMC Bioinformatics. 2020 Jul 10;21(1):298
pubmed: 32650714
PLoS One. 2015 Dec 01;10(12):e0143247
pubmed: 26624895

Auteurs

Juan Luis Gomez Marti (JL)

Department of Pathology, University of Pittsburgh, Pittsburgh, PA 15213, USA.
R&D Service, Pittsburgh VA Health System, Pittsburgh, PA 15240, USA.

Adam Brufsky (A)

Department of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA.
Hillman Cancer Center, University of Pittsburgh Medical Center, Pittsburgh, PA 15232, USA.

Alan Wells (A)

Department of Pathology, University of Pittsburgh, Pittsburgh, PA 15213, USA.
R&D Service, Pittsburgh VA Health System, Pittsburgh, PA 15240, USA.
Hillman Cancer Center, University of Pittsburgh Medical Center, Pittsburgh, PA 15232, USA.

Xia Jiang (X)

Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15206, USA.

Classifications MeSH