Machine Learning to Discern Interactive Clusters of Risk Factors for Late Recurrence of Metastatic Breast Cancer.
Markov Blanket and Interactive Risk Factor Learner (MBIL)
causal learning
machine learning
metastasis
metastatic breast cancer
risk factors
Journal
Cancers
ISSN: 2072-6694
Abbreviated title: Cancers (Basel)
Country: Switzerland
NLM ID: 101526829
Publication information
Publication date: 05 Jan 2022
History:
received: 29 Nov 2021
revised: 22 Dec 2021
accepted: 30 Dec 2021
entrez: 11 Jan 2022
pubmed: 12 Jan 2022
medline: 12 Jan 2022
Status: epublish
Abstract
BACKGROUND
The risk of metastatic recurrence of breast cancer after initial diagnosis and treatment depends on a number of risk factors. Although most univariate risk factors have been identified using classical methods, machine-learning methods are also being used to tease out non-obvious contributors to a patient's individual risk of developing late distant metastasis. Bayesian-network algorithms can identify not only risk factors but also interactions among them, which may in turn increase the risk of developing metastatic breast cancer. We set out to apply a previously developed machine-learning method to discern risk factors for 5-, 10-, and 15-year metastasis.
METHODS
We applied a previously validated algorithm, the Markov Blanket and Interactive Risk Factor Learner (MBIL), to the electronic health record (EHR)-based Lynn Sage Database (LSDB) from the Lynn Sage Comprehensive Breast Center at Northwestern Memorial Hospital. The algorithm output both single and interacting risk factors for 5-, 10-, and 15-year metastasis from the LSDB. We individually examined and interpreted the clinical relevance of these interactions based on years to metastasis and reliance on interactivity between risk factors.
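To make the distinction between single and interacting risk factors concrete, the following is a minimal sketch using a generic information-theoretic measure. It is not MBIL's scoring function and does not use LSDB data: the function names, the field names (factor_a, factor_b, metastasis), and the four toy records are all hypothetical. It shows two factors that carry no information about the outcome on their own but fully determine it together.

```python
from collections import Counter
from math import log2


def mutual_information(rows, factors, outcome):
    """Empirical mutual information I(factors; outcome) in bits."""
    n = len(rows)
    # Count joint and marginal occurrences of (factor values, outcome value).
    joint = Counter((tuple(r[f] for f in factors), r[outcome]) for r in rows)
    p_x = Counter(tuple(r[f] for f in factors) for r in rows)
    p_y = Counter(r[outcome] for r in rows)
    return sum(
        (c / n) * log2((c / n) / ((p_x[x] / n) * (p_y[y] / n)))
        for (x, y), c in joint.items()
    )


def synergy(rows, a, b, outcome):
    """Information the pair carries beyond the sum of its individual parts."""
    return (
        mutual_information(rows, [a, b], outcome)
        - mutual_information(rows, [a], outcome)
        - mutual_information(rows, [b], outcome)
    )


# Hypothetical toy records (not patient data): the outcome occurs only
# when exactly one of the two factors is present.
toy_rows = [
    {"factor_a": 0, "factor_b": 0, "metastasis": 0},
    {"factor_a": 0, "factor_b": 1, "metastasis": 1},
    {"factor_a": 1, "factor_b": 0, "metastasis": 1},
    {"factor_a": 1, "factor_b": 1, "metastasis": 0},
]

single_a = mutual_information(toy_rows, ["factor_a"], "metastasis")
single_b = mutual_information(toy_rows, ["factor_b"], "metastasis")
pair = mutual_information(toy_rows, ["factor_a", "factor_b"], "metastasis")
pair_synergy = synergy(toy_rows, "factor_a", "factor_b", "metastasis")
```

In this toy case each factor alone is uninformative, so a purely univariate screen would miss both, while the pair is fully informative. Real clinical data are noisier, and an interaction-aware learner would need a threshold to decide how strong such an effect must be before the pair is reported as interacting.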
RESULTS
We found that, with lower alpha values (low interactivity score), the prevalence of variables with an independent influence on long-term metastasis was higher (e.g., HER2, TNEG). As the value of alpha increased to 480, stronger interactions were needed to define clusters of factors that increased the risk of metastasis (e.g., ER, smoking, race, alcohol usage).
CONCLUSIONS
MBIL identified single and interacting risk factors of metastatic breast cancer, many of which were supported by clinical evidence. These results strongly support conducting further large-data studies with different databases to validate the degree to which some of these variables affect metastatic breast cancer in the long term.
Identifiers
pubmed: 35008417
pii: cancers14010253
doi: 10.3390/cancers14010253
pmc: PMC8750735
Publication types
Journal Article
Languages
eng
Grants
Agency: BLRD VA
ID: I01 BX003368
Country: United States
Agency: United States Department of Defense
ID: W81XWH1910495