OBLIQUE RANDOM SURVIVAL FORESTS.
60K35
Cardiovascular Disease
Machine Learning
Penalized Regression
Primary 60K35
Random Forest
Survival
secondary 60K35
Journal
The annals of applied statistics
ISSN: 1932-6157
Titre abrégé: Ann Appl Stat
Pays: United States
ID NLM: 101479511
Informations de publication
Date de publication:
Sep 2019
Sep 2019
Historique:
entrez:
27
1
2023
pubmed:
1
9
2019
medline:
1
9
2019
Statut:
ppublish
Résumé
We introduce and evaluate the oblique random survival forest (ORSF). The ORSF is an ensemble method for right-censored survival data that uses linear combinations of input variables to recursively partition a set of training data. Regularized Cox proportional hazard models are used to identify linear combinations of input variables in each recursive partitioning step. Benchmark results using simulated and real data indicate that the ORSF's predicted risk function has high prognostic value in comparison to random survival forests, conditional inference forests, regression, and boosting. In an application to data from the Jackson Heart Study, we demonstrate variable and partial dependence using the ORSF and highlight characteristics of its 10-year predicted risk function for atherosclerotic cardiovascular disease events (ASCVD; stroke, coronary heart disease). We present visualizations comparing variable and partial effect estimation according to the ORSF, the conditional inference forest, and the Pooled Cohort Risk equations. The obliqueRSF R package, which provides functions to fit the ORSF and create variable and partial dependence plots, is available on the comprehensive R archive network (CRAN).
Identifiants
pubmed: 36704751
doi: 10.1214/19-aoas1261
pmc: PMC9875945
mid: NIHMS1827649
doi:
Types de publication
Journal Article
Langues
eng
Pagination
1847-1883Subventions
Organisme : NHLBI NIH HHS
ID : R01 HL080477
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL165452
Pays : United States
Organisme : NINDS NIH HHS
ID : U01 NS041588
Pays : United States
Références
JAMA. 2011 May 11;305(18):1873-81
pubmed: 21558518
J Stat Softw. 2010;33(1):1-22
pubmed: 20808728
J Am Stat Assoc. 2015;110(512):1770-1784
pubmed: 26903687
BMC Med Res Methodol. 2017 Jul 28;17(1):115
pubmed: 28754093
Stat Med. 2013 Jun 15;32(13):2173-84
pubmed: 23172755
Psychol Methods. 2009 Dec;14(4):323-48
pubmed: 19968396
J Clin Oncol. 2011 Apr 20;29(12):1578-86
pubmed: 21422418
Biometrics. 2005 Mar;61(1):92-105
pubmed: 15737082
JAMA. 1982 May 14;247(18):2543-6
pubmed: 7069920
Nature. 2002 Jan 31;415(6871):530-6
pubmed: 11823860
Ethn Dis. 2005 Autumn;15(4 Suppl 6):S6-4-17
pubmed: 16320381
Ann Intern Med. 2009 May 5;150(9):604-12
pubmed: 19414839
J Stat Softw. 2012 Sep;50(11):1-23
pubmed: 25317082
JAMA. 2012 Nov 7;308(17):1768-74
pubmed: 23117777
Stat Med. 2006 Sep 30;25(18):3201-16
pubmed: 16143967
J Am Coll Cardiol. 2018 May 15;71(19):e127-e248
pubmed: 29146535
Stat Med. 1999 Sep 15-30;18(17-18):2529-45
pubmed: 10474158
BMC Bioinformatics. 2007 Jan 25;8:25
pubmed: 17254353
Biostatistics. 2019 Apr 1;20(2):347-357
pubmed: 29462286
Stat Med. 2004 Jan 15;23(1):77-91
pubmed: 14695641
Bioinformatics. 2008 May 1;24(9):1154-60
pubmed: 18325927
Neuroepidemiology. 2005;25(3):135-43
pubmed: 15990444
Biom J. 2017 Jul;59(4):685-701
pubmed: 27862181
J Clin Oncol. 1994 Oct;12(10):2086-93
pubmed: 7931478
Stat Med. 2019 May 20;38(11):2074-2102
pubmed: 30652356
J Stat Softw. 2011 Mar;39(5):1-13
pubmed: 27065756
Biostatistics. 2010 Apr;11(2):242-53
pubmed: 20097884
Ann Appl Stat. 2019 Sep;13(3):1847-1883
pubmed: 36704751
Biometrics. 2000 Jun;56(2):337-44
pubmed: 10877287