Comparing machine learning approaches to incorporate time-varying covariates in predicting cancer survival time.
Journal
Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288
Informations de publication
Date de publication:
25 01 2023
25 01 2023
Historique:
received:
19
07
2022
accepted:
18
01
2023
entrez:
25
1
2023
pubmed:
26
1
2023
medline:
28
1
2023
Statut:
epublish
Résumé
The Cox proportional hazards model is commonly used in evaluating risk factors in cancer survival data. The model assumes an additive, linear relationship between the risk factors and the log hazard. However, this assumption may be too simplistic. Further, failure to take time-varying covariates into account, if present, may lower prediction accuracy. In this retrospective, population-based, prognostic study of data from patients diagnosed with cancer from 2008 to 2015 in Ontario, Canada, we applied machine learning-based time-to-event prediction methods and compared their predictive performance in two sets of analyses: (1) yearly-cohort-based time-invariant and (2) fully time-varying covariates analysis. Machine learning-based methods-gradient boosting model (gbm), random survival forest (rsf), elastic net (enet), lasso and ridge-were compared to the traditional Cox proportional hazards (coxph) model and the prior study which used the yearly-cohort-based time-invariant analysis. Using Harrell's C index as our primary measure, we found that using both machine learning techniques and incorporating time-dependent covariates can improve predictive performance. Gradient boosting machine showed the best performance on test data in both time-invariant and time-varying covariates analysis.
Identifiants
pubmed: 36697455
doi: 10.1038/s41598-023-28393-7
pii: 10.1038/s41598-023-28393-7
pmc: PMC9877029
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
1370Subventions
Organisme : CIHR
ID : 379009
Pays : Canada
Organisme : CIHR
ID : 383402
Pays : Canada
Informations de copyright
© 2023. The Author(s).
Références
Hepatogastroenterology. 2003 Jan-Feb;50(49):250-3
pubmed: 12630033
Sci Rep. 2020 Nov 23;10(1):20410
pubmed: 33230128
Stat Med. 1996 Feb 28;15(4):361-87
pubmed: 8668867
Technol Health Care. 2016;24(1):31-42
pubmed: 26409558
Comput Struct Biotechnol J. 2014 Nov 15;13:8-17
pubmed: 25750696
Stat Methods Med Res. 2022 Nov;31(11):2217-2236
pubmed: 35895510
J Clin Oncol. 2011 Mar 20;29(9):1151-8
pubmed: 21300920
J Palliat Med. 2016 Jan;19(1):16-21
pubmed: 26488947
Ann Palliat Med. 2016 Jan;5(1):22-9
pubmed: 26841812
JAMA Netw Open. 2020 Apr 1;3(4):e201768
pubmed: 32236529
J Stat Softw. 2011 Mar;39(5):1-13
pubmed: 27065756
Biometrics. 2000 Jun;56(2):337-44
pubmed: 10877287
Biostatistics. 2014 Oct;15(4):757-73
pubmed: 24728979
Artif Intell Med. 2010 Jul;49(3):187-95
pubmed: 20483571
Stat Med. 2013 Jun 15;32(13):2173-84
pubmed: 23172755
PLoS One. 2018 Dec 31;13(12):e0208808
pubmed: 30596658
BMC Med Res Methodol. 2018 Feb 26;18(1):24
pubmed: 29482517
BMJ Open. 2014 Mar 17;4(3):e004007
pubmed: 24643167
Clin Oncol (R Coll Radiol). 2001;13(3):209-18
pubmed: 11527298