Survival analysis with semi-supervised predictive clustering trees.

Predictive clustering trees Random forests Random survival forests Semi-supervised learning Survival analysis

Journal

Computers in biology and medicine
ISSN: 1879-0534
Titre abrégé: Comput Biol Med
Pays: United States
ID NLM: 1250250

Informations de publication

Date de publication:
02 2022
Historique:
received: 18 03 2021
revised: 26 10 2021
accepted: 27 10 2021
pubmed: 17 11 2021
medline: 18 3 2022
entrez: 16 11 2021
Statut: ppublish

Résumé

Many clinical studies follow patients over time and record the time until the occurrence of an event of interest (e.g., recovery, death, …). When patients drop out of the study or when their event did not happen before the study ended, the collected dataset is said to contain censored observations. Given the rise of personalized medicine, clinicians are often interested in accurate risk prediction models that predict, for unseen patients, a survival profile, including the expected time until the event. Survival analysis methods are used to detect associations or compare subpopulations of patients in this context. In this article, we propose to cast the time-to-event prediction task as a multi-target regression task, with censored observations modeled as partially labeled examples. We then apply semi-supervised learning to the resulting data representation. More specifically, we use semi-supervised predictive clustering trees and ensembles thereof. Empirical results over eleven real-life datasets demonstrate superior or equivalent predictive performance of the proposed approach as compared to three competitor methods. Moreover, smaller models are obtained compared to random survival forests, another tree ensemble method. Finally, we illustrate the informative feature selection mechanism of our method, by interpreting the splits induced by a single tree model when predicting survival for amyotrophic lateral sclerosis patients.

Identifiants

pubmed: 34782112
pii: S0010-4825(21)00795-2
doi: 10.1016/j.compbiomed.2021.105001
pii:
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

105001

Informations de copyright

Copyright © 2021 Elsevier Ltd. All rights reserved.

Auteurs

Bijit Roy (B)

L-BioStat, KU Leuven, Leuven, Belgium.

Tomaž Stepišnik (T)

Jozef Stefan Institute, Jamova 39, Ljubljana, Slovenia; Jozef Stefan International Postgraduate School, Ljubljana, Slovenia.

Celine Vens (C)

KU Leuven, Dept. of Public Health and Primary Care, Kortrijk, Belgium; ITEC, IMEC Research Group at KU Leuven, Kortrijk, Belgium.

Sašo Džeroski (S)

Jozef Stefan Institute, Jamova 39, Ljubljana, Slovenia; Jozef Stefan International Postgraduate School, Ljubljana, Slovenia. Electronic address: saso.dzeroski@ijs.si.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH