Comprehensive Evaluation of Machine Learning Models and Gene Expression Signatures for Prostate Cancer Prognosis Using Large Population Cohorts.
Journal
Cancer research
ISSN: 1538-7445
Titre abrégé: Cancer Res
Pays: United States
ID NLM: 2984705R
Informations de publication
Date de publication:
03 05 2022
03 05 2022
Historique:
received:
11
09
2021
revised:
07
01
2022
accepted:
07
03
2022
pubmed:
1
4
2022
medline:
6
5
2022
entrez:
31
3
2022
Statut:
ppublish
Résumé
Overtreatment remains a pervasive problem in prostate cancer management due to the highly variable and often indolent course of disease. Molecular signatures derived from gene expression profiling have played critical roles in guiding prostate cancer treatment decisions. Many gene expression signatures have been developed to improve the risk stratification of prostate cancer and some of them have already been applied to clinical practice. However, no comprehensive evaluation has been performed to compare the performance of these signatures. In this study, we conducted a systematic and unbiased evaluation of 15 machine learning (ML) algorithms and 30 published prostate cancer gene expression-based prognostic signatures leveraging 10 transcriptomics datasets with 1,558 primary patients with prostate cancer from public data repositories. This analysis revealed that survival analysis models outperformed binary classification models for risk assessment, and the performance of the survival analysis methods-Cox model regularized with ridge penalty (Cox-Ridge) and partial least squares (PLS) regression for Cox model (Cox-PLS)-were generally more robust than the other methods. Based on the Cox-Ridge algorithm, several top prognostic signatures displayed comparable or even better performance than commercial panels. These findings will facilitate the identification of existing prognostic signatures that are promising for further validation in prospective studies and promote the development of robust prognostic models to guide clinical decision-making. Moreover, this study provides a valuable data resource from large primary prostate cancer cohorts, which can be used to develop, validate, and evaluate novel statistical methodologies and molecular signatures to improve prostate cancer management. This systematic evaluation of 15 machine learning algorithms and 30 published gene expression signatures for the prognosis of prostate cancer will assist clinical decision-making.
Identifiants
pubmed: 35358302
pii: 682141
doi: 10.1158/0008-5472.CAN-21-3074
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Langues
eng
Sous-ensembles de citation
IM
Pagination
1832-1843Informations de copyright
©2022 American Association for Cancer Research.