Machine Learning Improves Prediction Over Logistic Regression on Resected Colon Cancer Patients.

Colonic Neoplasms / surgery Humans Logistic Models Machine Learning Patient Readmission ROC Curve

Colon cancer Machine learning Outcomes Prediction Risk

Journal

The Journal of surgical research

ISSN: 1095-8673

Titre abrégé: J Surg Res

Pays: United States

ID NLM: 0376340

Informations de publication

Date de publication:
07 2022

Historique:

received: 04 02 2021

revised: 29 09 2021

accepted: 25 01 2022

pubmed: 15 3 2022

medline: 27 4 2022

entrez: 14 3 2022

Statut: ppublish

Résumé

Despite advances, readmission and mortality rates for surgical patients with colon cancer remain high. Prediction models using regression techniques allows for risk stratification to aid periprocedural care. Technological advances have enabled large data to be analyzed using machine learning (ML) algorithms. A national database of colon cancer patients was selected to determine whether ML methods better predict outcomes following surgery compared to conventional methods. Surgical colon cancer patients were identified using the 2013 National Cancer Database (NCDB). The negative outcome was defined as a composite of 30-d unplanned readmission and 30- and 90-d mortality. ML models, including Random Forest and XGBoost, were built and compared with conventional logistic regression. For the accounting of unbalanced outcomes, a synthetic minority oversampling technique (SMOTE) was implemented and applied using XGBoost. Analysis included 528,060 patients. The negative outcome occurred in 11.6% of patients. Model building utilized 30 variables. The primary metric for model comparison was area under the curve (AUC). In comparison to logistic regression (AUC 0.730, 95% CI: 0.725-0.735), AUC's for ML algorithms ranged between 0.748 and 0.757, with the Random Forest model (AUC 0.757, 95% CI: 0.752-0.762) outperforming XGBoost (AUC 0.756, 95% CI: 0.751-0.761) and XGBoost using SMOTE data (AUC 0.748, 95% CI: 0.743-0.753). We show that a large registry of surgical colon cancer patients can be utilized to build ML models to improve outcome prediction with differential discriminative ability. These results reveal the potential of these methods to enhance risk prediction, leading to improved strategies to mitigate those risks.

Identifiants

DOI: 10.1016/j.jss.2022.01.012 PMID: 35287027

pubmed: 35287027

pii: S0022-4804(22)00031-2

doi: 10.1016/j.jss.2022.01.012

pii:

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Pagination

181-193

Machine Learning Improves Prediction Over Logistic Regression on Resected Colon Cancer Patients.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Informations de copyright

Auteurs

Grey Leonard (G)

Charles South (C)

Courtney Balentine (C)

Matthew Porembka (M)

John Mansour (J)

Sam Wang (S)

Adam Yopp (A)

Patricio Polanco (P)

Herbert Zeh (H)

Mathew Augustine (M)

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Smoking Cessation and Incident Cardiovascular Disease.

Evaluation of Low-Value Services Across Major Medicare Advantage Insurers and Traditional Medicare.

Effectiveness of Virtual Yoga for Chronic Low Back Pain: A Randomized Clinical Trial.

Classifications MeSH