Prediction of five-year survival among esophageal cancer patients using machine learning.

Esophageal cancer Machine learning Prediction model Public health challenges Survival

Journal

Heliyon
ISSN: 2405-8440
Titre abrégé: Heliyon
Pays: England
ID NLM: 101672560

Informations de publication

Date de publication:
Dec 2023
Historique:
received: 12 10 2023
revised: 16 11 2023
accepted: 16 11 2023
medline: 21 12 2023
pubmed: 21 12 2023
entrez: 21 12 2023
Statut: epublish

Résumé

Considering the silent progression of esophageal cancer, the survival prediction of this disease is crucial in enhancing the quality of life of these patients globally. So far, no prediction solution has been introduced for the survival of EC in Iran based on the machine learning approach. So, this study aims to develop a prediction model for the five-year survival of EC based on the ML approach to promote clinical outcomes and various treatment and preventive plans. In this retrospective study, we investigated the 1656 cases of survived and non-survived EC patients belonging to Imam Khomeini Hospital in Sari City from 2013 to 2020. The multivariable regression analysis was used to select the best predictors of five-year survival. We leveraged random forest, eXtreme Gradient Boosting, support vector machine, artificial neural networks, Bayesian networks, J-48 decision tree, and K-nearest neighborhood to develop the prediction models. To get the best model for predicting the five-year survival of EC, we compared them using the area under the receiver operator characteristics. The age at diagnosis, body mass index, smoking, obstruction, dysphagia, weight loss, lymphadenopathy, chemotherapy, radiotherapy, family history of EC, tumor stage, type of appearance, histological type, grade of differentiation, tumor location, tumor size, lymphatic invasion, vascular invasion, and platelet albumin ratio were considered as the best predictors associated with the five-year survival of EC based on the regression analysis. In this respect, the random forest with the area under the receiver operator characteristics of 0.95 was identified as a superior model. The experimental results of the current study showed that the random forest could have a significant role in enhancing the quality of care in EC patients by increasing the effectiveness of follow-up and treatment measures introduced by care providers.

Sections du résumé

Background and aim UNASSIGNED
Considering the silent progression of esophageal cancer, the survival prediction of this disease is crucial in enhancing the quality of life of these patients globally. So far, no prediction solution has been introduced for the survival of EC in Iran based on the machine learning approach. So, this study aims to develop a prediction model for the five-year survival of EC based on the ML approach to promote clinical outcomes and various treatment and preventive plans.
Material and methods UNASSIGNED
In this retrospective study, we investigated the 1656 cases of survived and non-survived EC patients belonging to Imam Khomeini Hospital in Sari City from 2013 to 2020. The multivariable regression analysis was used to select the best predictors of five-year survival. We leveraged random forest, eXtreme Gradient Boosting, support vector machine, artificial neural networks, Bayesian networks, J-48 decision tree, and K-nearest neighborhood to develop the prediction models. To get the best model for predicting the five-year survival of EC, we compared them using the area under the receiver operator characteristics.
Results UNASSIGNED
The age at diagnosis, body mass index, smoking, obstruction, dysphagia, weight loss, lymphadenopathy, chemotherapy, radiotherapy, family history of EC, tumor stage, type of appearance, histological type, grade of differentiation, tumor location, tumor size, lymphatic invasion, vascular invasion, and platelet albumin ratio were considered as the best predictors associated with the five-year survival of EC based on the regression analysis. In this respect, the random forest with the area under the receiver operator characteristics of 0.95 was identified as a superior model.
Conclusion UNASSIGNED
The experimental results of the current study showed that the random forest could have a significant role in enhancing the quality of care in EC patients by increasing the effectiveness of follow-up and treatment measures introduced by care providers.

Identifiants

pubmed: 38125437
doi: 10.1016/j.heliyon.2023.e22654
pii: S2405-8440(23)09862-6
pmc: PMC10730993
doi:

Types de publication

Journal Article

Langues

eng

Pagination

e22654

Informations de copyright

© 2023 The Author.

Déclaration de conflit d'intérêts

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Raoof Nopour (R)

Department of Health Information Management, Student Research Committee, School of Health Management and Information Sciences Branch, Iran University of Medical Sciences, Tehran, Iran.

Classifications MeSH