Development and Validation of an XGBoost-Algorithm-Powered Survival Model for Predicting In-Hospital Mortality Based on 545,388 Isolated Severe Traumatic Brain Injury Patients from the TQIP Database.
Trauma Quality Improvement Program (TQIP)
extreme gradient boosting (XGBoost)
machine learning
prediction model
survival analysis
traumatic brain injury
Journal
Journal of personalized medicine
ISSN: 2075-4426
Abbreviated title: J Pers Med
Country: Switzerland
NLM ID: 101602269
Publication information
Publication date:
19 Sep 2023
History:
received: 16 Aug 2023
revised: 4 Sep 2023
accepted: 14 Sep 2023
medline: 28 Sep 2023
pubmed: 28 Sep 2023
entrez: 28 Sep 2023
Status:
epublish
Abstract
BACKGROUND
Traumatic brain injury (TBI) is a significant global health issue. Traditional injury severity grading tools, such as the Glasgow Coma Scale (GCS) and the Abbreviated Injury Scale (AIS), struggle to capture outcomes after TBI.
AIM AND METHODS
This paper aims to develop and validate a predictive model for in-hospital mortality in patients with isolated severe traumatic brain injury, and to identify the most influential predictors, using extreme gradient boosting (XGBoost), a machine learning algorithm that combines the predictions of multiple weak models into a strong predictive model. In total, 545,388 patients from the 2013-2021 American College of Surgeons Trauma Quality Improvement Program (TQIP) database were included, with 80% of the patients used for model training and 20% for the final model test. The primary outcome was in-hospital mortality. Predictors were patients' demographics, admission status, comorbidities, and clinical characteristics. Penalized Cox regression models were used to investigate the associations between the survival outcomes and the predictors and to select the best predictors. An XGBoost-powered Cox regression model was then used to predict the survival outcome. The performance of the models was evaluated using Harrell's concordance index (C-index). The time-dependent area under the receiver operating characteristic curve (AUC) was used to evaluate the dynamic cumulative performance of the models. The importance of the predictors in the final prediction model was evaluated using Shapley additive explanations (SHAP) values.
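As an illustration of the evaluation metric named above, the following is a minimal sketch of Harrell's C-index in plain Python. It is not the authors' code. It assumes that the survival time is the length of stay in days, that the event is in-hospital death, and that patients discharged alive are treated as censored. The function name `harrell_c_index` and the toy values are hypothetical, and pairs with tied times are skipped as a simplification.

```python
from itertools import combinations


def harrell_c_index(times, events, risk_scores):
    """Fraction of comparable patient pairs that the risk score orders correctly.

    A pair is comparable when the patient with the shorter time had the event.
    Ties in the risk score count as half a concordant pair.
    Simplification (assumption): pairs with identical times are skipped.
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        if times[i] == times[j]:
            continue
        # 'early' is the patient with the shorter observed time
        early, late = (i, j) if times[i] < times[j] else (j, i)
        if not events[early]:
            continue  # shorter time is censored, so the order is unknown
        comparable += 1
        if risk_scores[early] > risk_scores[late]:
            concordant += 1.0
        elif risk_scores[early] == risk_scores[late]:
            concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy data, not taken from the study
los_days = [2, 5, 7, 10, 12]       # length of stay in days
died = [1, 1, 0, 1, 0]             # 1 = died in hospital, 0 = censored
risk = [0.9, 0.4, 0.6, 0.3, 0.1]   # higher score = higher predicted risk

c_index = harrell_c_index(los_days, died, risk)
print(f"C-index: {c_index:.3f}")
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering of patients by risk.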
RESULTS
On average, the final XGBoost-powered Cox regression model performed at an acceptable level in the test dataset for patients with a length of stay up to 250 days (mean time-dependent AUC = 0.713). However, for patients with a length of stay between 20 and 213 days, the performance of the model was relatively poor (time-dependent AUC < 0.7). When limited to patients with a length of stay ≤20 days, who account for 95.4% of all patients, the model achieved excellent performance (mean time-dependent AUC = 0.813). When further limited to patients with a length of stay ≤5 days, who account for two-thirds of all patients, the model achieved outstanding performance (mean time-dependent AUC = 0.917).
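The idea of a mean time-dependent AUC over a length-of-stay window can be sketched as follows. This is a simplified cumulative/dynamic AUC, not the estimator used in the study: it ignores censoring weights, and patients censored before the evaluation time are counted as neither cases nor controls. The function names `auc_at_time` and `mean_auc`, the time grids, and the toy data are all hypothetical.

```python
def auc_at_time(times, events, risk_scores, t):
    """Simplified cumulative/dynamic AUC at day t.

    Cases: patients who died on or before day t.
    Controls: patients still in hospital after day t.
    Assumption: no inverse-probability-of-censoring weights are applied.
    Returns None when there are no cases or no controls at day t.
    """
    cases = [r for T, e, r in zip(times, events, risk_scores) if T <= t and e]
    controls = [r for T, r in zip(times, risk_scores) if T > t]
    if not cases or not controls:
        return None
    wins = 0.0
    for case_score in cases:
        for control_score in controls:
            if case_score > control_score:
                wins += 1.0
            elif case_score == control_score:
                wins += 0.5
    return wins / (len(cases) * len(controls))


def mean_auc(times, events, risk_scores, grid):
    """Average the AUC over the evaluation days in `grid` where it is defined."""
    values = [auc_at_time(times, events, risk_scores, t) for t in grid]
    values = [v for v in values if v is not None]
    if not values:
        raise ValueError("AUC undefined at every time point in the grid")
    return sum(values) / len(values)


# Hypothetical toy data, not taken from the study
los_days = [1, 3, 4, 8, 15, 30]
died = [1, 1, 0, 1, 0, 1]
risk = [0.95, 0.80, 0.30, 0.20, 0.25, 0.40]

early_window = mean_auc(los_days, died, risk, grid=[2, 5])
longer_window = mean_auc(los_days, died, risk, grid=[2, 5, 10, 20])
print(f"mean AUC, short window:  {early_window:.3f}")
print(f"mean AUC, longer window: {longer_window:.3f}")
```

In this toy example the risk score separates early deaths well but ranks later deaths poorly, so the mean AUC drops as the window is extended, which mirrors the pattern described in the results.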
CONCLUSION
The XGBoost-powered Cox regression model achieves outstanding predictive ability for in-hospital mortality during the first 5 days, primarily based on the severity of the injury, the GCS on admission, and the patient's age. These variables continue to demonstrate excellent predictive ability up to 20 days after admission, a period of care that accounts for over 95% of severe TBI patients. Beyond 20 days of care, other factors appear to be the primary drivers of in-hospital mortality, indicating a potential window of opportunity for improving outcomes.
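To show how Shapley values attribute a prediction to individual predictors such as injury severity, GCS, and age, here is a minimal sketch that computes exact Shapley values by enumerating feature subsets. It is not the SHAP library and not the study's model. The feature names (`head_ais`, `gcs`, `age`), the linear log-risk function, its coefficients, and the reference patient are all made up for illustration. Features outside a subset are set to a single reference value, which is a simplification.

```python
from itertools import combinations
from math import factorial


def shapley_values(model, instance, baseline):
    """Exact Shapley values for one prediction.

    Features outside a coalition take their value from `baseline`
    (single-reference simplification, an assumption of this sketch).
    """
    names = list(instance)
    n = len(names)

    def value(coalition):
        x = {k: (instance[k] if k in coalition else baseline[k]) for k in names}
        return model(x)

    phi = {}
    for name in names:
        others = [k for k in names if k != name]
        total = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                coalition = set(subset)
                total += weight * (value(coalition | {name}) - value(coalition))
        phi[name] = total
    return phi


def toy_log_risk(x):
    """Hypothetical linear log-risk; coefficients are NOT estimates from the study."""
    return 0.8 * x["head_ais"] - 0.2 * x["gcs"] + 0.03 * x["age"]


patient = {"head_ais": 5, "gcs": 3, "age": 70}     # hypothetical patient
reference = {"head_ais": 3, "gcs": 9, "age": 45}   # hypothetical reference

phi = shapley_values(toy_log_risk, patient, reference)
for name, contribution in sorted(phi.items(), key=lambda kv: -abs(kv[1])):
    print(f"{name}: {contribution:+.2f}")
```

The contributions sum to the difference between the patient's prediction and the reference prediction, which is the property that makes Shapley values usable for ranking predictors.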
Identifiers
pubmed: 37763168
pii: jpm13091401
doi: 10.3390/jpm13091401
pmc: PMC10533165
Publication types
Journal Article
Languages
eng