Construction and validation of risk prediction models for pulmonary embolism in hospitalized patients based on different machine learning methods.
hospitalized patients
machine learning
prediction models
pulmonary embolism
random forest
Journal
Frontiers in cardiovascular medicine
ISSN: 2297-055X
Titre abrégé: Front Cardiovasc Med
Pays: Switzerland
ID NLM: 101653388
Informations de publication
Date de publication:
2024
2024
Historique:
received:
06
10
2023
accepted:
11
06
2024
medline:
10
7
2024
pubmed:
10
7
2024
entrez:
10
7
2024
Statut:
epublish
Résumé
This study aims to apply different machine learning (ML) methods to construct risk prediction models for pulmonary embolism (PE) in hospitalized patients, and to evaluate and compare the predictive efficacy and clinical benefit of each model. We conducted a retrospective study involving 332 participants (172 PE positive cases and 160 PE negative cases) recruited from Guangdong Medical University. Participants were randomly divided into a training group (70%) and a validation group (30%). Baseline data were analyzed using univariate analysis, and potential independent risk factors associated with PE were further identified through univariate and multivariate logistic regression analysis. Six ML models, namely Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), Naive Bayes (NB), Support Vector Machine (SVM), and AdaBoost were developed. The predictive efficacy of each model was compared using the receiver operating characteristic (ROC) curve analysis and the area under the curve (AUC). Clinical benefit was assessed using decision curve analysis (DCA). Logistic regression analysis identified lower extremity deep venous thrombosis, elevated D-dimer, shortened activated partial prothrombin time, and increased red blood cell distribution width as potential independent risk factors for PE. Among the six ML models, the RF model achieved the highest AUC of 0.778. Additionally, DCA consistently indicated that the RF model offered the greatest clinical benefit. This study developed six ML models, with the RF model exhibiting the highest predictive efficacy and clinical benefit in the identification and prediction of PE occurrence in hospitalized patients.
Identifiants
pubmed: 38984357
doi: 10.3389/fcvm.2024.1308017
pmc: PMC11232034
doi:
Types de publication
Journal Article
Langues
eng
Pagination
1308017Informations de copyright
© 2024 Huang, Huang, Peng, Pang, Sun, Wu, He, Fu, Wu and Sun.
Déclaration de conflit d'intérêts
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.