Construction and validation of risk prediction models for pulmonary embolism in hospitalized patients based on different machine learning methods.

Keywords: hospitalized patients; machine learning; prediction models; pulmonary embolism; random forest

Journal

Frontiers in cardiovascular medicine

ISSN: 2297-055X

Abbreviated title: Front Cardiovasc Med

Country: Switzerland

NLM ID: 101653388

Publication information

Publication date:
2024

History:

received: 6 October 2023

accepted: 11 June 2024

medline: 10 July 2024

pubmed: 10 July 2024

entrez: 10 July 2024

Status: epublish

Abstract

This study aimed to apply different machine learning (ML) methods to construct risk prediction models for pulmonary embolism (PE) in hospitalized patients, and to evaluate and compare the predictive efficacy and clinical benefit of each model.

We conducted a retrospective study of 332 participants (172 PE-positive and 160 PE-negative cases) recruited from Guangdong Medical University. Participants were randomly divided into a training group (70%) and a validation group (30%). Baseline data were compared using univariate analysis, and potential independent risk factors associated with PE were then identified through univariate and multivariate logistic regression analysis. Six ML models were developed: Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), Naive Bayes (NB), Support Vector Machine (SVM), and AdaBoost. The predictive efficacy of the models was compared using receiver operating characteristic (ROC) curve analysis and the area under the curve (AUC). Clinical benefit was assessed using decision curve analysis (DCA).

Logistic regression analysis identified lower extremity deep venous thrombosis, elevated D-dimer, shortened activated partial thromboplastin time, and increased red blood cell distribution width as potential independent risk factors for PE. Among the six ML models, the RF model achieved the highest AUC (0.778). DCA also consistently indicated that the RF model offered the greatest clinical benefit.

Of the six ML models developed, the RF model showed the highest predictive efficacy and clinical benefit for identifying and predicting PE in hospitalized patients.
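As a rough illustration of the evaluation steps the abstract describes (a 70/30 random split, the AUC of the ROC curve, and the net benefit plotted in a decision curve), the following Python sketch uses only the standard library. The function names, the toy labels and scores, and the fixed random seed are assumptions made for illustration and are not taken from the study. The net benefit formula is the one commonly used in decision curve analysis: true positives per patient minus false positives per patient, weighted by the odds of the threshold probability.

```python
import random


def split_train_validation(ids, train_fraction=0.7, seed=0):
    """Shuffle the ids and split them into training and validation groups."""
    shuffled = list(ids)
    random.Random(seed).shuffle(shuffled)  # seed is an arbitrary illustrative choice
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def roc_auc(labels, scores):
    """AUC as the probability that a random positive case scores higher
    than a random negative case (ties count as one half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def net_benefit(labels, scores, threshold):
    """Net benefit at one threshold probability pt:
    TP/n - (FP/n) * pt / (1 - pt)."""
    n = len(labels)
    tp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 1)
    fp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 0)
    return tp / n - (fp / n) * threshold / (1 - threshold)


# Cohort size from the abstract; the resulting group sizes are only what
# this sketch produces, not counts reported by the study.
train_ids, validation_ids = split_train_validation(range(332))
print(len(train_ids), len(validation_ids))  # 232 100

# Toy data (hypothetical): 1 = PE positive, 0 = PE negative,
# scores = predicted probabilities from some model.
toy_labels = [1, 1, 1, 0, 1, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]

print(roc_auc(toy_labels, toy_scores))           # 0.9375
print(net_benefit(toy_labels, toy_scores, 0.5))  # 0.25
```

A decision curve is obtained by evaluating `net_benefit` over a range of threshold probabilities and comparing each model against the treat-all and treat-none strategies.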

Identifiers

DOI: 10.3389/fcvm.2024.1308017
PMID: 38984357
PMCID: PMC11232034

Publication types

Journal Article

Languages

eng

Pagination

1308017

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Authors

Tao Huang

Zhihai Huang

Xiaodong Peng

Lingpin Pang

Jie Sun

Jinbo Wu

Jinman He

Kaili Fu

Jun Wu

Xishi Sun

MeSH classifications