StackDPPred: Multiclass Prediction of Defensin Peptides using Stacked Ensemble Learning with Optimized Features.


Journal

Methods (San Diego, Calif.)
ISSN: 1095-9130
Titre abrégé: Methods
Pays: United States
ID NLM: 9426302

Informations de publication

Date de publication:
20 Aug 2024
Historique:
received: 01 07 2024
revised: 30 07 2024
accepted: 13 08 2024
medline: 23 8 2024
pubmed: 23 8 2024
entrez: 22 8 2024
Statut: aheadofprint

Résumé

Host defense or antimicrobial peptides(AMPs) are promising candidates for protecting host against microbial pathogens for example bacteria, virus, fungi, yeast. Defensins are the type of AMPs that act as potential therapeutic drug agent and perform vital role in various biological process. Conventional Experiments to identify defensin peptides (DPs) are time consuming and expensive. Thus, the shortcomings of wet lab experiments are leveraged by computational methods to accurately predict the functional types of DPs. In this paper, we aim to propose a novel multi-class ensemble-based prediction model called StackDPPred for identifying the properties of DPs. The peptide sequences are encoded using split amino acid composition(SAAC), segmented position specific scoring matrix(SegPSSM), histogram of oriented gradients-based PSSM(HOGPSSM) and feature extraction based graphical and statistical(FEGS) descriptors.Next, principal component analysis (PCA) is used to select the best subset of attributes.After that, the optimized features are fed into single machine learning and stacking-based ensemble classifiers. Furthermore, the ablation study demonstrate the robustness and efficacy of the stacking approach using reduced features for predicting DPs and their families.The proposed StackDPPred method improves the overall accuracy for all five family types by 13.41% and 7.62% compare to other existing DPs predictors iDPF-PseRAAC and iDEF-PseRAAC on validation test.Additionally, we applied the local interpretable model-agnostic explainations(LIME) algorithm to understand the contribution of selected features to the overall prediction.We believe, StackDPPred could serve as a valuable tool accelerating the screening of large-scale DPs and peptide-based drug discovery process.

Identifiants

pubmed: 39173785
pii: S1046-2023(24)00182-8
doi: 10.1016/j.ymeth.2024.08.001
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

Copyright © 2024 The Author(s). Published by Elsevier Inc. All rights reserved.

Auteurs

Muhammad Arif (M)

College of Science and Engineering, Hamad Bin Khalifa University, Doha 34110, Qatar.

Saleh Musleh (S)

College of Science and Engineering, Hamad Bin Khalifa University, Doha 34110, Qatar.

Ali Ghulam (A)

Information Technology Centre, Sindh Agriculture University, Sindh, Pakistan.

Huma Fida (H)

Department of Microbiology, Abdul Wali Khan University Mardan 23200, KPK, Pakistan.

Yasser Alqahtani (Y)

Independent Researcher, Madinah, Saudi Arabia.

Tanvir Alam (T)

College of Science and Engineering, Hamad Bin Khalifa University, Doha 34110, Qatar. Electronic address: talam@hbku.edu.qa.

Classifications MeSH