Prediction of protein aggregation propensity employing SqFt-based logistic regression model.
Logistic regression
Machine learning
Protein aggregation
Journal
International journal of biological macromolecules
ISSN: 1879-0003
Titre abrégé: Int J Biol Macromol
Pays: Netherlands
ID NLM: 7909578
Informations de publication
Date de publication:
30 Sep 2023
30 Sep 2023
Historique:
received:
12
04
2023
revised:
28
06
2023
accepted:
26
07
2023
medline:
27
9
2023
pubmed:
30
7
2023
entrez:
29
7
2023
Statut:
ppublish
Résumé
Here we present a novel machine-learning approach to predict protein aggregation propensity (PAP) which is a key factor in the formation of amyloid fibrils based on logistic regression (LR). Amyloid fibrils are associated with various neurodegenerative diseases (ND) such as Alzheimer's disease (AD) and Parkinson's disease (PD), which are caused by oxidative stress and impaired protein homeostasis. Accordingly, the paper uses a dataset of hexapeptides with known aggregation tendencies and eight physiochemical features to train and test the LR model. Also, it evaluates the performance of the LR model using F-measure and Matthews correlation coefficient (MCC) as metrics and compares it with other existing methods. Moreover, it investigates the effect of combining sequence and feature information in the prediction. In conclusion, the LR model with sequence and feature information achieves high F-measure (0.841) and MCC (0.6692), outperforming other methods and demonstrating its efficiency and reliability for PAP prediction. In addition, the overall performance of the concluded method was higher than the other known servers, for instance, Aggrescan, Metamyl, Foldamyloid, and PASTA 2.0. The LR model can be accessed at: https://github.com/KatherineEshari/Protein-aggregation-prediction.
Identifiants
pubmed: 37516225
pii: S0141-8130(23)02931-8
doi: 10.1016/j.ijbiomac.2023.126036
pii:
doi:
Substances chimiques
Protein Aggregates
0
Amyloid
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
126036Informations de copyright
Copyright © 2023. Published by Elsevier B.V.
Déclaration de conflit d'intérêts
Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.