Prediction of protein aggregation propensity employing SqFt-based logistic regression model.


Journal

International journal of biological macromolecules
ISSN: 1879-0003
Titre abrégé: Int J Biol Macromol
Pays: Netherlands
ID NLM: 7909578

Informations de publication

Date de publication:
30 Sep 2023
Historique:
received: 12 04 2023
revised: 28 06 2023
accepted: 26 07 2023
medline: 27 9 2023
pubmed: 30 7 2023
entrez: 29 7 2023
Statut: ppublish

Résumé

Here we present a novel machine-learning approach to predict protein aggregation propensity (PAP) which is a key factor in the formation of amyloid fibrils based on logistic regression (LR). Amyloid fibrils are associated with various neurodegenerative diseases (ND) such as Alzheimer's disease (AD) and Parkinson's disease (PD), which are caused by oxidative stress and impaired protein homeostasis. Accordingly, the paper uses a dataset of hexapeptides with known aggregation tendencies and eight physiochemical features to train and test the LR model. Also, it evaluates the performance of the LR model using F-measure and Matthews correlation coefficient (MCC) as metrics and compares it with other existing methods. Moreover, it investigates the effect of combining sequence and feature information in the prediction. In conclusion, the LR model with sequence and feature information achieves high F-measure (0.841) and MCC (0.6692), outperforming other methods and demonstrating its efficiency and reliability for PAP prediction. In addition, the overall performance of the concluded method was higher than the other known servers, for instance, Aggrescan, Metamyl, Foldamyloid, and PASTA 2.0. The LR model can be accessed at: https://github.com/KatherineEshari/Protein-aggregation-prediction.

Identifiants

pubmed: 37516225
pii: S0141-8130(23)02931-8
doi: 10.1016/j.ijbiomac.2023.126036
pii:
doi:

Substances chimiques

Protein Aggregates 0
Amyloid 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

126036

Informations de copyright

Copyright © 2023. Published by Elsevier B.V.

Déclaration de conflit d'intérêts

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Fatemeh Eshari (F)

Protein Biotechnology Research Lab (PBRL), School of Biology, College of Science, University of Tehran, Tehran, Iran.

Fahime Momeni (F)

School of Mathematics, Statistics and Computer Sciences, College of Science, University of Tehran, Tehran, Iran.

Amirreza Faraj Nezhadi (AF)

Protein Biotechnology Research Lab (PBRL), School of Biology, College of Science, University of Tehran, Tehran, Iran; School of Chemical Engineering, College of Engineering, University of Tehran, Tehran, Iran.

Soudabeh Shemehsavar (S)

School of Mathematics, Statistics and Computer Sciences, College of Science, University of Tehran, Tehran, Iran.

Mehran Habibi-Rezaei (M)

Protein Biotechnology Research Lab (PBRL), School of Biology, College of Science, University of Tehran, Tehran, Iran; Center of Excellence in NanoBiomedicine, University of Tehran, Tehran, Iran. Electronic address: mhabibi@ut.ac.ir.

Articles similaires

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Humans Middle Aged Female Male Surveys and Questionnaires

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis

Classifications MeSH