Census of exposed aggregation-prone regions in proteomes.

amyloids bioinformatics evolution kingdoms of life large-scale analysis protein aggregation

Journal

Briefings in bioinformatics
ISSN: 1477-4054
Titre abrégé: Brief Bioinform
Pays: England
ID NLM: 100912837

Informations de publication

Date de publication:
20 07 2023
Historique:
received: 23 12 2022
revised: 30 03 2023
accepted: 21 04 2023
medline: 23 10 2023
pubmed: 18 5 2023
entrez: 18 5 2023
Statut: ppublish

Résumé

Loss of solubility usually leads to the detrimental elimination of protein function. In some cases, the protein aggregation is also required for beneficial functions. Given the duality of this phenomenon, it remains a fundamental question how natural selection controls the aggregation. The exponential growth of genomic sequence data and recent progress with in silico predictors of the aggregation allows approaching this problem by a large-scale bioinformatics analysis. Most of the aggregation-prone regions are hidden within the 3D structure, rendering them inaccessible for the intermolecular interactions responsible for aggregation. Thus, the most realistic census of the aggregation-prone regions requires crossing aggregation prediction with information about the location of the natively unfolded regions. This allows us to detect so-called 'exposed aggregation-prone regions' (EARs). Here, we analyzed the occurrence and distribution of the EARs in 76 reference proteomes from the three kingdoms of life. For this purpose, we used a bioinformatics pipeline, which provides a consensual result based on several predictors of aggregation. Our analysis revealed a number of new statistically significant correlations about the presence of EARs in different organisms, their dependence on protein length, cellular localizations, co-occurrence with short linear motifs and the level of protein expression. We also obtained a list of proteins with the conserved aggregation-prone sequences for further experimental tests. Insights gained from this work led to a deeper understanding of the relationship between protein evolution and aggregation.

Identifiants

pubmed: 37200152
pii: 7169139
doi: 10.1093/bib/bbad183
pii:
doi:

Substances chimiques

Proteome 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© The Author(s) 2023. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Auteurs

Théo Falgarone (T)

Centre de Recherche en Biologie cellulaire de Montpellier, CNRS, Université Montpellier, Montpellier, 34293, France.

Etienne Villain (E)

Centre de Recherche en Biologie cellulaire de Montpellier, CNRS, Université Montpellier, Montpellier, 34293, France.

Francois Richard (F)

Centre de Recherche en Biologie cellulaire de Montpellier, CNRS, Université Montpellier, Montpellier, 34293, France.

Zarifa Osmanli (Z)

Centre de Recherche en Biologie cellulaire de Montpellier, CNRS, Université Montpellier, Montpellier, 34293, France.
Biophysics Institute, Ministry of Science and Education of Azerbaijan Republic, Az1141, Baku, Azerbaijan.

Andrey V Kajava (AV)

Centre de Recherche en Biologie cellulaire de Montpellier, CNRS, Université Montpellier, Montpellier, 34293, France.
Institut de Biologie Computationnelle, Université Montpellier, 34095 Montpellier, France.

Articles similaires

Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Humans Molecular Chaperones Brain Protein Folding Mutation

Brain malformations and seizures by impaired chaperonin function of TRiC.

Florian Kraft, Piere Rodriguez-Aliaga, Weimin Yuan et al.
1.00
Humans Chaperonin Containing TCP-1 Brain Seizures Protein Folding

Classifications MeSH