Emergent time scales of epistasis in protein evolution.

epistasis generative probabilistic models protein evolution sequence landscapes

Journal

Proceedings of the National Academy of Sciences of the United States of America
ISSN: 1091-6490
Titre abrégé: Proc Natl Acad Sci U S A
Pays: United States
ID NLM: 7505876

Informations de publication

Date de publication:
Oct 2024
Historique:
medline: 26 9 2024
pubmed: 26 9 2024
entrez: 26 9 2024
Statut: ppublish

Résumé

We introduce a data-driven epistatic model of protein evolution, capable of generating evolutionary trajectories spanning very different time scales reaching from individual mutations to diverged homologs. Our in silico evolution encompasses random nucleotide mutations, insertions and deletions, and models selection using a fitness landscape, which is inferred via a generative probabilistic model for protein families. We show that the proposed framework accurately reproduces the sequence statistics of both short-time (experimental) and long-time (natural) protein evolution, suggesting applicability also to relatively data-poor intermediate evolutionary time scales, which are currently inaccessible to evolution experiments. Our model uncovers a highly collective nature of epistasis, gradually changing the fitness effect of mutations in a diverging sequence context, rather than acting via strong interactions between individual mutations. This collective nature triggers the emergence of a long evolutionary time scale, separating fast mutational processes inside a given sequence context, from the slow evolution of the context itself. The model quantitatively reproduces epistatic phenomena such as contingency and entrenchment, as well as the loss of predictability in protein evolution observed in deep mutational scanning experiments of distant homologs. It thereby deepens our understanding of the interplay between mutation and selection in shaping protein diversity and functions, allows one to statistically forecast evolution, and challenges the prevailing independent-site models of protein evolution, which are unable to capture the fundamental importance of epistasis.

Identifiants

pubmed: 39325427
doi: 10.1073/pnas.2406807121
doi:

Substances chimiques

Proteins 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e2406807121

Déclaration de conflit d'intérêts

Competing interests statement:The authors declare no competing interest.

Auteurs

Leonardo Di Bari (L)

Dipartimento Scienza Applicata e Tecnologia, Politecnico di Torino, I-10129 Torino, Italy.

Matteo Bisardi (M)

Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire de Biologie Computationnelle et Quantitative, Paris F-75005, France.

Sabrina Cotogno (S)

Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire de Biologie Computationnelle et Quantitative, Paris F-75005, France.

Martin Weigt (M)

Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratoire de Biologie Computationnelle et Quantitative, Paris F-75005, France.

Francesco Zamponi (F)

Dipartimento di Fisica, Sapienza Università di Roma, 00185 Rome, Italy.

Articles similaires

T-Lymphocytes, Regulatory Lung Neoplasms Proto-Oncogene Proteins p21(ras) Animals Humans

Pathogenic mitochondrial DNA mutations inhibit melanoma metastasis.

Spencer D Shelton, Sara House, Luiza Martins Nascentes Melo et al.
1.00
DNA, Mitochondrial Humans Melanoma Mutation Neoplasm Metastasis
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning

Prevalence and implications of fragile X premutation screening in Thailand.

Areerat Hnoonual, Sunita Kaewfai, Chanin Limwongse et al.
1.00
Humans Fragile X Mental Retardation Protein Thailand Male Female

Classifications MeSH