Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes.


Journal

Proceedings of the National Academy of Sciences of the United States of America
ISSN: 1091-6490
Titre abrégé: Proc Natl Acad Sci U S A
Pays: United States
ID NLM: 7505876

Informations de publication

Date de publication:
25 01 2022
Historique:
accepted: 13 12 2021
entrez: 13 1 2022
pubmed: 14 1 2022
medline: 27 1 2022
Statut: ppublish

Résumé

The emergence of new variants of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a major concern given their potential impact on the transmissibility and pathogenicity of the virus as well as the efficacy of therapeutic interventions. Here, we predict the mutability of all positions in SARS-CoV-2 protein domains to forecast the appearance of unseen variants. Using sequence data from other coronaviruses, preexisting to SARS-CoV-2, we build statistical models that not only capture amino acid conservation but also more complex patterns resulting from epistasis. We show that these models are notably superior to conservation profiles in estimating the already observable SARS-CoV-2 variability. In the receptor binding domain of the spike protein, we observe that the predicted mutability correlates well with experimental measures of protein stability and that both are reliable mutability predictors (receiver operating characteristic areas under the curve ∼0.8). Most interestingly, we observe an increasing agreement between our model and the observed variability as more data become available over time, proving the anticipatory capacity of our model. When combined with data concerning the immune response, our approach identifies positions where current variants of concern are highly overrepresented. These results could assist studies on viral evolution and future viral outbreaks and, in particular, guide the exploration and anticipation of potentially harmful future SARS-CoV-2 variants.

Identifiants

pubmed: 35022216
pii: 2113118119
doi: 10.1073/pnas.2113118119
pmc: PMC8795541
pii:
doi:

Substances chimiques

Epitopes 0
Spike Glycoprotein, Coronavirus 0
Viral Proteins 0
spike protein, SARS-CoV-2 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

Copyright © 2022 the Author(s). Published by PNAS.

Déclaration de conflit d'intérêts

The authors declare no competing interest.

Références

Bioinformatics. 2012 Jan 15;28(2):184-90
pubmed: 22101153
Sci Immunol. 2020 Jun 11;5(48):
pubmed: 32527802
Nat Biotechnol. 2017 Feb;35(2):128-135
pubmed: 28092658
Science. 2018 Feb 16;359(6377):765-770
pubmed: 29449486
Cell Rep. 2021 Oct 12;37(2):109814
pubmed: 34599871
Bioinformatics. 2018 Dec 1;34(23):4121-4123
pubmed: 29790939
Cell Rep Med. 2021 Jun 15;2(6):100312
pubmed: 34056627
Nucleic Acids Res. 2017 Jan 4;45(D1):D482-D490
pubmed: 27899678
Nature. 2020 May;581(7807):215-220
pubmed: 32225176
Proc Natl Acad Sci U S A. 2021 Feb 2;118(5):
pubmed: 33514660
Bioinformatics. 2015 Mar 15;31(6):926-32
pubmed: 25398609
Proc Natl Acad Sci U S A. 2011 Jul 12;108(28):11530-5
pubmed: 21690407
Infect Genet Evol. 2020 Sep;83:104351
pubmed: 32387564
Nat Commun. 2020 Nov 26;11(1):6013
pubmed: 33243994
Cell. 2021 Apr 29;184(9):2372-2383.e9
pubmed: 33743213
Nature. 2020 Sep;585(7824):174-177
pubmed: 32901123
Med Drug Discov. 2021 Jun;10:100086
pubmed: 33681755
Biochemistry. 1990 Sep 4;29(35):8033-41
pubmed: 2261461
Proc Natl Acad Sci U S A. 2011 Dec 6;108(49):E1293-301
pubmed: 22106262
Proc Natl Acad Sci U S A. 2018 Jan 23;115(4):E564-E573
pubmed: 29311326
Cell. 2021 Apr 29;184(9):2384-2393.e12
pubmed: 33794143
Proc Natl Acad Sci U S A. 2020 Dec 8;117(49):31519-31526
pubmed: 33203681
Proc Natl Acad Sci U S A. 2009 Jan 6;106(1):67-72
pubmed: 19116270
Nephron. 2021;145(4):392-403
pubmed: 33910211
Cell Host Microbe. 2021 Mar 10;29(3):463-476.e6
pubmed: 33592168
Nat Methods. 2018 Oct;15(10):816-822
pubmed: 30250057
Science. 2021 Apr 9;372(6538):
pubmed: 33658326
Immunity. 2013 Mar 21;38(3):606-17
pubmed: 23521886
Proc Natl Acad Sci U S A. 2020 Sep 22;117(38):23652-23662
pubmed: 32868447
Mol Biol Evol. 2016 Jan;33(1):268-80
pubmed: 26446903
Cell Rep Med. 2020 Jun 23;1(3):100036
pubmed: 32835302
Rep Prog Phys. 2018 Mar;81(3):032601
pubmed: 29120346
J Virol. 2013 Jun;87(12):7039-45
pubmed: 23596293
BMC Bioinformatics. 2011 Mar 17;12:77
pubmed: 21414208
Nat Rev Immunol. 2021 Jun;21(6):340-341
pubmed: 33927376
Nature. 2021 Aug;596(7871):276-280
pubmed: 34237773
Cell Mol Immunol. 2020 Jun;17(6):613-620
pubmed: 32203189
Nat Microbiol. 2021 Jul;6(7):821-823
pubmed: 34108654
Cell. 2020 Sep 3;182(5):1295-1310.e20
pubmed: 32841599
PLoS Comput Biol. 2011 Oct;7(10):e1002195
pubmed: 22039361
Biology (Basel). 2021 Jan 26;10(2):
pubmed: 33530355
Nucleic Acids Res. 2019 Jan 8;47(D1):D339-D343
pubmed: 30357391
Nat Protoc. 2020 Jul;15(7):2141-2142
pubmed: 32555466
Nucleic Acids Res. 2021 Jan 8;49(D1):D412-D419
pubmed: 33125078
FEBS Lett. 2021 May;595(10):1454-1461
pubmed: 33728680
Nat Commun. 2021 Oct 4;12(1):5800
pubmed: 34608136
Glob Chall. 2017 Jan 10;1(1):33-46
pubmed: 31565258
Nucleic Acids Res. 2015 Jan;43(Database issue):D571-7
pubmed: 25428358
Annu Rev Biochem. 1993;62:139-60
pubmed: 8352587
Nat Ecol Evol. 2017 Feb 21;1(3):77
pubmed: 28812721
Nat Microbiol. 2020 Nov;5(11):1403-1407
pubmed: 32669681
Science. 1966 Apr 15;152(3720):363-6
pubmed: 17775169

Auteurs

Juan Rodriguez-Rivas (J)

CNRS, Institut de Biologie Paris Seine, Laboratory of Computational and Quantitative Biology, Sorbonne Université, 75005 Paris, France.

Giancarlo Croce (G)

Department of Oncology, Ludwig Institute for Cancer Research Lausanne, University of Lausanne, 1011 Lausanne, Switzerland.
Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland.

Maureen Muscat (M)

CNRS, Institut de Biologie Paris Seine, Laboratory of Computational and Quantitative Biology, Sorbonne Université, 75005 Paris, France.

Martin Weigt (M)

CNRS, Institut de Biologie Paris Seine, Laboratory of Computational and Quantitative Biology, Sorbonne Université, 75005 Paris, France; martin.weigt@sorbonne-universite.fr.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH