Increased Frequency of Indels in Hypervariable Regions of SARS-CoV-2 Proteins-A Possible Signature of Adaptive Selection.
SARS-CoV-2
hypervariable regions (HVR)
indels
protein loop
variants of concern (VOCs)
Journal
Frontiers in genetics
ISSN: 1664-8021
Titre abrégé: Front Genet
Pays: Switzerland
ID NLM: 101560621
Informations de publication
Date de publication:
2022
2022
Historique:
received:
14
02
2022
accepted:
08
04
2022
entrez:
20
6
2022
pubmed:
21
6
2022
medline:
21
6
2022
Statut:
epublish
Résumé
Most attention in the surveillance of evolving SARS-CoV-2 genome has been centered on nucleotide substitutions in the spike glycoprotein. We show that, as the pandemic extends into its second year, the numbers and ratio of genomes with in-frame insertions and deletions (indels) increases significantly, especially among the variants of concern (VOCs). Monitoring of the SARS-CoV-2 genome evolution shows that co-occurrence (i.e., highly correlated presence) of indels, especially deletions on spike N-terminal domain and non-structural protein 6 (NSP6) is a shared feature in several VOCs such as Alpha, Beta, Delta, and Omicron. Indels distribution is correlated with spike mutations associated with immune escape and growth in the number of genomes with indels coincides with the increasing population resistance due to vaccination and previous infections. Indels occur most frequently in the spike, but also in other proteins, especially those involved in interactions with the host immune system. We also showed that indels concentrate in regions of individual SARS-CoV-2 proteins known as hypervariable regions (HVRs) that are mostly located in specific loop regions. Structural analysis suggests that indels remodel viral proteins' surfaces at common epitopes and interaction interfaces, affecting the virus' interactions with host proteins. We hypothesize that the increased frequency of indels, the non-random distribution of them and their independent co-occurrence in several VOCs is another mechanism of response to elevated global population immunity.
Identifiants
pubmed: 35719386
doi: 10.3389/fgene.2022.875406
pii: 875406
pmc: PMC9201826
doi:
Types de publication
Journal Article
Langues
eng
Pagination
875406Subventions
Organisme : NIGMS NIH HHS
ID : R35 GM118187
Pays : United States
Informations de copyright
Copyright © 2022 Alisoltani, Jaroszewski, Iyer, Iranzadeh and Godzik.
Déclaration de conflit d'intérêts
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Références
Science. 2021 Aug 6;373(6555):642-648
pubmed: 34168070
PLoS Genet. 2020 Nov 18;16(11):e1009175
pubmed: 33206635
Nucleic Acids Res. 2000 Jan 1;28(1):235-42
pubmed: 10592235
Science. 2021 Apr 9;372(6538):
pubmed: 33658326
Sci Bull (Beijing). 2021 Nov 30;66(22):2297-2311
pubmed: 33585048
J Mol Biol. 2001 Jan 19;305(3):567-80
pubmed: 11152613
PLoS Biol. 2018 Aug 13;16(8):e3000003
pubmed: 30102691
Lancet. 2021 Dec 11;398(10317):2126-2128
pubmed: 34871545
N Engl J Med. 2021 May 20;384(20):1885-1898
pubmed: 33725432
Bioinformatics. 2020 Aug 1;36(15):4360-4362
pubmed: 32470119
Virus Evol. 2019 Jul 21;5(2):vez022
pubmed: 31341641
Nature. 2020 Mar;579(7798):270-273
pubmed: 32015507
Infect Genet Evol. 2020 Sep;83:104351
pubmed: 32387564
PLoS Pathog. 2015 Feb 13;11(2):e1004664
pubmed: 25679389
Lancet Microbe. 2021 Jul;2(7):e283-e284
pubmed: 33846703
Microb Genom. 2019 Jan;5(1):
pubmed: 30663960
Nature. 2021 Apr;592(7854):438-443
pubmed: 33690265
Science. 2004 Mar 12;303(5664):1666-9
pubmed: 14752165
Mol Biol Evol. 2013 Apr;30(4):772-80
pubmed: 23329690
Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W513-8
pubmed: 18515843
Nature. 2022 Mar;603(7902):679-686
pubmed: 35042229
Cell Host Microbe. 2021 Mar 10;29(3):489-502.e8
pubmed: 33548198
J Virol. 2007 Dec;81(24):13876-88
pubmed: 17928347
J Mol Med (Berl). 2021 Jan;99(1):93-106
pubmed: 33269412
Virus Res. 2021 Jan 2;291:198222
pubmed: 33166565
Lancet Public Health. 2021 May;6(5):e267-e268
pubmed: 33857454
Curr Protoc Bioinformatics. 2020 Mar;69(1):e96
pubmed: 32162851
Curr Opin Virol. 2019 Feb;34:90-96
pubmed: 30703578
Microorganisms. 2021 Jul 20;9(7):
pubmed: 34361977
Mol Biol Evol. 2014 Jan;31(1):121-34
pubmed: 24109604
Signal Transduct Target Ther. 2021 Apr 24;6(1):166
pubmed: 33895772
Bioinformatics. 2016 Sep 15;32(18):2847-9
pubmed: 27207943
Bioinformatics. 2018 Dec 1;34(23):4121-4123
pubmed: 29790939
PLoS Comput Biol. 2021 Jul 8;17(7):e1009147
pubmed: 34237054
Euro Surveill. 2017 Mar 30;22(13):
pubmed: 28382917
Nucleic Acids Res. 2017 Jul 3;45(W1):W24-W29
pubmed: 28472356
Virus Res. 2022 Mar;310:198674
pubmed: 35021068
Nature. 2021 Aug;596(7871):276-280
pubmed: 34237773
Cell. 2020 Sep 3;182(5):1295-1310.e20
pubmed: 32841599
Cell. 2021 Sep 30;184(20):5189-5200.e7
pubmed: 34537136
Signal Transduct Target Ther. 2021 Apr 24;6(1):164
pubmed: 33895775
Bioinformatics. 2009 May 1;25(9):1189-91
pubmed: 19151095
PLoS Pathog. 2009 May;5(5):e1000414
pubmed: 19424423
Genome Biol Evol. 2014 Mar;6(3):538-50
pubmed: 24558257
Emerg Microbes Infect. 2020 Dec;9(1):2685-2696
pubmed: 33205709
Curr Pharm Des. 2006;12(15):1811-25
pubmed: 16724949
Genome Med. 2021 Jun 14;13(1):101
pubmed: 34127050
Glob Chall. 2017 Jan 10;1(1):33-46
pubmed: 31565258
Nat Commun. 2020 Jul 30;11(1):3810
pubmed: 32733001
Cell Rep. 2022 Jan 11;38(2):110218
pubmed: 34968415
EMBO J. 2001 May 15;20(10):2587-95
pubmed: 11350948
Science. 2021 Mar 12;371(6534):1139-1142
pubmed: 33536258
Cell. 2021 Apr 29;184(9):2332-2347.e16
pubmed: 33761326
Cell Host Microbe. 2021 Apr 14;29(4):508-515
pubmed: 33789086
Biochem J. 2013 Feb 1;449(3):581-94
pubmed: 23301657