Indels in SARS-CoV-2 occur at template-switching hotspots.
Genomics
RNA virus
Recombination
SARS-CoV-2
Journal
BioData mining
ISSN: 1756-0381
Titre abrégé: BioData Min
Pays: England
ID NLM: 101319161
Informations de publication
Date de publication:
20 Mar 2021
20 Mar 2021
Historique:
received:
16
11
2020
accepted:
23
02
2021
entrez:
21
3
2021
pubmed:
22
3
2021
medline:
22
3
2021
Statut:
epublish
Résumé
The evolutionary dynamics of SARS-CoV-2 have been carefully monitored since the COVID-19 pandemic began in December 2019. However, analysis has focused primarily on single nucleotide polymorphisms and largely ignored the role of insertions and deletions (indels) as well as recombination in SARS-CoV-2 evolution. Using sequences from the GISAID database, we catalogue over 100 insertions and deletions in the SARS-CoV-2 consensus sequences. We hypothesize that these indels are artifacts of recombination events between SARS-CoV-2 replicates whereby RNA-dependent RNA polymerase (RdRp) re-associates with a homologous template at a different loci ("imperfect homologous recombination"). We provide several independent pieces of evidence that suggest this. (1) The indels from the GISAID consensus sequences are clustered at specific regions of the genome. (2) These regions are also enriched for 5' and 3' breakpoints in the transcription regulatory site (TRS) independent transcriptome, presumably sites of RNA-dependent RNA polymerase (RdRp) template-switching. (3) Within raw reads, these indel hotspots have cases of both high intra-host heterogeneity and intra-host homogeneity, suggesting that these indels are both consequences of de novo recombination events within a host and artifacts of previous recombination. We briefly analyze the indels in the context of RNA secondary structure, noting that indels preferentially occur in "arms" and loop structures of the predicted folded RNA, suggesting that secondary structure may be a mechanism for TRS-independent template-switching in SARS-CoV-2 or other coronaviruses. These insights into the relationship between structural variation and recombination in SARS-CoV-2 can improve our reconstructions of the SARS-CoV-2 evolutionary history as well as our understanding of the process of RdRp template-switching in RNA viruses.
Identifiants
pubmed: 33743803
doi: 10.1186/s13040-021-00251-0
pii: 10.1186/s13040-021-00251-0
pmc: PMC7980745
doi:
Types de publication
Journal Article
Langues
eng
Pagination
20Subventions
Organisme : National Science Foundation
ID : GRFP
Références
Clin Infect Dis. 2020 Jul 28;71(15):884-887
pubmed: 32130405
Nat Med. 2020 Apr;26(4):450-452
pubmed: 32284615
Nucleic Acids Res. 1995 Jun 11;23(11):1870-5
pubmed: 7596811
Biomol Detect Quantif. 2015 Mar;3:1-8
pubmed: 26753127
Bioinformatics. 2018 Sep 1;34(17):i884-i890
pubmed: 30423086
J Virol. 1997 Aug;71(8):6183-90
pubmed: 9223514
PLoS One. 2011;6(9):e24907
pubmed: 21966381
Nucleic Acids Res. 2014 Jul;42(Web Server issue):W368-72
pubmed: 24771339
Curr Biol. 2020 Apr 6;30(7):1346-1351.e2
pubmed: 32197085
RNA. 2020 Aug;26(8):937-959
pubmed: 32398273
PeerJ. 2020 May 7;8:e9089
pubmed: 32419987
J Virol. 2015 Oct;89(20):10532-47
pubmed: 26269185
Annu Rev Virol. 2015 Nov;2(1):265-88
pubmed: 26958916
BMC Genomics. 2008 Nov 02;9:520
pubmed: 18976501
Nucleic Acids Res. 2000 Apr 15;28(8):1714-23
pubmed: 10734190
Nature. 2020 Mar;579(7798):270-273
pubmed: 32015507
Methods Mol Biol. 2009;537:39-64
pubmed: 19378139
J Virol. 2004 Jan;78(2):980-94
pubmed: 14694129
BMC Evol Biol. 2007 Mar 14;7:40
pubmed: 17359539
Algorithms Mol Biol. 2011 Nov 24;6:26
pubmed: 22115189
Virology. 1991 Nov;185(1):441-5
pubmed: 1656597
J Bioinform Comput Biol. 2018 Dec;16(6):1840025
pubmed: 30616476
Mol Ecol. 2019 Mar;28(6):1203-1209
pubmed: 30834648
Cell. 2020 May 14;181(4):914-921.e10
pubmed: 32330414
Nat Commun. 2019 Nov 27;10(1):5407
pubmed: 31776342
J Virol. 1981 May;38(2):661-70
pubmed: 6165837
PLoS Genet. 2020 Nov 18;16(11):e1009175
pubmed: 33206635
J Virol. 2003 Nov;77(22):12033-47
pubmed: 14581540
Nat Rev Microbiol. 2011 Jul 04;9(8):617-26
pubmed: 21725337
Genome Biol. 2019 Mar 14;20(1):50
pubmed: 30867008
Nucleic Acids Res. 2018 Jul 2;46(W1):W30-W35
pubmed: 29718468
Science. 2016 Jan 1;351(6268):81-4
pubmed: 26678874
Nature. 2020 Jul;583(7815):282-285
pubmed: 32218527
Euro Surveill. 2017 Mar 30;22(13):
pubmed: 28382917
Nat Microbiol. 2020 Nov;5(11):1408-1417
pubmed: 32724171
Natl Sci Rev. 2020 Jun;7(6):1012-1023
pubmed: 34676127
PLoS One. 2014 Jan 08;9(1):e85370
pubmed: 24416396
PLoS Pathog. 2016 Sep 15;12(9):e1005894
pubmed: 27631086
Nucleic Acids Res. 2012 Dec;40(22):11189-201
pubmed: 23066108
J Virol. 2007 Jan;81(1):20-9
pubmed: 16928755
Nucleic Acids Res. 2018 Jun 20;46(11):5381-5394
pubmed: 29746666
Bioinformatics. 2009 Aug 1;25(15):1974-5
pubmed: 19398448
Curr Top Microbiol Immunol. 2005;287:31-55
pubmed: 15609508
Nucleic Acids Res. 1988 Dec 23;16(24):11705-23
pubmed: 2463516
Mol Biol Evol. 2012 Feb;29(2):457-72
pubmed: 21873298
J Virol. 2008 Feb;82(4):1819-26
pubmed: 18057240
BMC Genomics. 2015 Mar 24;16:229
pubmed: 25886445
Oncotarget. 2017 Jun 1;8(37):60841-60859
pubmed: 28977829