Host-virus chimeric events in SARS-CoV2 infected cells are infrequent and artifactual.


Journal

bioRxiv : the preprint server for biology
Titre abrégé: bioRxiv
Pays: United States
ID NLM: 101680187

Informations de publication

Date de publication:
17 Feb 2021
Historique:
entrez: 23 2 2021
pubmed: 24 2 2021
medline: 24 2 2021
Statut: epublish

Résumé

Pathogenic mechanisms underlying severe SARS-CoV2 infection remain largely unelucidated. High throughput sequencing technologies that capture genome and transcriptome information are key approaches to gain detailed mechanistic insights from infected cells. These techniques readily detect both pathogen and host-derived sequences, providing a means of studying host-pathogen interactions. Recent studies have reported the presence of host-virus chimeric (HVC) RNA in RNA-seq data from SARS-CoV2 infected cells and interpreted these findings as evidence of viral integration in the human genome as a potential pathogenic mechanism. Since SARS-CoV2 is a positive sense RNA virus that replicates in the cytoplasm it does not have a nuclear phase in its life cycle, it is biologically unlikely to be in a location where splicing events could result in genome integration. Here, we investigated the biological authenticity of HVC events. In contrast to true biological events such as mRNA splicing and genome rearrangement events, which generate reproducible chimeric sequencing fragments across different biological isolates, we found that HVC events across >100 RNA-seq libraries from patients with COVID-19 and infected cell lines, were highly irreproducible. RNA-seq library preparation is inherently error-prone due to random template switching during reverse transcription of RNA to cDNA. By counting chimeric events observed when constructing an RNA-seq library from human RNA and spike-in RNA from an unrelated species, such as fruit-fly, we estimated that ~1% of RNA-seq reads are artifactually chimeric. In SARS-CoV2 RNA-seq we found that the frequency of HVC events was, in fact, not greater than this background "noise". Finally, we developed a novel experimental approach to enrich SARS-CoV2 sequences from bulk RNA of infected cells. This method enriched viral sequences but did not enrich for HVC events, suggesting that the majority of HVC events are, in all likelihood, artifacts of library construction. In conclusion, our findings indicate that HVC events observed in RNA-sequencing libraries from SARS-CoV2 infected cells are extremely rare and are likely artifacts arising from either random template switching of reverse-transcriptase and/or sequence alignment errors. Therefore, the observed HVC events do not support SARS-CoV2 fusion to cellular genes and/or integration into human genomes.

Identifiants

pubmed: 33619483
doi: 10.1101/2021.02.17.431704
pmc: PMC7899447
pii:
doi:

Types de publication

Preprint

Langues

eng

Subventions

Organisme : NIGMS NIH HHS
ID : R35 GM138283
Pays : United States
Organisme : Intramural NIH HHS
ID : ZIA AI001175
Pays : United States
Organisme : Intramural NIH HHS
ID : ZIA DK075149
Pays : United States

Commentaires et corrections

Type : UpdateIn

Références

Lancet Respir Med. 2020 Apr;8(4):420-422
pubmed: 32085846
Cell. 2020 May 14;181(4):914-921.e10
pubmed: 32330414
Nat Commun. 2016 Oct 05;7:12992
pubmed: 27703150
Genomics. 2019 Dec;111(6):1853-1861
pubmed: 30552977
Mol Syst Biol. 2015 Aug 07;11(8):826
pubmed: 26253570
Theranostics. 2020 Sep 1;10(24):10957-10972
pubmed: 33042264
J Virol. 2020 Nov 23;94(24):
pubmed: 32999023
PLoS Pathog. 2017 Apr 6;13(4):e1006211
pubmed: 28384274
Nat Genet. 2012 May 27;44(7):765-9
pubmed: 22634754
Proc Natl Acad Sci U S A. 2011 May 31;108(22):9172-7
pubmed: 21571633
Proc Natl Acad Sci U S A. 2019 Apr 23;116(17):8445-8450
pubmed: 30962378
Genomics. 2006 Jul;88(1):127-31
pubmed: 16457984
Cell. 2020 Jun 25;181(7):1502-1517.e23
pubmed: 32559462
J Virol. 2015 Sep;89(17):8967-73
pubmed: 26085148
Virology. 2017 Sep;509:167-177
pubmed: 28646652
Annu Rev Microbiol. 2019 Sep 8;73:529-557
pubmed: 31226023
Cancer Res. 2019 Dec 1;79(23):6010-6023
pubmed: 31481499

Auteurs

Bingyu Yan (B)

Department of Biochemistry, Purdue University, West Lafayette, IN, USA.

Srishti Chakravorty (S)

Department of Biochemistry, Purdue University, West Lafayette, IN, USA.

Carmen Mirabelli (C)

Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA.

Luopin Wang (L)

Department of Computer Science, Purdue University, West Lafayette, IN, USA.

Jorge L Trujillo-Ochoa (JL)

Immunoregulation Section, Kidney Diseases Branch, National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), NIH, Bethesda, MD, USA.

Daniel Chauss (D)

Immunoregulation Section, Kidney Diseases Branch, National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), NIH, Bethesda, MD, USA.

Dhaneshwar Kumar (D)

Department of Biochemistry, Purdue University, West Lafayette, IN, USA.
Immunoregulation Section, Kidney Diseases Branch, National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), NIH, Bethesda, MD, USA.

Michail S Lionakis (MS)

Fungal Pathogenesis Section, Laboratory of Clinical Immunology and Microbiology, National Institute of Allergy and Infectious Diseases (NIAID), NIH, Bethesda, MD, USA.

Matthew R Olson (MR)

Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.

Christiane E Wobus (CE)

Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA.

Behdad Afzali (B)

Immunoregulation Section, Kidney Diseases Branch, National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), NIH, Bethesda, MD, USA.

Majid Kazemian (M)

Department of Biochemistry, Purdue University, West Lafayette, IN, USA.
Department of Computer Science, Purdue University, West Lafayette, IN, USA.

Classifications MeSH