A novel statistical method predicts mutability of the genomic segments of the SARS-CoV-2 virus.

Mutability SARS-CoV2 Statistical Analysis Word Ranking

Journal

QRB discovery
ISSN: 2633-2892
Titre abrégé: QRB Discov
Pays: England
ID NLM: 101772102

Informations de publication

Date de publication:
2022
Historique:
received: 26 11 2020
revised: 28 05 2021
accepted: 26 11 2021
entrez: 2 2 2022
pubmed: 3 2 2022
medline: 3 2 2022
Statut: epublish

Résumé

The SARS-CoV-2 virus has made the largest pandemic of the 21st century, with hundreds of millions of cases and tens of millions of fatalities. Scientists all around the world are racing to develop vaccines and new pharmaceuticals to overcome the pandemic and offer effective treatments for COVID-19 disease. Consequently, there is an essential need to better understand how the pathogenesis of SARS-CoV-2 is affected by viral mutations and to determine the conserved segments in the viral genome that can serve as stable targets for novel therapeutics. Here, we introduce a text-mining method to estimate the mutability of genomic segments directly from a reference (ancestral) whole genome sequence. The method relies on calculating the importance of genomic segments based on their spatial distribution and frequency over the whole genome. To validate our approach, we perform a large-scale analysis of the viral mutations in nearly 80,000 publicly available SARS-CoV-2 predecessor whole genome sequences and show that these results are highly correlated with the segments predicted by the statistical method used for keyword detection. Importantly, these correlations are found to hold at the codon and gene levels, as well as for gene coding regions. Using the text-mining method, we further identify codon sequences that are potential candidates for siRNA-based antiviral drugs. Significantly, one of the candidates identified in this work corresponds to the first seven codons of an epitope of the spike glycoprotein, which is the only SARS-CoV-2 immunogenic peptide without a match to a human protein.

Identifiants

pubmed: 35106478
doi: 10.1017/qrd.2021.13
pii: S2633289221000132
pmc: PMC8795775
doi:

Types de publication

Journal Article

Langues

eng

Pagination

e1

Informations de copyright

© The Author(s) 2021.

Références

Front Immunol. 2020 Jul 03;11:1581
pubmed: 32719684
Biotechnol J. 2011 Jun;6(6):650-9
pubmed: 21567958
Front Microbiol. 2020 Jul 22;11:1800
pubmed: 32793182
J Transl Autoimmun. 2020 Apr 09;3:100051
pubmed: 32292901
Nat Med. 2020 Apr;26(4):450-452
pubmed: 32284615
PLoS Comput Biol. 2017 May 5;13(5):e1005531
pubmed: 28475588
Nucleic Acids Res. 1999 Oct 1;27(19):3899-910
pubmed: 10481030
Biochemistry. 2020 Jul 21;59(28):2608-2615
pubmed: 32578982
Mol Biol Evol. 2007 Aug;24(8):1600-3
pubmed: 17525472
Cell Host Microbe. 2020 Mar 11;27(3):325-328
pubmed: 32035028
J Clin Med. 2020 Feb 26;9(3):
pubmed: 32110875
PLoS Biol. 2018 Aug 13;16(8):e3000003
pubmed: 30102691
J Comput Biol. 2000 Feb-Apr;7(1-2):203-14
pubmed: 10890397
Genetics. 1998 Apr;148(4):1667-86
pubmed: 9560386
QRB Discov. 2020 Jun 02;1:e6
pubmed: 34192262
Protein J. 2020 Jun;39(3):198-216
pubmed: 32447571
PLoS One. 2015 Jun 19;10(6):e0130617
pubmed: 26091207
Biochim Biophys Acta Mol Basis Dis. 2020 Oct 1;1866(10):165878
pubmed: 32544429
Immunity. 2020 Jun 16;52(6):910-941
pubmed: 32505227
Nat Commun. 2016 Apr 21;7:11334
pubmed: 27098217
Viruses. 2020 Mar 25;12(4):
pubmed: 32218151
J Med Virol. 2020 Jun;92(6):584-588
pubmed: 32083328
Biomolecules. 2022 Aug 21;12(8):
pubmed: 36009050
Lancet Infect Dis. 2020 Sep;20(9):1018-1019
pubmed: 32860762
Proc Natl Acad Sci U S A. 2016 Oct 11;113(41):E6117-E6125
pubmed: 27671647
Cell Host Microbe. 2012 Nov 15;12(5):623-32
pubmed: 23159052
BMC Bioinformatics. 2013;14 Suppl 11:S2
pubmed: 24564200
Wiley Interdiscip Rev RNA. 2016 Nov;7(6):726-743
pubmed: 27307213
J Virol. 2018 Jun 29;92(14):
pubmed: 29720522
PLoS Pathog. 2008 Jun 06;4(6):e1000079
pubmed: 18535658
Antiviral Res. 2018 Jan;149:58-74
pubmed: 29128390

Auteurs

Amir Hossein Darooneh (AH)

Department of Applied Mathematics, University of Waterloo, Waterloo, ON, Canada.

Michelle Przedborski (M)

Department of Applied Mathematics, University of Waterloo, Waterloo, ON, Canada.

Mohammad Kohandel (M)

Department of Applied Mathematics, University of Waterloo, Waterloo, ON, Canada.

Classifications MeSH