Genomic Surveillance of COVID-19 Variants With Language Models and Machine Learning.

SARS-CoV-2; genomic surveillance; natural language preprocessing; supervised predictions; unsupervised modeling

Journal

Frontiers in Genetics
ISSN: 1664-8021
Abbreviated title: Front Genet
Country: Switzerland
NLM ID: 101560621

Publication information

Publication date:
2022
History:
received: 19 01 2022
accepted: 14 03 2022
entrez: 25 4 2022
pubmed: 26 4 2022
medline: 26 4 2022
Status: epublish

Abstract

The global efforts to control COVID-19 are threatened by the rapid emergence of novel SARS-CoV-2 variants that may display undesirable characteristics such as immune escape, increased transmissibility or pathogenicity. Early prediction for emergence of new strains with these features is critical for pandemic preparedness. We present

Identifiers

pubmed: 35464852
doi: 10.3389/fgene.2022.858252
pii: 858252
pmc: PMC9024110

Publication types

Journal Article

Languages

eng

Pagination

858252

Copyright information

Copyright © 2022 Nagpal, Pal, Ashima, Tyagi, Tripathi, Nagori, Ahmad, Mishra, Malhotra, Kutum and Sethi.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Authors

Sargun Nagpal (S)

Indraprastha Institute of Information Technology Delhi, New Delhi, India.

Ridam Pal (R)

Indraprastha Institute of Information Technology Delhi, New Delhi, India.
Indraprastha Institute of Information Technology Delhi, New Delhi, India.

Ananya Tyagi (A)

Indraprastha Institute of Information Technology Delhi, New Delhi, India.

Sadhana Tripathi (S)

Indraprastha Institute of Information Technology Delhi, New Delhi, India.

Aditya Nagori (A)

Indraprastha Institute of Information Technology Delhi, New Delhi, India.

Saad Ahmad (S)

Indraprastha Institute of Information Technology Delhi, New Delhi, India.

Hara Prasad Mishra (HP)

Indraprastha Institute of Information Technology Delhi, New Delhi, India.

Rishabh Malhotra (R)

Indraprastha Institute of Information Technology Delhi, New Delhi, India.

Rintu Kutum (R)

Indraprastha Institute of Information Technology Delhi, New Delhi, India.
Ashoka University, Sonipat, India.

Tavpritesh Sethi (T)

Indraprastha Institute of Information Technology Delhi, New Delhi, India.
All India Institute of Medical Sciences, New Delhi, India.

MeSH classifications