KG-COVID-19: a framework to produce customized knowledge graphs for COVID-19 response.


Journal

bioRxiv : the preprint server for biology
Titre abrégé: bioRxiv
Pays: United States
ID NLM: 101680187

Informations de publication

Date de publication:
18 Aug 2020
Historique:
entrez: 26 8 2020
pubmed: 26 8 2020
medline: 26 8 2020
Statut: epublish

Résumé

Integrated, up-to-date data about SARS-CoV-2 and coronavirus disease 2019 (COVID-19) is crucial for the ongoing response to the COVID-19 pandemic by the biomedical research community. While rich biological knowledge exists for SARS-CoV-2 and related viruses (SARS-CoV, MERS-CoV), integrating this knowledge is difficult and time consuming, since much of it is in siloed databases or in textual format. Furthermore, the data required by the research community varies drastically for different tasks - the optimal data for a machine learning task, for example, is much different from the data used to populate a browsable user interface for clinicians. To address these challenges, we created KG-COVID-19, a flexible framework that ingests and integrates biomedical data to produce knowledge graphs (KGs) for COVID-19 response. This KG framework can also be applied to other problems in which siloed biomedical data must be quickly integrated for different research applications, including future pandemics. An effective response to the COVID-19 pandemic relies on integration of many different types of data available about SARS-CoV-2 and related viruses. KG-COVID-19 is a framework for producing knowledge graphs that can be customized for downstream applications including machine learning tasks, hypothesis-based querying, and browsable user interface to enable researchers to explore COVID-19 data and discover relationships.

Identifiants

pubmed: 32839776
doi: 10.1101/2020.08.17.254839
pmc: PMC7444288
pii:
doi:

Types de publication

Preprint

Langues

eng

Subventions

Organisme : NIH HHS
ID : R24 OD011883
Pays : United States
Organisme : NCI NIH HHS
ID : U01 CA239108
Pays : United States
Organisme : NCATS NIH HHS
ID : U24 TR002306
Pays : United States

Commentaires et corrections

Type : UpdateIn

Références

Bioinformatics. 2021 Jun 9;37(9):1332-1334
pubmed: 32976572
PLoS Biol. 2017 Jun 29;15(6):e2001414
pubmed: 28662064
N Engl J Med. 2020 Oct 29;383(18):1757-1766
pubmed: 32329974
Nucleic Acids Res. 2012 Jan;40(Database issue):D1100-7
pubmed: 21948594
Nucleic Acids Res. 2019 Jan 8;47(D1):D607-D613
pubmed: 30476243
Nat Rev Microbiol. 2016 Aug;14(8):523-34
pubmed: 27344959
KDD. 2016 Aug;2016:855-864
pubmed: 27853626
Methods Mol Biol. 2013;1015:311-20
pubmed: 23824865
Nucleic Acids Res. 2017 Jan 4;45(D1):D932-D939
pubmed: 27789690
Int J Cardiol Heart Vasc. 2020 Jul 14;29:100583
pubmed: 32715080
Signal Transduct Target Ther. 2021 Apr 24;6(1):165
pubmed: 33895786
Am J Hum Genet. 2008 Nov;83(5):610-5
pubmed: 18950739
Plant Biotechnol J. 2021 Aug;19(8):1670-1678
pubmed: 33750020
N Engl J Med. 2020 Sep 3;383(10):994
pubmed: 32649078
Nucleic Acids Res. 2014 Jan;42(Database issue):D358-63
pubmed: 24234451
Nucleic Acids Res. 2002 Jan 1;30(1):412-5
pubmed: 11752352
Nat Genet. 2019 Oct;51(10):1429-1433
pubmed: 31548717
Nucleic Acids Res. 2019 Jan 8;47(D1):D963-D970
pubmed: 30371892
N Engl J Med. 2020 Dec 17;383(25):2451-2460
pubmed: 32412710
Med Drug Discov. 2021 Jun;10:100090
pubmed: 33817623
Nucleic Acids Res. 2019 Jan 8;47(D1):D330-D338
pubmed: 30395331
Genome Biol. 2005;6(5):R46
pubmed: 15892874
Nucleic Acids Res. 2017 Jan 4;45(D1):D712-D722
pubmed: 27899636

Auteurs

Classifications MeSH