COVIDPUBGRAPH: A FAIR Knowledge Graph of COVID-19 Publications.


Journal

Scientific data
ISSN: 2052-4463
Abbreviated title: Sci Data
Country: England
NLM ID: 101640192

Publication information

Publication date:
2022-07-08
History:
received: 2021-08-14
accepted: 2022-02-23
entrez: 2022-07-08
pubmed: 2022-07-09
medline: 2022-07-14
Status: epublish

Abstract

The rapid generation of large amounts of information about the coronavirus SARS-CoV-2 and the disease COVID-19 makes it increasingly difficult to gain a comprehensive overview of current insights related to the disease. With this work, we aim to support the rapid access to a comprehensive data source on COVID-19 targeted especially at researchers. Our knowledge graph, COVIDPUBGRAPH, an RDF knowledge graph of scientific publications, abides by the Linked Data and FAIR principles. The base dataset for the extraction is CORD-19, a dataset of COVID-19-related publications, which is updated regularly. Consequently, COVIDPUBGRAPH is updated biweekly. Our generation pipeline applies named entity recognition, entity linking and link discovery approaches to the original data. The current version of COVIDPUBGRAPH contains 268,108,670 triples and is linked to 9 other datasets by over 1 million links. In our use case studies, we demonstrate the usefulness of our knowledge graph for different applications. COVIDPUBGRAPH is publicly available under the Creative Commons Attribution 4.0 International license.
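To make the phrase "RDF knowledge graph of scientific publications" concrete, the following minimal sketch serializes a few facts from this record as N-Triples lines using only the Python standard library. It is an illustration under stated assumptions, not the COVIDPUBGRAPH schema or pipeline: the choice of Dublin Core Terms predicates, the use of the DOI URL as the subject IRI, and the helper names `escape_literal` and `to_ntriples` are all hypothetical.

```python
# Hypothetical illustration only: this is NOT the actual COVIDPUBGRAPH schema.
# Predicates are Dublin Core Terms; the subject IRI is the article's DOI URL.

DCTERMS = "http://purl.org/dc/terms/"


def escape_literal(value: str) -> str:
    """Escape the characters N-Triples requires escaping inside string literals."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def to_ntriples(subject: str, properties: list[tuple[str, str]]) -> list[str]:
    """Serialize (predicate, literal) pairs about one subject as N-Triples lines."""
    return [
        f'<{subject}> <{predicate}> "{escape_literal(value)}" .'
        for predicate, value in properties
    ]


# Values below are taken from this record; the modelling choices are assumptions.
paper = "https://doi.org/10.1038/s41597-022-01298-2"
lines = to_ntriples(
    paper,
    [
        (DCTERMS + "title", "COVIDPUBGRAPH: A FAIR Knowledge Graph of COVID-19 Publications."),
        (DCTERMS + "creator", "Svetlana Pestryakova"),
        (DCTERMS + "issued", "2022-07-08"),
        (DCTERMS + "identifier", "pubmed:35803947"),
    ],
)

for line in lines:
    print(line)
```

Each printed line is one subject-predicate-object statement. The published graph additionally links entities recognized in the text to external datasets, which this sketch does not attempt.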

Identifiers

pubmed: 35803947
doi: 10.1038/s41597-022-01298-2
pii: 10.1038/s41597-022-01298-2
pmc: PMC9263802

Publication types

Dataset; Journal Article; Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Pagination

389

Copyright information

© 2022. The Author(s).

References

Sci Data. 2016 Mar 15;3:160018
pubmed: 26978244
Cell Discov. 2020 Mar 16;6:14
pubmed: 32194980
Lancet Infect Dis. 2020 May;20(5):533-534
pubmed: 32087114
Scientometrics. 2021;126(4):3683-3692
pubmed: 33612883
Nucleic Acids Res. 2018 Jan 4;46(D1):D1074-D1082
pubmed: 29126136
Nucleic Acids Res. 2019 Jan 8;47(D1):D590-D595
pubmed: 30321428
Nucleic Acids Res. 2016 Jan 4;44(D1):D1075-9
pubmed: 26481350
Sci Data. 2022 Jul 8;9(1):389
pubmed: 35803947

Authors

Svetlana Pestryakova (S)

DICE Research Group, Department of Computer Science, Paderborn University, Paderborn, Germany. pestryak@mail.uni-paderborn.de.

Daniel Vollmers (D)

DICE Research Group, Department of Computer Science, Paderborn University, Paderborn, Germany.

Mohamed Ahmed Sherif (MA)

DICE Research Group, Department of Computer Science, Paderborn University, Paderborn, Germany. mohamed.sherif@upb.de.

Stefan Heindorf (S)

DICE Research Group, Department of Computer Science, Paderborn University, Paderborn, Germany.

Muhammad Saleem (M)

DICE Research Group, Department of Computer Science, Paderborn University, Paderborn, Germany.

Diego Moussallem (D)

DICE Research Group, Department of Computer Science, Paderborn University, Paderborn, Germany.

Axel-Cyrille Ngonga Ngomo (AN)

DICE Research Group, Department of Computer Science, Paderborn University, Paderborn, Germany.
