OBO Foundry in 2021: operationalizing open data principles to evaluate ontologies.


Journal

Database : the journal of biological databases and curation
ISSN: 1758-0463
Titre abrégé: Database (Oxford)
Pays: England
ID NLM: 101517697

Informations de publication

Date de publication:
26 10 2021
Historique:
received: 18 05 2021
revised: 05 10 2021
accepted: 13 10 2021
entrez: 26 10 2021
pubmed: 27 10 2021
medline: 28 1 2022
Statut: ppublish

Résumé

Biological ontologies are used to organize, curate and interpret the vast quantities of data arising from biological experiments. While this works well when using a single ontology, integrating multiple ontologies can be problematic, as they are developed independently, which can lead to incompatibilities. The Open Biological and Biomedical Ontologies (OBO) Foundry was created to address this by facilitating the development, harmonization, application and sharing of ontologies, guided by a set of overarching principles. One challenge in reaching these goals was that the OBO principles were not originally encoded in a precise fashion, and interpretation was subjective. Here, we show how we have addressed this by formally encoding the OBO principles as operational rules and implementing a suite of automated validation checks and a dashboard for objectively evaluating each ontology's compliance with each principle. This entailed a substantial effort to curate metadata across all ontologies and to coordinate with individual stakeholders. We have applied these checks across the full OBO suite of ontologies, revealing areas where individual ontologies require changes to conform to our principles. Our work demonstrates how a sizable, federated community can be organized and evaluated on objective criteria that help improve overall quality and interoperability, which is vital for the sustenance of the OBO project and towards the overall goals of making data Findable, Accessible, Interoperable, and Reusable (FAIR). Database URL http://obofoundry.org/.

Identifiants

pubmed: 34697637
pii: 6410158
doi: 10.1093/database/baab069
pmc: PMC8546234
pii:
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : NIH HHS
ID : R24 OD011883
Pays : United States
Organisme : NHGRI NIH HHS
ID : RM1 HG010860
Pays : United States
Organisme : NHGRI NIH HHS
ID : U41 HG008735
Pays : United States
Organisme : NHGRI NIH HHS
ID : R24 HG010032
Pays : United States

Informations de copyright

© The Author(s) 2021. Published by Oxford University Press.

Références

Cold Spring Harb Symp Quant Biol. 2003;68:227-35
pubmed: 15338622
Nat Biotechnol. 2007 Nov;25(11):1251-5
pubmed: 17989687
J Biomed Inform. 2006 Jun;39(3):314-20
pubmed: 16564748
J Biomed Semantics. 2018 Jan 18;9(1):6
pubmed: 29347969
BMC Bioinformatics. 2019 Jul 29;20(1):407
pubmed: 31357927
Nucleic Acids Res. 2011 Jul;39(Web Server issue):W541-5
pubmed: 21672956
Sci Data. 2016 Mar 15;3:160018
pubmed: 26978244

Auteurs

Rebecca Jackson (R)

Bend Informatics LLC, 20770 Double Peaks Drive, Bend, OR 97701, USA.

Nicolas Matentzoglu (N)

Semanticly, 71-75 Shelton Street, London WC2H 9JQ, UK.

James A Overton (JA)

Knocean Inc., 2-107 Quebec Ave., Toronto, ON M6P 2T3, Canada.

Randi Vita (R)

La Jolla Institute for Immunology, 9420 Athena Cir, La Jolla, CA 92037, USA.

James P Balhoff (JP)

Renaissance Computing Institute, University of North Carolina, 100 Europa Drive, Suite 540, Chapel Hill, NC 27517, USA.

Pier Luigi Buttigieg (PL)

Alfred Wegener Institute, Helmholtz Center for Polar and Marine Research, Am Handelshafen 12, Bremerhaven 27570, Germany.

Seth Carbon (S)

Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd., Berkeley, CA 94720, USA.

Melanie Courtot (M)

European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK.

Alexander D Diehl (AD)

Department of Biomedical Informatics, University at Buffalo, 77 Goodell St, Buffalo, NY 14203, USA.

Damion M Dooley (DM)

Centre for Infectious Disease Genomics and One Health, Simon Fraser University, 8888 University Dr, Burnaby, BC V5A 1S6, Canada.

William D Duncan (WD)

Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd., Berkeley, CA 94720, USA.

Nomi L Harris (NL)

Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd., Berkeley, CA 94720, USA.

Melissa A Haendel (MA)

Biochemistry and Molecular Genetics Department, University of Colorado School of Medicine, PO Box 6511, Aurora, CO 80045, USA.

Suzanna E Lewis (SE)

Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd., Berkeley, CA 94720, USA.

Darren A Natale (DA)

Department of Biochemistry and Molecular & Cellular Biology, Georgetown University Medical Center, 2115 Wisconsin Avenue NW, Washington, DC 20007, USA.

David Osumi-Sutherland (D)

European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK.

Alan Ruttenberg (A)

Department of Biomedical Informatics, University at Buffalo, 77 Goodell St, Buffalo, NY 14203, USA.

Lynn M Schriml (LM)

School of Medicine, University of Maryland, 655 W Baltimore St S, Baltimore, MD 21201, USA.

Barry Smith (B)

Department of Biomedical Informatics, University at Buffalo, 77 Goodell St, Buffalo, NY 14203, USA.

Christian J Stoeckert (CJ)

Department of Genetics and Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, 3400 Civic Center Blvd, Philadelphia, PA 19104, USA.

Nicole A Vasilevsky (NA)

Biochemistry and Molecular Genetics Department, University of Colorado School of Medicine, PO Box 6511, Aurora, CO 80045, USA.

Ramona L Walls (RL)

Critical Path Institute, 1730 E River Rd #200, Tucson, AZ 85718, USA.

Jie Zheng (J)

Department of Genetics and Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, 3400 Civic Center Blvd, Philadelphia, PA 19104, USA.

Christopher J Mungall (CJ)

Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd., Berkeley, CA 94720, USA.

Bjoern Peters (B)

La Jolla Institute for Immunology, 9420 Athena Cir, La Jolla, CA 92037, USA.

Articles similaires

Humans Recurrence Male Female Middle Aged

Real world data on cervical cancer treatment patterns, healthcare access and resource utilization in the Brazilian public healthcare system.

Thabata Martins Ferreira Campuzano, Maria Amelia Carlos Souto Maior Borba, Paula de Mendonça Batista et al.
1.00
Humans Female Uterine Cervical Neoplasms Brazil Middle Aged
Humans Female Breast Neoplasms Retrospective Studies Middle Aged
International Classification of Diseases Humans Skin Diseases Algorithms Germany

Classifications MeSH