A Simple Standard for Sharing Ontological Mappings (SSSOM).


Journal

Database : the journal of biological databases and curation
ISSN: 1758-0463
Abbreviated title: Database (Oxford)
Country: England
NLM ID: 101517697

Publication information

Publication date:
25 05 2022
History:
received: 15 12 2021
revised: 08 03 2022
accepted: 11 05 2022
entrez: 26 5 2022
pubmed: 27 5 2022
medline: 28 5 2022
Status: ppublish

Abstract

Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for mapping between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Mappings often lack the metadata needed to be correctly interpreted and applied. For example, are two terms equivalent or merely related? Are they narrow or broad matches? Or are they associated in some other way? Such relationships between the mapped terms are often not documented, which leads to incorrect assumptions and makes the mappings hard to use in scenarios that require a high degree of precision (such as diagnostics or risk prediction). Furthermore, the lack of descriptions of how mappings were created makes it hard to combine and reconcile mappings, particularly curated and automated ones. We have developed the Simple Standard for Sharing Ontological Mappings (SSSOM), which addresses these problems by: (i) introducing a machine-readable and extensible vocabulary to describe metadata that makes imprecision, inaccuracy and incompleteness in mappings explicit; (ii) defining an easy-to-use, simple table-based format that can be integrated into existing data science pipelines without the need to parse or query ontologies, and that integrates seamlessly with Linked Data principles; (iii) implementing open and community-driven collaborative workflows that are designed to evolve the standard continuously to address changing requirements and mapping practices; and (iv) providing reference tools and software libraries for working with the standard. In this paper, we present the SSSOM standard, describe several use cases in detail and survey some of the existing work on standardizing the exchange of mappings, with the goal of making mappings Findable, Accessible, Interoperable and Reusable (FAIR). The SSSOM specification can be found at http://w3id.org/sssom/spec.
Database URL: http://w3id.org/sssom/spec.
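
To illustrate point (ii) of the abstract, the following is a minimal sketch of how a table-based mapping file might be consumed in an ordinary data pipeline without loading any ontology. The column names (`subject_id`, `predicate_id`, `object_id`, `confidence`) and the SKOS predicates are assumptions based on common SSSOM usage and are not stated in this record; the identifiers, confidence values and helper functions are hypothetical. The specification remains the authority on the actual format.

```python
import csv

# Hypothetical mapping rows: identifiers and confidence values are invented
# for illustration. Column names are assumed, not taken from this record.
SSSOM_TSV = (
    "# comment lines like this one may carry file-level metadata\n"
    "subject_id\tpredicate_id\tobject_id\tconfidence\n"
    "EX:0001\tskos:exactMatch\tOTHER:A1\t0.98\n"
    "EX:0002\tskos:broadMatch\tOTHER:B7\t0.80\n"
    "EX:0003\tskos:relatedMatch\tOTHER:C3\t0.55\n"
)


def load_mappings(text):
    """Parse tab-separated mapping rows into dicts, skipping '#' comment lines."""
    lines = [line for line in text.splitlines() if not line.startswith("#")]
    return list(csv.DictReader(lines, delimiter="\t"))


def precise_mappings(rows, min_confidence=0.9):
    """Keep only exact matches at or above a confidence threshold."""
    return [
        row
        for row in rows
        if row["predicate_id"] == "skos:exactMatch"
        and float(row["confidence"]) >= min_confidence
    ]


rows = load_mappings(SSSOM_TSV)
exact = precise_mappings(rows)
print([(row["subject_id"], row["object_id"]) for row in exact])
# prints [('EX:0001', 'OTHER:A1')]
```

Because the relationship type and confidence are explicit columns, a precision-sensitive application can discard broad or merely related matches with a simple filter, which is the kind of use the abstract describes.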

Identifiers

pubmed: 35616100
pii: 6591806
doi: 10.1093/database/baac035
pmc: PMC9216545

Publication types

Journal Article; Research Support, N.I.H., Extramural; Research Support, Non-U.S. Gov't; Research Support, U.S. Gov't, Non-P.H.S.

Languages

eng

Citation subsets

IM

Grants

Agency: NIH HHS
ID: R24 OD011883
Country: United States
Agency: NHGRI NIH HHS
ID: RM1 HG010860
Country: United States
Agency: NHGRI NIH HHS
ID: U24 HG012212
Country: United States

Copyright information

© The Author(s) 2022. Published by Oxford University Press.

Authors

Nicolas Matentzoglu (N)

Semanticly Ltd, London WC2H 9JQ, UK.

James P Balhoff (JP)

RENCI, University of North Carolina, Chapel Hill, NC 27517, USA.

Susan M Bello (SM)

The Jackson Laboratory, Bar Harbor, ME 04609, USA.

Chris Bizon (C)

RENCI, University of North Carolina, Chapel Hill, NC 27517, USA.

Matthew Brush (M)

University of Colorado Anschutz Medical Campus, Aurora, CO 80217, USA.

Tiffany J Callahan (TJ)

University of Colorado Anschutz Medical Campus, Aurora, CO 80217, USA.

Christopher G Chute (CG)

Johns Hopkins University, Baltimore, MD 21210, USA.

William D Duncan (WD)

Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.

Chris T Evelo (CT)

Maastricht University, Maastricht 6211 LK, The Netherlands.

Davera Gabriel (D)

Johns Hopkins University, Baltimore, MD 21210, USA.

John Graybeal (J)

Stanford University, Stanford, CA 94305, USA.

Alasdair Gray (A)

Department of Computer Science, Heriot-Watt University, Edinburgh, Currie EH14 4AS, UK.

Benjamin M Gyori (BM)

Harvard Medical School, Boston, MA 02115, USA.

Melissa Haendel (M)

University of Colorado Anschutz Medical Campus, Aurora, CO 80217, USA.

Henriette Harmse (H)

European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK.

Nomi L Harris (NL)

Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.

Ian Harrow (I)

Pistoia Alliance Inc, USA.

Harshad B Hegde (HB)

Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.

Amelia L Hoyt (AL)

Beth Israel Deaconess Medical Center, Boston, MA 02215, USA.

Charles T Hoyt (CT)

Harvard Medical School, Boston, MA 02115, USA.

Dazhi Jiao (D)

Johns Hopkins University, Baltimore, MD 21210, USA.

Ernesto Jiménez-Ruiz (E)

City University of London, London EC1V 0HB, UK.
University of Oslo, Oslo 0315, Norway.

Simon Jupp (S)

SciBite Limited, Bio Data Innovation Centre, Wellcome Genome Campus, Hinxton, Saffron Walden CB10 1DR, UK.

Hyeongsik Kim (H)

Robert Bosch LLC, Sunnyvale, CA 94085, USA.

Sebastian Koehler (S)

Ada Health GmbH, Berlin 10178, Germany.

Thomas Liener (T)

Pistoia Alliance Inc, USA.

Qinqin Long (Q)

Leiden University Medical Center, Leiden 2333 ZA, The Netherlands.

James Malone (J)

BenchSci, 25 York St Suite 1100, Toronto, ON M5J 2V5, Canada.

James A McLaughlin (JA)

European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK.

Julie A McMurry (JA)

University of Colorado Anschutz Medical Campus, Aurora, CO 80217, USA.

Sierra Moxon (S)

Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.

Monica C Munoz-Torres (MC)

University of Colorado Anschutz Medical Campus, Aurora, CO 80217, USA.

David Osumi-Sutherland (D)

European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK.

James A Overton (JA)

Knocean Inc., Toronto, ON M6P 2T3, Canada.

Bjoern Peters (B)

La Jolla Institute for Immunology, 9420 Athena Circle, La Jolla, CA 92037, USA.

Tim Putman (T)

University of Colorado Anschutz Medical Campus, Aurora, CO 80217, USA.

Núria Queralt-Rosinach (N)

Leiden University Medical Center, Leiden 2333 ZA, The Netherlands.

Kent Shefchek (K)

University of Colorado Anschutz Medical Campus, Aurora, CO 80217, USA.

Harold Solbrig (H)

Johns Hopkins University, Baltimore, MD 21210, USA.

Anne Thessen (A)

University of Colorado Anschutz Medical Campus, Aurora, CO 80217, USA.

Tania Tudorache (T)

Independent Scholar.

Nicole Vasilevsky (N)

University of Colorado Anschutz Medical Campus, Aurora, CO 80217, USA.

Alex H Wagner (AH)

The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH 43205, USA.
The Ohio State University College of Medicine, Columbus, OH 43210, USA.

Christopher J Mungall (CJ)

Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
