A Simple Standard for Sharing Ontological Mappings (SSSOM).
Journal
Database : the journal of biological databases and curation
ISSN: 1758-0463
Abbreviated title: Database (Oxford)
Country: England
NLM ID: 101517697
Publication information
Publication date:
25 05 2022
History:
received: 15 12 2021
revised: 08 03 2022
accepted: 11 05 2022
entrez: 26 5 2022
pubmed: 27 5 2022
medline: 28 5 2022
Status:
ppublish
Abstract
Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for mapping between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Mappings often lack the metadata needed to be correctly interpreted and applied. For example, are two terms equivalent or merely related? Are they narrow or broad matches? Or are they associated in some other way? Such relationships between the mapped terms are often not documented, which leads to incorrect assumptions and makes them hard to use in scenarios that require a high degree of precision (such as diagnostics or risk prediction). Furthermore, the lack of descriptions of how mappings were done makes it hard to combine and reconcile mappings, particularly curated and automated ones. We have developed the Simple Standard for Sharing Ontological Mappings (SSSOM) which addresses these problems by: (i) Introducing a machine-readable and extensible vocabulary to describe metadata that makes imprecision, inaccuracy and incompleteness in mappings explicit. (ii) Defining an easy-to-use simple table-based format that can be integrated into existing data science pipelines without the need to parse or query ontologies, and that integrates seamlessly with Linked Data principles. (iii) Implementing open and community-driven collaborative workflows that are designed to evolve the standard continuously to address changing requirements and mapping practices. (iv) Providing reference tools and software libraries for working with the standard. In this paper, we present the SSSOM standard, describe several use cases in detail and survey some of the existing work on standardizing the exchange of mappings, with the goal of making mappings Findable, Accessible, Interoperable and Reusable (FAIR). The SSSOM specification can be found at http://w3id.org/sssom/spec. 
Database URL: http://w3id.org/sssom/spec.
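The table-based format described in the abstract can be illustrated with a minimal sketch. It assumes a tab-separated file with the columns `subject_id`, `predicate_id`, `object_id` and `confidence`; the identifiers, scores and helper function names below are hypothetical, and real SSSOM files may carry additional metadata and columns that this sketch ignores.

```python
import csv
import io

# Hypothetical mapping rows; identifiers and confidence scores are invented
# for illustration and do not come from any real mapping set.
SAMPLE_TSV = (
    "subject_id\tpredicate_id\tobject_id\tconfidence\n"
    "EX:0001\tskos:exactMatch\tOTHER:A1\t0.95\n"
    "EX:0002\tskos:broadMatch\tOTHER:B7\t0.80\n"
    "EX:0003\tskos:relatedMatch\tOTHER:C3\t0.40\n"
)


def load_mappings(text):
    """Parse tab-separated mapping rows into a list of dictionaries."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    return list(reader)


def select_precise(mappings, predicate="skos:exactMatch", min_confidence=0.9):
    """Keep only mappings with the given predicate and a high enough confidence."""
    return [
        m
        for m in mappings
        if m["predicate_id"] == predicate
        and float(m["confidence"]) >= min_confidence
    ]


mappings = load_mappings(SAMPLE_TSV)
precise = select_precise(mappings)
for m in precise:
    print(m["subject_id"], m["predicate_id"], m["object_id"])
```

Because the relationship between the mapped terms is recorded explicitly in each row, a precision-sensitive application can restrict itself to exact matches with ordinary table filtering and without parsing an ontology.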
Identifiers
pubmed: 35616100
pii: 6591806
doi: 10.1093/database/baac035
pmc: PMC9216545
Publication types
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Languages
eng
Citation subsets
IM
Grants
Agency: NIH HHS
ID: R24 OD011883
Country: United States
Agency: NHGRI NIH HHS
ID: RM1 HG010860
Country: United States
Agency: NHGRI NIH HHS
ID: U24 HG012212
Country: United States
Copyright information
© The Author(s) 2022. Published by Oxford University Press.