The Data Use Ontology to streamline responsible access to human biomedical datasets.

Keywords: FAIR; GA4GH; automated data access; consent; controlled access; data access; data restrictions; ontology; secondary data use; standard

Journal

Cell genomics
ISSN: 2666-979X
Abbreviated title: Cell Genom
Country: United States
NLM ID: 9918284260106676

Publication information

Publication date:
10 Nov 2021
History:
received: 28 02 2021
revised: 02 07 2021
accepted: 09 08 2021
entrez: 25 11 2021
pubmed: 26 11 2021
medline: 26 11 2021
Status: epublish

Abstract

Human biomedical datasets that are critical for research and clinical studies to benefit human health also often contain sensitive or potentially identifying information of individual participants. Thus, care must be taken when they are processed and made available to comply with ethical and regulatory frameworks and informed consent data conditions. To enable and streamline data access for these biomedical datasets, the Global Alliance for Genomics and Health (GA4GH) Data Use and Researcher Identities (DURI) work stream developed and approved the Data Use Ontology (DUO) standard. DUO is a hierarchical vocabulary of human and machine-readable data use terms that consistently and unambiguously represents a dataset's allowable data uses. DUO has been implemented by major international stakeholders such as the Broad and Sanger Institutes and is currently used in annotation of over 200,000 datasets worldwide. Using DUO in data management and access facilitates researchers' discovery and access of relevant datasets. DUO annotations increase the FAIRness of datasets and support data linkages using common data use profiles when integrating the data for secondary analyses. DUO is implemented in the Web Ontology Language (OWL) and, to increase community awareness and engagement, hosted in an open, centralized GitHub repository. DUO, together with the GA4GH Passport standard, offers a new, efficient, and streamlined data authorization and access framework that has enabled increased sharing of biomedical datasets worldwide.
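The hierarchical matching that the abstract describes can be illustrated with a minimal sketch. It assumes a toy three-level chain of data use permissions (general research use, then health/medical/biomedical research, then disease-specific research); the dictionary `PARENT` and the functions `ancestors` and `is_permitted` are hypothetical names introduced here, not part of DUO or of any GA4GH tooling. The actual term definitions and relationships live in the DUO OWL file.

```python
# Toy child -> parent links between data use permission terms.
# Simplified for illustration only; the DUO OWL file is the authority.
PARENT = {
    "disease specific research": "health or medical or biomedical research",
    "health or medical or biomedical research": "general research use",
    "general research use": None,
}


def ancestors(term):
    """Return the term followed by each of its broader terms, narrowest first."""
    chain = []
    while term is not None:
        chain.append(term)
        term = PARENT[term]
    return chain


def is_permitted(dataset_term, request_term):
    """Assumed matching rule: a request is allowed when the dataset's
    permission is the requested purpose itself or a broader term above it."""
    return dataset_term in ancestors(request_term)


# A dataset consented for general research use covers a disease-specific request.
print(is_permitted("general research use", "disease specific research"))
# A dataset limited to disease-specific research does not cover general research.
print(is_permitted("disease specific research", "general research use"))
```

Real access decisions also involve modifiers and human review by a data access committee; this sketch covers only the subsumption check that a term hierarchy makes possible.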

Identifiers

pubmed: 34820659
doi: 10.1016/j.xgen.2021.100028
pii: S2666-979X(21)00035-5
pmc: PMC8591903

Publication types

Journal Article

Languages

eng

Grants

Agency: NHGRI NIH HHS
ID: U24 HG006941
Country: United States
Agency: NIH HHS
ID: R24 OD011883
Country: United States
Agency: NHGRI NIH HHS
ID: U24 HG010262
Country: United States
Agency: Wellcome Trust
Country: United Kingdom
Agency: NHGRI NIH HHS
ID: RM1 HG010860
Country: United States

Copyright information

© 2021 The Author(s).

Conflict of interest statement

M.N.C. is an employee of Foundation Medicine and an equity holder of Roche. A.A.P. is a venture partner at GV and an employee of Alphabet Corporation. He has received funding from MSFT, Verily, IBM, Intel, Bayer, and Novartis. The views expressed by L.L.R. are the author's own and do not necessarily represent those of her organization.

Authors

Jonathan Lawson (J)

Broad Institute of Harvard and the Massachusetts Institute of Technology, Cambridge, MA, USA.

Moran N Cabili (MN)

Broad Institute of Harvard and the Massachusetts Institute of Technology, Cambridge, MA, USA.

Giselle Kerry (G)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Hinxton, UK.

Tiffany Boughtwood (T)

Australian Genomics, Murdoch Children's Research Institute, Parkville, VIC, Australia.

Adrian Thorogood (A)

Centre of Genomics and Policy, Department of Human Genetics, McGill University, Montreal, QC, Canada.
ELIXIR-Luxembourg, Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, Luxembourg.

Pinar Alper (P)

ELIXIR-Luxembourg, Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, Luxembourg.

Sarion R Bowers (SR)

Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK.

Rebecca R Boyles (RR)

RTI International, Research Triangle Park, NC, USA.

Anthony J Brookes (AJ)

University of Leicester, Leicester, UK.

Matthew Brush (M)

University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

Tony Burdett (T)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Hinxton, UK.

Hayley Clissold (H)

Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK.

Stacey Donnelly (S)

Broad Institute of Harvard and the Massachusetts Institute of Technology, Cambridge, MA, USA.

Stephanie O M Dyke (SOM)

McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, Department of Neurology & Neurosurgery, Faculty of Medicine, McGill University, Montreal, QC, Canada.

Mallory A Freeberg (MA)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Hinxton, UK.

Melissa A Haendel (MA)

University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

Chihiro Hata (C)

Bioinformation and DDBJ Center, National Institute of Genetics, Mishima, Japan.

Petr Holub (P)

BBMRI-ERIC, AT and Masaryk University, Brno, Czech Republic.

Francis Jeanson (F)

University Health Network, Toronto, ON, Canada.

Aina Jene (A)

Centre de Regulació Genòmica (CRG), Barcelona, Spain.

Minae Kawashima (M)

National Bioscience Database Center, Japan Science and Technology Agency, Tokyo, Japan.

Shuichi Kawashima (S)

Database Center for Life Science, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Kashiwa, Japan.

Melissa Konopko (M)

ELIXIR Hub, Wellcome Genome Campus, Hinxton, UK.

Irene Kyomugisha (I)

Division of Human Genetics, Faculty of Health Sciences, University of Cape Town, Cape Town, South Africa.

Haoyuan Li (H)

Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, Canada.

Mikael Linden (M)

ELIXIR-Finland, CSC - IT Center for Science Ltd, Espoo, Finland.

Laura Lyman Rodriguez (LL)

Patient-Centered Outcomes Research Institute, Washington, DC, USA.

Mizuki Morita (M)

Okayama University, Okayama, Japan.

Nicola Mulder (N)

Computational Biology Division, IDM, Faculty of Health Sciences, University of Cape Town, Cape Town, South Africa.

Jean Muller (J)

Laboratoire de Génétique Médicale, Institut de Génétique Médicale d'Alsace, INSERM U1112, Université de Strasbourg, Strasbourg, France.
Laboratoire de Diagnostic Génétique, Institut de Génétique Médicale d'Alsace, Hôpitaux Universitaires de Strasbourg, Strasbourg, France.

Satoshi Nagaie (S)

Tohoku Medical Megabank Organization (ToMMo), Tohoku University, Sendai, Japan.

Jamal Nasir (J)

Department of Life Sciences, University of Northampton, Northampton, UK.

Soichi Ogishima (S)

Tohoku Medical Megabank Organization (ToMMo), Tohoku University, Sendai, Japan.

Vivian Ota Wang (V)

Office of Data Sharing, National Cancer Institute, NIH, Rockville, MD, USA.

Laura D Paglione (LD)

Spherical Cow Group, Rego Park, NY 11374, USA.

Ravi N Pandya (RN)

Microsoft Research, Redmond, WA 98052, USA.

Helen Parkinson (H)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Hinxton, UK.

Anthony A Philippakis (AA)

Broad Institute of Harvard and the Massachusetts Institute of Technology, Cambridge, MA, USA.

Fabian Prasser (F)

Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany.

Jordi Rambla (J)

Centre de Regulació Genòmica (CRG), Barcelona, Spain.

Kathy Reinold (K)

Broad Institute of Harvard and the Massachusetts Institute of Technology, Cambridge, MA, USA.

Gregory A Rushton (GA)

Broad Institute of Harvard and the Massachusetts Institute of Technology, Cambridge, MA, USA.

Andrea Saltzman (A)

Broad Institute of Harvard and the Massachusetts Institute of Technology, Cambridge, MA, USA.

Gary Saunders (G)

ELIXIR Hub, Wellcome Genome Campus, Hinxton, UK.

Heidi J Sofia (HJ)

National Human Genome Research Institute, NIH, Bethesda, MD, USA.

John D Spalding (JD)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Hinxton, UK.

Morris A Swertz (MA)

Genomics Coordination Center, Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands.

Ilia Tulchinsky (I)

Google Cloud, Kitchener, ON N2H 5G5, Canada.

Esther J van Enckevort (EJ)

Genomics Coordination Center, Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands.

Susheel Varma (S)

Health Data Research UK, Gibbs Building, 215 Euston Road, London NW1 2BE, UK.

Craig Voisin (C)

Google Cloud, Kitchener, ON N2H 5G5, Canada.

Natsuko Yamamoto (N)

Osaka University, Osaka, Japan.

Chisato Yamasaki (C)

Osaka University, Osaka, Japan.

Lyndon Zass (L)

Computational Biology Division, IDM, Faculty of Health Sciences, University of Cape Town, Cape Town, South Africa.

Jaime M Guidry Auvil (JM)

Office of Data Sharing, National Cancer Institute, NIH, Rockville, MD, USA.

Tommi H Nyrönen (TH)

ELIXIR-Finland, CSC - IT Center for Science Ltd, Espoo, Finland.

Mélanie Courtot (M)

European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Hinxton, UK.
