Expanding the Galaxy's reference data.


Journal

Bioinformatics advances
ISSN: 2635-0041
Titre abrégé: Bioinform Adv
Pays: England
ID NLM: 9918282081306676

Informations de publication

Date de publication:
2022
Historique:
received: 12 11 2021
revised: 01 04 2022
accepted: 26 04 2022
entrez: 7 6 2022
pubmed: 8 6 2022
medline: 8 6 2022
Statut: epublish

Résumé

Properly and effectively managing reference datasets is an important task for many bioinformatics analyses. Refgenie is a reference asset management system that allows users to easily organize, retrieve and share such datasets. Here, we describe the integration of refgenie into the Galaxy platform. Server administrators are able to configure Galaxy to make use of reference datasets made available on a refgenie instance. In addition, a Galaxy Data Manager tool has been developed to provide a graphical interface to refgenie's remote reference retrieval functionality. A large collection of reference datasets has also been made available using the CVMFS (CernVM File System) repository from GalaxyProject.org, with mirrors across the USA, Canada, Europe and Australia, enabling easy use outside of Galaxy. The ability of Galaxy to use refgenie assets was added to the core Galaxy framework in version 22.01, which is available from https://github.com/galaxyproject/galaxy under the Academic Free License version 3.0. The refgenie Data Manager tool can be installed via the Galaxy ToolShed, with source code managed at https://github.com/BlankenbergLab/galaxy-tools-blankenberg/tree/main/data_managers/data_manager_refgenie_pull and released using an MIT license. Access to existing data is also available through CVMFS, with instructions at https://galaxyproject.org/admin/reference-data-repo/. No new data were generated or analyzed in support of this research.

Identifiants

pubmed: 35669346
doi: 10.1093/bioadv/vbac030
pii: vbac030
pmc: PMC9155181
doi:

Types de publication

Journal Article

Langues

eng

Pagination

vbac030

Subventions

Organisme : Biotechnology and Biological Sciences Research Council
ID : BBS/E/T/000PR9817
Pays : United Kingdom
Organisme : NCI NIH HHS
ID : U24 CA231877
Pays : United States
Organisme : NHGRI NIH HHS
ID : U24 HG006620
Pays : United States

Informations de copyright

© The Author(s) 2022. Published by Oxford University Press.

Références

Bioinformatics. 2009 Jul 15;25(14):1754-60
pubmed: 19451168
Genome Res. 2005 Oct;15(10):1451-5
pubmed: 16169926
Bioinformatics. 2014 Jul 1;30(13):1917-9
pubmed: 24585771
Genome Biol. 2014 Mar 03;15(3):R46
pubmed: 24580807
Nat Methods. 2012 Mar 04;9(4):357-9
pubmed: 22388286
Bioinformatics. 2013 Jan 1;29(1):15-21
pubmed: 23104886
Nat Methods. 2018 Jul;15(7):475-476
pubmed: 29967506
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
Nucleic Acids Res. 2020 Aug 20;48(14):8205-8207
pubmed: 32585001
Genome Biol. 2014 Feb 20;15(2):403
pubmed: 25001293
Gigascience. 2020 Feb 1;9(2):
pubmed: 31995185

Auteurs

Nagampalli VijayKrishna (N)

Genomic Medicine Institute, Cleveland Clinic, Cleveland, OH 44195, USA.

Jayadev Joshi (J)

Genomic Medicine Institute, Cleveland Clinic, Cleveland, OH 44195, USA.

Nate Coraor (N)

Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA 16802, USA.

Jennifer Hillman-Jackson (J)

Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA 16802, USA.

Dave Bouvier (D)

Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA 16802, USA.

Marius van den Beek (M)

Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA 16802, USA.

Ignacio Eguinoa (I)

VIB Center for Plant Systems Biology, 9052 Ghent, Belgium.
Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.

Frederik Coppens (F)

VIB Center for Plant Systems Biology, 9052 Ghent, Belgium.
Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.

John Davis (J)

Department of Biology, Johns Hopkins University, Baltimore, MD 21218, USA.

Michał Stolarczyk (M)

Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22903, USA.

Nathan C Sheffield (NC)

Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22903, USA.

Simon Gladman (S)

University of Melbourne, Melbourne, VIC, Australia.

Gianmauro Cuccuru (G)

University of Freiburg, Freiburg im Breisgau, Germany.

Björn Grüning (B)

University of Freiburg, Freiburg im Breisgau, Germany.

Nicola Soranzo (N)

Earlham Institute, Norwich Research Park, Norwich, UK.

Helena Rasche (H)

Clinical Bioinformatics Group, Department of Pathology, Erasmus Medical Center, 3015 CN Rotterdam, The Netherlands.

Bradley W Langhorst (BW)

New England Biolabs, Ipswich, MA 01938, USA.

Matthias Bernt (M)

Department Computational Biology, Helmholtz Centre for Environmental Research, UFZ, 04318 Leipzig, Germany.

Dan Fornika (D)

BC Centre for Disease Control Public Health Laboratory, Vancouver, BC, Canada.

David Anderson de Lima Morais (DA)

Centre de Calcul Scientifique, Université de Sherbrooke, Sherbrooke, QC, Canada.

Michel Barrette (M)

Centre de Calcul Scientifique, Université de Sherbrooke, Sherbrooke, QC, Canada.

Peter van Heusden (P)

South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Bellville, South Africa.

Mauro Petrillo (M)

European Commission, Joint Research Centre (JRC), Ispra, Italy.

Antonio Puertas-Gallardo (A)

European Commission, Joint Research Centre (JRC), Ispra, Italy.

Alex Patak (A)

European Commission, Joint Research Centre (JRC), Ispra, Italy.

Hans-Rudolf Hotz (HR)

Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland.
SIB Swiss Institute of Bioinformatics, Basel, Switzerland.

Daniel Blankenberg (D)

Genomic Medicine Institute, Cleveland Clinic, Cleveland, OH 44195, USA.
Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH 44195, USA.

Classifications MeSH