Laniakea: an open solution to provide Galaxy "on-demand" instances over heterogeneous cloud infrastructures.


Journal

GigaScience
ISSN: 2047-217X
Titre abrégé: Gigascience
Pays: United States
ID NLM: 101596872

Informations de publication

Date de publication:
01 04 2020
Historique:
received: 10 01 2020
revised: 13 03 2020
accepted: 17 03 2020
entrez: 7 4 2020
pubmed: 7 4 2020
medline: 12 6 2021
Statut: ppublish

Résumé

While the popular workflow manager Galaxy is currently made available through several publicly accessible servers, there are scenarios where users can be better served by full administrative control over a private Galaxy instance, including, but not limited to, concerns about data privacy, customisation needs, prioritisation of particular job types, tools development, and training activities. In such cases, a cloud-based Galaxy virtual instance represents an alternative that equips the user with complete control over the Galaxy instance itself without the burden of the hardware and software infrastructure involved in running and maintaining a Galaxy server. We present Laniakea, a complete software solution to set up a "Galaxy on-demand" platform as a service. Building on the INDIGO-DataCloud software stack, Laniakea can be deployed over common cloud architectures usually supported both by public and private e-infrastructures. The user interacts with a Laniakea-based service through a simple front-end that allows a general setup of a Galaxy instance, and then Laniakea takes care of the automatic deployment of the virtual hardware and the software components. At the end of the process, the user gains access with full administrative privileges to a private, production-grade, fully customisable, Galaxy virtual instance and to the underlying virtual machine (VM). Laniakea features deployment of single-server or cluster-backed Galaxy instances, sharing of reference data across multiple instances, data volume encryption, and support for VM image-based, Docker-based, and Ansible recipe-based Galaxy deployments. A Laniakea-based Galaxy on-demand service, named Laniakea@ReCaS, is currently hosted at the ELIXIR-IT ReCaS cloud facility. Laniakea offers to scientific e-infrastructures a complete and easy-to-use software solution to provide a Galaxy on-demand service to their users. Laniakea-based cloud services will help in making Galaxy more accessible to a broader user base by removing most of the burdens involved in deploying and running a Galaxy service. In turn, this will facilitate the adoption of Galaxy in scenarios where classic public instances do not represent an optimal solution. Finally, the implementation of Laniakea can be easily adapted and expanded to support different services and platforms beyond Galaxy.

Sections du résumé

BACKGROUND
While the popular workflow manager Galaxy is currently made available through several publicly accessible servers, there are scenarios where users can be better served by full administrative control over a private Galaxy instance, including, but not limited to, concerns about data privacy, customisation needs, prioritisation of particular job types, tools development, and training activities. In such cases, a cloud-based Galaxy virtual instance represents an alternative that equips the user with complete control over the Galaxy instance itself without the burden of the hardware and software infrastructure involved in running and maintaining a Galaxy server.
RESULTS
We present Laniakea, a complete software solution to set up a "Galaxy on-demand" platform as a service. Building on the INDIGO-DataCloud software stack, Laniakea can be deployed over common cloud architectures usually supported both by public and private e-infrastructures. The user interacts with a Laniakea-based service through a simple front-end that allows a general setup of a Galaxy instance, and then Laniakea takes care of the automatic deployment of the virtual hardware and the software components. At the end of the process, the user gains access with full administrative privileges to a private, production-grade, fully customisable, Galaxy virtual instance and to the underlying virtual machine (VM). Laniakea features deployment of single-server or cluster-backed Galaxy instances, sharing of reference data across multiple instances, data volume encryption, and support for VM image-based, Docker-based, and Ansible recipe-based Galaxy deployments. A Laniakea-based Galaxy on-demand service, named Laniakea@ReCaS, is currently hosted at the ELIXIR-IT ReCaS cloud facility.
CONCLUSIONS
Laniakea offers to scientific e-infrastructures a complete and easy-to-use software solution to provide a Galaxy on-demand service to their users. Laniakea-based cloud services will help in making Galaxy more accessible to a broader user base by removing most of the burdens involved in deploying and running a Galaxy service. In turn, this will facilitate the adoption of Galaxy in scenarios where classic public instances do not represent an optimal solution. Finally, the implementation of Laniakea can be easily adapted and expanded to support different services and platforms beyond Galaxy.

Identifiants

pubmed: 32252069
pii: 5816668
doi: 10.1093/gigascience/giaa033
pmc: PMC7136032
pii:
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© The Author(s) 2020. Published by Oxford University Press.

Références

Bioinformatics. 2009 Jul 15;25(14):1754-60
pubmed: 19451168
PLoS Comput Biol. 2019 Jun 27;15(6):e1006923
pubmed: 31246949
Bioinformatics. 2019 Oct 1;35(19):3752-3760
pubmed: 30851093
Nat Rev Genet. 2016 Aug 16;17(9):507-22
pubmed: 27528417
Genome Res. 2011 Jun;21(6):830-9
pubmed: 21460062
Genome Med. 2016 Dec 19;8(1):134
pubmed: 27993174
F1000Res. 2018 Aug 6;7:
pubmed: 30254736
Nature. 2015 Oct 1;526(7571):68-74
pubmed: 26432245
Brief Bioinform. 2019 Mar 22;20(2):405-415
pubmed: 29028883
Brief Bioinform. 2019 Sep 27;20(5):1795-1811
pubmed: 30084865
Trends Biotechnol. 2017 Jun;35(6):479-480
pubmed: 28527489
Genome Biol. 2009;10(3):R25
pubmed: 19261174
BMC Bioinformatics. 2010 Dec 21;11 Suppl 12:S4
pubmed: 21210983
Nucleic Acids Res. 2001 Jan 1;29(1):308-11
pubmed: 11125122
Bioinformatics. 2010 Mar 1;26(5):589-95
pubmed: 20080505
Bioinformatics. 2007 Jul 15;23(14):1713-7
pubmed: 17485425
Nucleic Acids Res. 2018 Jul 2;46(W1):W537-W544
pubmed: 29790989
BMC Med Inform Decis Mak. 2015 Mar 19;15:17
pubmed: 25888747
BMC Genomics. 2018 Feb 5;19(1):120
pubmed: 29402227
Brief Bioinform. 2019 Mar 22;20(2):398-404
pubmed: 28968751
Genome Res. 2010 Sep;20(9):1297-303
pubmed: 20644199
Nucleic Acids Res. 2016 Jul 8;44(W1):W3-W10
pubmed: 27137889
Gigascience. 2019 Feb 1;8(2):
pubmed: 30535405
Trends Biotechnol. 2017 Jun;35(6):481-483
pubmed: 28117091
Sci Rep. 2019 Nov 26;9(1):17550
pubmed: 31772190
Cell Syst. 2018 Jun 27;6(6):752-758.e1
pubmed: 29953864
Nat Biotechnol. 2011 Nov 08;29(11):972-4
pubmed: 22068528
Nat Methods. 2012 Mar 04;9(4):357-9
pubmed: 22388286
Nat Rev Genet. 2018 Apr;19(4):208-219
pubmed: 29379135
Nucleic Acids Res. 2017 Jul 3;45(W1):W560-W566
pubmed: 28582575
Gigascience. 2016 Jul 11;5(1):30
pubmed: 27401684
Yearb Med Inform. 2014 May 22;9:8-13
pubmed: 24853034
Brief Bioinform. 2018 Sep 28;19(5):1035-1050
pubmed: 28419324
BMC Bioinformatics. 2012 Nov 27;13:315
pubmed: 23181507
PLoS One. 2015 Oct 26;10(10):e0140829
pubmed: 26501966
J Biomed Inform. 2014 Jun;49:119-33
pubmed: 24462600

Auteurs

Marco Antonio Tangaro (MA)

Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR), Via Giovanni Amendola 122/O, 70126 Bari, Italy.

Giacinto Donvito (G)

National Institute for Nuclear Physics (INFN), Section of Bari, Via Orabona 4, 70126 Bari, Italy.

Marica Antonacci (M)

National Institute for Nuclear Physics (INFN), Section of Bari, Via Orabona 4, 70126 Bari, Italy.

Matteo Chiara (M)

Department of Biosciences, University of Milan, via Celoria 26, 20133 Milano, Italy.

Pietro Mandreoli (P)

Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR), Via Giovanni Amendola 122/O, 70126 Bari, Italy.
Department of Biosciences, University of Milan, via Celoria 26, 20133 Milano, Italy.

Graziano Pesole (G)

Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR), Via Giovanni Amendola 122/O, 70126 Bari, Italy.
Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari, Via Orabona 4, 70126 Bari, Italy.

Federico Zambelli (F)

Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR), Via Giovanni Amendola 122/O, 70126 Bari, Italy.
Department of Biosciences, University of Milan, via Celoria 26, 20133 Milano, Italy.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Humans Adult Male Female Video Games

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Cephalometry Humans Anatomic Landmarks Software Internet

Classifications MeSH