PhenoMeNal: processing and analysis of metabolomics data in the cloud.

NMR cloud computing computational workflows data analysis e-infrastructures galaxy mass spectrometry metabolomics standardization statistics

Journal

GigaScience
ISSN: 2047-217X
Titre abrégé: Gigascience
Pays: United States
ID NLM: 101596872

Informations de publication

Date de publication:
01 02 2019
Historique:
accepted: 20 11 2018
revised: 19 10 2018
received: 06 09 2018
pubmed: 12 12 2018
medline: 25 6 2019
entrez: 12 12 2018
Statut: ppublish

Résumé

Metabolomics is the comprehensive study of a multitude of small molecules to gain insight into an organism's metabolism. The research field is dynamic and expanding with applications across biomedical, biotechnological, and many other applied biological domains. Its computationally intensive nature has driven requirements for open data formats, data repositories, and data analysis tools. However, the rapid progress has resulted in a mosaic of independent, and sometimes incompatible, analysis methods that are difficult to connect into a useful and complete data analysis solution. PhenoMeNal (Phenome and Metabolome aNalysis) is an advanced and complete solution to set up Infrastructure-as-a-Service (IaaS) that brings workflow-oriented, interoperable metabolomics data analysis platforms into the cloud. PhenoMeNal seamlessly integrates a wide array of existing open-source tools that are tested and packaged as Docker containers through the project's continuous integration process and deployed based on a kubernetes orchestration framework. It also provides a number of standardized, automated, and published analysis workflows in the user interfaces Galaxy, Jupyter, Luigi, and Pachyderm. PhenoMeNal constitutes a keystone solution in cloud e-infrastructures available for metabolomics. PhenoMeNal is a unique and complete solution for setting up cloud e-infrastructures through easy-to-use web interfaces that can be scaled to any custom public and private cloud environment. By harmonizing and automating software installation and configuration and through ready-to-use scientific workflow user interfaces, PhenoMeNal has succeeded in providing scientists with workflow-driven, reproducible, and shareable metabolomics data analysis platforms that are interfaced through standard data formats, representative datasets, versioned, and have been tested for reproducibility and interoperability. The elastic implementation of PhenoMeNal further allows easy adaptation of the infrastructure to other application areas and 'omics research domains.

Sections du résumé

BACKGROUND
Metabolomics is the comprehensive study of a multitude of small molecules to gain insight into an organism's metabolism. The research field is dynamic and expanding with applications across biomedical, biotechnological, and many other applied biological domains. Its computationally intensive nature has driven requirements for open data formats, data repositories, and data analysis tools. However, the rapid progress has resulted in a mosaic of independent, and sometimes incompatible, analysis methods that are difficult to connect into a useful and complete data analysis solution.
FINDINGS
PhenoMeNal (Phenome and Metabolome aNalysis) is an advanced and complete solution to set up Infrastructure-as-a-Service (IaaS) that brings workflow-oriented, interoperable metabolomics data analysis platforms into the cloud. PhenoMeNal seamlessly integrates a wide array of existing open-source tools that are tested and packaged as Docker containers through the project's continuous integration process and deployed based on a kubernetes orchestration framework. It also provides a number of standardized, automated, and published analysis workflows in the user interfaces Galaxy, Jupyter, Luigi, and Pachyderm.
CONCLUSIONS
PhenoMeNal constitutes a keystone solution in cloud e-infrastructures available for metabolomics. PhenoMeNal is a unique and complete solution for setting up cloud e-infrastructures through easy-to-use web interfaces that can be scaled to any custom public and private cloud environment. By harmonizing and automating software installation and configuration and through ready-to-use scientific workflow user interfaces, PhenoMeNal has succeeded in providing scientists with workflow-driven, reproducible, and shareable metabolomics data analysis platforms that are interfaced through standard data formats, representative datasets, versioned, and have been tested for reproducibility and interoperability. The elastic implementation of PhenoMeNal further allows easy adaptation of the infrastructure to other application areas and 'omics research domains.

Identifiants

pubmed: 30535405
pii: 5232984
doi: 10.1093/gigascience/giy149
pmc: PMC6377398
pii:
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/H024921/1
Pays : United Kingdom
Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/I000771/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/M009157/1
Pays : United Kingdom

Informations de copyright

© The Author(s) 2018. Published by Oxford University Press.

Références

Metabolomics. 2016;12:14
pubmed: 26612985
Sci Data. 2018 Aug 28;5:180179
pubmed: 30152810
Nat Protoc. 2014;9(6):1416-27
pubmed: 24853927
Magn Reson Chem. 2009 Dec;47 Suppl 1:S123-6
pubmed: 19821464
Anal Chem. 2012 Jun 5;84(11):5035-9
pubmed: 22533540
Nat Commun. 2017 Nov 14;8(1):1494
pubmed: 29133785
Nucleic Acids Res. 2016 Jan 4;44(D1):D463-70
pubmed: 26467476
Bioinformatics. 2010 Sep 15;26(18):2354-6
pubmed: 20679334
Metabolomics. 2017;13(4):36
pubmed: 28261014
Nat Genet. 2012 Jan 27;44(2):121-6
pubmed: 22281772
Metabolomics. 2018;14(4):37
pubmed: 29503602
Proc Natl Acad Sci U S A. 2015 Oct 13;112(41):12580-5
pubmed: 26392543
J Cheminform. 2016 Jan 29;8:3
pubmed: 26834843
Nat Protoc. 2016 Feb;12(2):310–328
pubmed: 28079878
Nat Biotechnol. 2012 Oct;30(10):918-20
pubmed: 23051804
BMC Bioinformatics. 2011 Sep 13;12:366
pubmed: 21914187
Metabolomics. 2007 Sep;3(3):211-221
pubmed: 24039616
SLAS Discov. 2017 Mar;22(3):238-249
pubmed: 27899692
Bioinformatics. 2019 Oct 1;35(19):3752-3760
pubmed: 30851093
Nat Biotechnol. 2007 Nov;25(11):1251-5
pubmed: 17989687
Curr Opin Chem Biol. 2017 Feb;36:58-63
pubmed: 28092796
J Proteome Res. 2015 Aug 7;14(8):3322-35
pubmed: 26088811
PLoS Comput Biol. 2015 Aug 27;11(8):e1004321
pubmed: 26313928
Nucleic Acids Res. 2016 Jul 8;44(W1):W3-W10
pubmed: 27137889
BMC Bioinformatics. 2008 Mar 26;9:163
pubmed: 18366760
BMC Bioinformatics. 2014;15 Suppl 1:S11
pubmed: 24564732
Anal Chem. 2008 Aug 15;80(16):6382-9
pubmed: 18627180
Physiol Genomics. 2007 Apr 24;29(2):99-108
pubmed: 17190852
Bioinformatics. 2010 Oct 15;26(20):2647-8
pubmed: 20829444
Expert Rev Mol Diagn. 2008 Sep;8(5):617-33
pubmed: 18785810
Curr Opin Biotechnol. 2015 Aug;34:189-201
pubmed: 25731751
Nat Rev Mol Cell Biol. 2006 Mar;7(3):198-210
pubmed: 16496022
Anal Chem. 2016 May 17;88(10):5179-88
pubmed: 27116637
Sci Data. 2016 Mar 15;3:160018
pubmed: 26978244
Nucleic Acids Res. 2018 Jul 2;46(W1):W495-W502
pubmed: 29718355
BMC Bioinformatics. 2017 Feb 3;18(1):88
pubmed: 28158972
Metabolomics. 2015;11(6):1587-1597
pubmed: 26491418
Expert Opin Drug Metab Toxicol. 2014 Jul;10(7):915-9
pubmed: 24905565
Int J Mol Sci. 2018 May 06;19(5):
pubmed: 29734799
PLoS Comput Biol. 2017 Dec 1;13(12):e1005839
pubmed: 29194434
Nucleic Acids Res. 2013 Jan;41(Database issue):D781-6
pubmed: 23109552
Nat Rev Genet. 2012 Sep;13(9):667-72
pubmed: 22898652
Theranostics. 2018 Aug 07;8(16):4477-4490
pubmed: 30214633
F1000Res. 2017 Sep 6;6:
pubmed: 29043062
Genome Biol. 2010;11(8):R86
pubmed: 20738864
Bioinformatics. 2015 May 1;31(9):1493-5
pubmed: 25527831
Bioinformatics. 2019 Mar 1;35(5):839-846
pubmed: 30101309
J Biomol NMR. 2013 Apr;55(4):355-67
pubmed: 23456039
Anal Chem. 2018 Jan 2;90(1):649-656
pubmed: 29035042
Anal Chem. 2012 Jan 3;84(1):283-9
pubmed: 22111785
Front Mol Biosci. 2016 Jun 21;3:26
pubmed: 27446929
Nat Genet. 2015 Jul;47(7):692-5
pubmed: 26111507
BMC Bioinformatics. 2015 Apr 16;16:118
pubmed: 25888443
Bioinformatics. 2017 Dec 01;33(23):3805-3807
pubmed: 29036536
Metabolomics. 2017;13(2):12
pubmed: 28090198
Angew Chem Int Ed Engl. 2019 Jan 21;58(4):968-994
pubmed: 29999221
Nucleic Acids Res. 2018 Jul 2;46(W1):W486-W494
pubmed: 29762782
Bioinformatics. 2013 Jul 01;29(13):1685-6
pubmed: 23630176
Bioinformatics. 2016 Jul 1;32(13):2067-8
pubmed: 27153725
J Cheminform. 2017 Oct 30;9(1):55
pubmed: 29086154
BMC Syst Biol. 2010 Oct 06;4:135
pubmed: 20925932
Gigascience. 2017 Nov 1;6(11):1-9
pubmed: 29048555
Biopreserv Biobank. 2015 Aug;13(4):263-70
pubmed: 26186169
PLoS One. 2016 Apr 29;11(4):e0154556
pubmed: 27128319
Bioinformatics. 2015 Oct 15;31(20):3383-6
pubmed: 26079347
PLoS Genet. 2016 Jan 21;12(1):e1005772
pubmed: 26796797
J Am Med Inform Assoc. 2016 Apr;23(e1):e131-7
pubmed: 26567325
Curr Opin Biotechnol. 2015 Aug;34:82-90
pubmed: 25531408
J Cheminform. 2016 Nov 24;8:67
pubmed: 27942268
Metabolomics. 2012 Oct;8(5):757-760
pubmed: 23060735

Auteurs

Kristian Peters (K)

Leibniz Institute of Plant Biochemistry, Stress and Developmental Biology, Weinberg 3, 06120 Halle (Saale), Germany.

James Bradbury (J)

School of Biosciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT, United Kingdom.

Sven Bergmann (S)

Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
Swiss Institute of Bioinformatics, Lausanne, Switzerland.

Marco Capuccini (M)

Division of Scientific Computing, Department of Information Technology, Uppsala University, Sweden.
Department of Pharmaceutical Biosciences, Uppsala University, Box 591, 751 24 Uppsala, Sweden.

Marta Cascante (M)

Department of Biochemistry and Molecular Biomedicine, Universitat de Barcelona; Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBEREHD), Instituto de Salud Carlos III (ISCIII), Spain.

Pedro de Atauri (P)

Department of Biochemistry and Molecular Biomedicine, Universitat de Barcelona; Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBEREHD), Instituto de Salud Carlos III (ISCIII), Spain.

Timothy M D Ebbels (TMD)

Department of Surgery & Cancer, Imperial College London, South Kensington, London, SW7 2AZ, United Kingdom.

Carles Foguet (C)

Department of Biochemistry and Molecular Biomedicine, Universitat de Barcelona; Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBEREHD), Instituto de Salud Carlos III (ISCIII), Spain.

Robert Glen (R)

Department of Surgery & Cancer, Imperial College London, South Kensington, London, SW7 2AZ, United Kingdom.
Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB21EW, United Kingdom.

Alejandra Gonzalez-Beltran (A)

Oxford e-Research Centre, Department of Engineering Science, University of Oxford, 7 Keble Road, OX1 3QG, Oxford, United Kingdom.

Ulrich L Günther (UL)

Institute of Cancer and Genomic Sciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT, United Kingdom.

Evangelos Handakas (E)

Department of Surgery & Cancer, Imperial College London, South Kensington, London, SW7 2AZ, United Kingdom.

Thomas Hankemeier (T)

Division of Systems Biomedicine and Pharmacology, Leiden Academic Centre for Drug Research (LACDR), Leiden University, Leiden, 2333 CC, The Netherlands.

Kenneth Haug (K)

European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.

Stephanie Herman (S)

Department of Pharmaceutical Biosciences, Uppsala University, Box 591, 751 24 Uppsala, Sweden.
Department of Medical Sciences, Clinical Chemistry, Uppsala University, 751 85 Uppsala, Sweden.

Petr Holub (P)

BBMRI-ERIC, Graz, Austria.

Massimiliano Izzo (M)

Oxford e-Research Centre, Department of Engineering Science, University of Oxford, 7 Keble Road, OX1 3QG, Oxford, United Kingdom.

Daniel Jacob (D)

INRA, University of Bordeaux, Plateforme Métabolome Bordeaux-MetaboHUB, 33140 Villenave d'Ornon, France.

David Johnson (D)

Oxford e-Research Centre, Department of Engineering Science, University of Oxford, 7 Keble Road, OX1 3QG, Oxford, United Kingdom.
Department of Informatics and Media, Uppsala University, Box 513, 751 20 Uppsala, Sweden.

Fabien Jourdan (F)

INRA - French National Institute for Agricultural Research, UMR1331, Toxalim, Research Centre in Food Toxicology, Toulouse, France.

Namrata Kale (N)

European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.

Ibrahim Karaman (I)

Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, St. Mary's Campus, Norfolk Place, W2 1PG, London, United Kingdom.

Bita Khalili (B)

Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
Swiss Institute of Bioinformatics, Lausanne, Switzerland.

Payam Emami Khonsari (P)

Department of Medical Sciences, Clinical Chemistry, Uppsala University, 751 85 Uppsala, Sweden.

Kim Kultima (K)

Department of Medical Sciences, Clinical Chemistry, Uppsala University, 751 85 Uppsala, Sweden.

Samuel Lampa (S)

Department of Pharmaceutical Biosciences, Uppsala University, Box 591, 751 24 Uppsala, Sweden.

Anders Larsson (A)

Department of Pharmaceutical Biosciences, Uppsala University, Box 591, 751 24 Uppsala, Sweden.
National Bioinformatics Infrastructure Sweden, Uppsala University, Uppsala, Sweden.

Christian Ludwig (C)

Institute of Metabolism and Systems Research (IMSR), University of Birmingham, Edgbaston, Birmingham, B15 2TT, United Kingdom.

Pablo Moreno (P)

European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.

Steffen Neumann (S)

Leibniz Institute of Plant Biochemistry, Stress and Developmental Biology, Weinberg 3, 06120 Halle (Saale), Germany.
German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Deutscher Platz 5e, 04103 Leipzig, Germany.

Jon Ander Novella (JA)

Department of Pharmaceutical Biosciences, Uppsala University, Box 591, 751 24 Uppsala, Sweden.
National Bioinformatics Infrastructure Sweden, Uppsala University, Uppsala, Sweden.

Claire O'Donovan (C)

European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.

Jake T M Pearce (JTM)

Department of Surgery & Cancer, Imperial College London, South Kensington, London, SW7 2AZ, United Kingdom.

Alina Peluso (A)

Department of Surgery & Cancer, Imperial College London, South Kensington, London, SW7 2AZ, United Kingdom.

Marco Enrico Piras (ME)

Distributed Computing Group, CRS4, Pula, Italy.

Luca Pireddu (L)

Distributed Computing Group, CRS4, Pula, Italy.

Michelle A C Reed (MAC)

Institute of Cancer and Genomic Sciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT, United Kingdom.

Philippe Rocca-Serra (P)

Oxford e-Research Centre, Department of Engineering Science, University of Oxford, 7 Keble Road, OX1 3QG, Oxford, United Kingdom.

Pierrick Roger (P)

CEA, LIST, Laboratory for Data Analysis and Systems' Intelligence, MetaboHUB, Gif-Sur-Yvette F-91191, France.

Antonio Rosato (A)

Magnetic Resonance Center (CERM) and Department of Chemistry, University of Florence and CIRMMP, 50019 Sesto Fiorentino, Florence, Italy.

Rico Rueedi (R)

Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
Swiss Institute of Bioinformatics, Lausanne, Switzerland.

Christoph Ruttkies (C)

Leibniz Institute of Plant Biochemistry, Stress and Developmental Biology, Weinberg 3, 06120 Halle (Saale), Germany.

Noureddin Sadawi (N)

Department of Computer Science, College of Engineering, Design and Physical Sciences, Brunel University, London, UK.
Department of Surgery & Cancer, Imperial College London, South Kensington, London, SW7 2AZ, United Kingdom.

Reza M Salek (RM)

European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.

Susanna-Assunta Sansone (SA)

Oxford e-Research Centre, Department of Engineering Science, University of Oxford, 7 Keble Road, OX1 3QG, Oxford, United Kingdom.

Vitaly Selivanov (V)

Department of Biochemistry and Molecular Biomedicine, Universitat de Barcelona; Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBEREHD), Instituto de Salud Carlos III (ISCIII), Spain.

Ola Spjuth (O)

Department of Pharmaceutical Biosciences, Uppsala University, Box 591, 751 24 Uppsala, Sweden.

Daniel Schober (D)

Leibniz Institute of Plant Biochemistry, Stress and Developmental Biology, Weinberg 3, 06120 Halle (Saale), Germany.

Etienne A Thévenot (EA)

CEA, LIST, Laboratory for Data Analysis and Systems' Intelligence, MetaboHUB, Gif-Sur-Yvette F-91191, France.

Mattia Tomasoni (M)

Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
Swiss Institute of Bioinformatics, Lausanne, Switzerland.

Merlijn van Rijswijk (M)

Netherlands Metabolomics Center, Leiden, 2333 CC, Netherlands.
ELIXIR-NL, Dutch Techcentre for Life Sciences, Utrecht, 3503 RM, Netherlands.

Michael van Vliet (M)

Division of Systems Biomedicine and Pharmacology, Leiden Academic Centre for Drug Research (LACDR), Leiden University, Leiden, 2333 CC, The Netherlands.

Mark R Viant (MR)

School of Biosciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT, United Kingdom.
Phenome Centre Birmingham, University of Birmingham, Edgbaston, Birmingham, B15 2TT, United Kingdom.

Ralf J M Weber (RJM)

School of Biosciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT, United Kingdom.
Phenome Centre Birmingham, University of Birmingham, Edgbaston, Birmingham, B15 2TT, United Kingdom.

Gianluigi Zanetti (G)

Distributed Computing Group, CRS4, Pula, Italy.

Christoph Steinbeck (C)

Cheminformatics and Computational Metabolomics, Institute for Analytical Chemistry, Lessingstr. 8, 07743 Jena, Germany.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH