Perspectives of using Cloud computing in integrative analysis of multi-omics data.

Cloud computing Kubernetes computational genomics containerization; integration omics data processing orchestration scaling R

Journal

Briefings in functional genomics
ISSN: 2041-2657
Titre abrégé: Brief Funct Genomics
Pays: England
ID NLM: 101528229

Informations de publication

Date de publication:
17 07 2021
Historique:
received: 02 11 2020
revised: 25 01 2021
accepted: 26 01 2021
pubmed: 7 3 2021
medline: 26 10 2021
entrez: 6 3 2021
Statut: ppublish

Résumé

Integrative analysis of multi-omics data is usually computationally demanding. It frequently requires building complex, multi-step analysis pipelines, applying dedicated techniques for data processing and combining several data sources. These efforts lead to a better understanding of life processes, current health state or the effects of therapeutic activities. However, many omics data analysis solutions focus only on a selected problem, disease, types of data or organisms. Moreover, they are implemented for general-purpose scientific computational platforms that most often do not easily scale the calculations natively. These features are not conducive to advances in understanding genotype-phenotypic relationships. Fortunately, with new technological paradigms, including Cloud computing, virtualization and containerization, these functionalities could be orchestrated for easy scaling and building independent analysis pipelines for omics data. Therefore, solutions can be re-used for purposes that they were not primarily designed. This paper shows perspectives of using Cloud computing advances and containerization approach for such a purpose. We first review how the Cloud computing model is utilized in multi-omics data analysis and show weak points of the adopted solutions. Then, we introduce containerization concepts, which allow both scaling and linking of functional services designed for various purposes. Finally, on the Bioconductor software package example, we disclose a verified concept model of a universal solution that exhibits the potentials for performing integrative analysis of multiple omics data sources.

Identifiants

pubmed: 33676373
pii: 6155979
doi: 10.1093/bfgp/elab007
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

198-206

Subventions

Organisme : Rector of the Silesian University of Technology, Gliwice, Poland
ID : 02/020/RGPL9/0184
Organisme : Statutory Research funds of Department of Applied Informatics, Silesian University of Technology, Gliwice, Poland
ID : 02/100/BK_21/0008

Commentaires et corrections

Type : CommentIn

Informations de copyright

© The authors 2021. Published by Oxford University Press on behalf of the Institute of Mathematics and its Applications. All rights reserved.

Auteurs

Dariusz R Augustyn (DR)

Silesian University of Technology, Department of Applied Informatics, Gliwice 44-100, Poland.

Łukasz Wyciślik (Ł)

Silesian University of Technology, Department of Applied Informatics, Gliwice 44-100, Poland.

Dariusz Mrozek (D)

Silesian University of Technology, Department of Applied Informatics, Gliwice 44-100, Poland.

Articles similaires

Vancomycin Polyesters Anti-Bacterial Agents Models, Theoretical Drug Liberation
1.00
Iran Environmental Monitoring Seasons Ecosystem Forests
Cities China Government Conservation of Natural Resources Humans
Models, Theoretical Computer Simulation Software Water Movements

Classifications MeSH