A powerful framework for an integrative study with heterogeneous omics data: from univariate statistics to multi-block analysis.

Arabidopsis thaliana abiotic stress integrative analysis statistical framework systems biology

Journal

Briefings in bioinformatics
ISSN: 1477-4054
Titre abrégé: Brief Bioinform
Pays: England
ID NLM: 100912837

Informations de publication

Date de publication:
20 05 2021
Historique:
received: 06 05 2020
revised: 23 06 2020
accepted: 03 07 2020
pubmed: 12 8 2020
medline: 18 11 2021
entrez: 12 8 2020
Statut: ppublish

Résumé

High-throughput data generated by new biotechnologies require specific and adapted statistical treatment in order to be efficiently used in biological studies. In this article, we propose a powerful framework to manage and analyse multi-omics heterogeneous data to carry out an integrative analysis. We have illustrated this using the mixOmics package for R software as it specifically addresses data integration issues. Our work also aims at applying the most recent functionalities of mixOmics to real datasets. Although multi-block integrative methodologies exist, we hope to encourage a more widespread use of such approaches in an operational framework by biologists. We have used natural populations of the model plant Arabidopsis thaliana in this work, but the framework proposed is not limited to this plant and can be deployed whatever the organisms of interest and the biological question may be. Four omics datasets (phenomics, metabolomics, cell wall proteomics and transcriptomics) were collected, analysed and integrated to study the cell wall plasticity of plants exposed to sub-optimal temperature growth conditions. The methodologies presented here start from basic univariate statistics leading to multi-block integration analysis. We have also highlighted the fact that each method, either unsupervised or supervised, is associated with one biological issue. Using this powerful framework enabled us to arrive at novel conclusions on the biological system, which would not have been possible using standard statistical approaches.

Identifiants

pubmed: 32778869
pii: 5890507
doi: 10.1093/bib/bbaa166
pii:
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Auteurs

Merwann Selmani (M)

Laboratoire de Recherche en Sciences Végétales and the Institut de Mathématiques de Toulouse.

Elisabeth Jamet (E)

CNRS and studies plant cell walls.

Christophe Dunand (C)

Toulouse University III-Paul Sabatier.

Sébastien Déjean (S)

Institut de Mathématiques, Toulouse University.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Coal Metagenome Phylogeny Bacteria Genome, Bacterial
Arabidopsis Arabidopsis Proteins Osmotic Pressure Cytoplasm RNA, Messenger

Classifications MeSH