Network principal component analysis: a versatile tool for the investigation of multigroup and multiblock datasets.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
09 06 2021
Historique:
received: 26 05 2020
revised: 09 10 2020
accepted: 30 10 2020
pubmed: 10 11 2020
medline: 1 7 2021
entrez: 9 11 2020
Statut: ppublish

Résumé

Complex data structures composed of different groups of observations and blocks of variables are increasingly collected in many domains, including metabolomics. Analysing these high-dimensional data constitutes a challenge, and the objective of this article is to present an original multivariate method capable of explicitly taking into account links between data tables when they involve the same observations and/or variables. For that purpose, an extension of standard principal component analysis called NetPCA was developed. The proposed algorithm was illustrated as an efficient solution for addressing complex multigroup and multiblock datasets. A case study involving the analysis of metabolomic data with different annotation levels and originating from a chronic kidney disease (CKD) study was used to highlight the different aspects and the additional outputs of the method compared to standard PCA. On the one hand, the model parameters allowed an efficient evaluation of each group's influence to be performed. On the other hand, the relative relevance of each block of variables to the model provided decisive information for an objective interpretation of the different metabolic annotation levels. NetPCA is available as a Python package with NumPy dependencies. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 33165510
pii: 5962088
doi: 10.1093/bioinformatics/btaa954
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1297-1303

Informations de copyright

© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Auteurs

Santiago Codesido (S)

School of Pharmaceutical Sciences, University of Geneva, 1211 Geneva, Switzerland.
Institute of Pharmaceutical Sciences of Western Switzerland, University of Geneva, 1211 Geneva, Switzerland.

Mohamed Hanafi (M)

Unité Statistique, Sensométrie et Chimiométrie, Oniris, 44322 Nantes, France.

Yoric Gagnebin (Y)

School of Pharmaceutical Sciences, University of Geneva, 1211 Geneva, Switzerland.
Institute of Pharmaceutical Sciences of Western Switzerland, University of Geneva, 1211 Geneva, Switzerland.

Víctor González-Ruiz (V)

School of Pharmaceutical Sciences, University of Geneva, 1211 Geneva, Switzerland.
Institute of Pharmaceutical Sciences of Western Switzerland, University of Geneva, 1211 Geneva, Switzerland.

Serge Rudaz (S)

School of Pharmaceutical Sciences, University of Geneva, 1211 Geneva, Switzerland.
Institute of Pharmaceutical Sciences of Western Switzerland, University of Geneva, 1211 Geneva, Switzerland.

Julien Boccard (J)

School of Pharmaceutical Sciences, University of Geneva, 1211 Geneva, Switzerland.
Institute of Pharmaceutical Sciences of Western Switzerland, University of Geneva, 1211 Geneva, Switzerland.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
1.00
Humans Magnetic Resonance Imaging Brain Infant, Newborn Infant, Premature
Cephalometry Humans Anatomic Landmarks Software Internet

Classifications MeSH