A unified strategy to rebalance multifactorial designs with unequal group sizes: application to analysis of variance multiblock orthogonal partial least squares.
AMOPLS
ANOVA
High-dimensional
Rebalancing
Supervised
Unbalanced experimental designs
Journal
Analytica chimica acta
ISSN: 1873-4324
Titre abrégé: Anal Chim Acta
Pays: Netherlands
ID NLM: 0370534
Informations de publication
Date de publication:
04 Jul 2023
04 Jul 2023
Historique:
received:
22
12
2022
revised:
19
04
2023
accepted:
25
04
2023
medline:
26
5
2023
pubmed:
25
5
2023
entrez:
24
5
2023
Statut:
ppublish
Résumé
Adequately handling unbalanced groups remains one of the major challenges for the analysis of multivariate data collected from multifactorial experimental designs. While partial least squares-based methods, such as analysis of variance multiblock orthogonal partial least squares (AMOPLS), can offer better discrimination between factor levels, they can be more heavily affected by this issue, and unbalanced designs of experiments may lead to a substantial confusion of the effects. Even state-of-the-art analysis of variance (ANOVA) decomposition methodologies using general linear models (GLM) lack the ability to efficiently disentangle these sources of variation when combined with AMOPLS. A versatile solution developed as an extension of a prior rebalancing strategy is proposed for the first decomposition step based on ANOVA. This approach has the advantage of yielding an unbiased estimation of the parameters and retaining the within-group variation in the rebalanced design, while preserving the orthogonality of effect matrices, even in presence of unequal group sizes. This property is of utmost importance for model interpretation because it avoids mixing sources of variation related to the different effects in the design. A real case study involving metabolomic data from in vitro toxicological experiments was used to demonstrate the potential of this strategy to handle unequal group sizes using a supervised approach. Primary 3D rat neural cell cultures were exposed to trimethyltin following a multifactorial design of experiments involving three fixed effect factors. The rebalancing strategy was demonstrated as a novel and potent solution to handle unbalanced experimental designs by offering unbiased parameter estimators and orthogonal submatrices, thus avoiding confusion of the effects and facilitating model interpretation. Moreover, it can be combined with any multivariate method used for the analysis of high-dimensional data collected from multifactorial designs.
Sections du résumé
BACKGROUND
BACKGROUND
Adequately handling unbalanced groups remains one of the major challenges for the analysis of multivariate data collected from multifactorial experimental designs. While partial least squares-based methods, such as analysis of variance multiblock orthogonal partial least squares (AMOPLS), can offer better discrimination between factor levels, they can be more heavily affected by this issue, and unbalanced designs of experiments may lead to a substantial confusion of the effects. Even state-of-the-art analysis of variance (ANOVA) decomposition methodologies using general linear models (GLM) lack the ability to efficiently disentangle these sources of variation when combined with AMOPLS.
RESULTS
RESULTS
A versatile solution developed as an extension of a prior rebalancing strategy is proposed for the first decomposition step based on ANOVA. This approach has the advantage of yielding an unbiased estimation of the parameters and retaining the within-group variation in the rebalanced design, while preserving the orthogonality of effect matrices, even in presence of unequal group sizes. This property is of utmost importance for model interpretation because it avoids mixing sources of variation related to the different effects in the design. A real case study involving metabolomic data from in vitro toxicological experiments was used to demonstrate the potential of this strategy to handle unequal group sizes using a supervised approach. Primary 3D rat neural cell cultures were exposed to trimethyltin following a multifactorial design of experiments involving three fixed effect factors.
SIGNIFICANCE AND NOVELTY
UNASSIGNED
The rebalancing strategy was demonstrated as a novel and potent solution to handle unbalanced experimental designs by offering unbiased parameter estimators and orthogonal submatrices, thus avoiding confusion of the effects and facilitating model interpretation. Moreover, it can be combined with any multivariate method used for the analysis of high-dimensional data collected from multifactorial designs.
Identifiants
pubmed: 37225336
pii: S0003-2670(23)00505-6
doi: 10.1016/j.aca.2023.341284
pii:
doi:
Substances chimiques
Sulfadiazine
0N7609K889
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
341284Informations de copyright
Copyright © 2023. Published by Elsevier B.V.
Déclaration de conflit d'intérêts
Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.