BZINB model-based pathway analysis and module identification facilitates integration of microbiome and metabolome data.
Journal
bioRxiv : the preprint server for biology
Titre abrégé: bioRxiv
Pays: United States
ID NLM: 101680187
Informations de publication
Date de publication:
01 Feb 2023
01 Feb 2023
Historique:
entrez:
13
2
2023
pubmed:
14
2
2023
medline:
14
2
2023
Statut:
epublish
Résumé
Integration of multi-omics data is a challenging but necessary step to advance our understanding of the biology underlying human health and disease processes. To date, investigations seeking to integrate multi-omics (e.g., microbiome and metabolome) employ simple correlation-based network analyses; however, these methods are not always well-suited for microbiome analyses because they do not accommodate the excess zeros typically present in these data. In this paper, we introduce a bivariate zero-inflated negative binomial (BZINB) model-based network and module analysis method that addresses this limitation and improves microbiome-metabolome correlation-based model fitting by accommodating excess zeros. We use real and simulated data based on a multi-omics study of childhood oral health (ZOE 2.0; investigating early childhood dental disease, ECC) and find that the accuracy of the BZINB model-based correlation method is superior compared to Spearman’s rank and Pearson correlations in terms of approximating the underlying relationships between microbial taxa and metabolites. The new method, BZINB-iMMPath facilitates the construction of metabolite-species and species-species correlation networks using BZINB and identifies modules of (i.e., correlated) species by combining BZINB and similarity-based clustering. Perturbations in correlation networks and modules can be efficiently tested between groups (i.e., healthy and diseased study participants). Upon application of the new method in the ZOE 2.0 study microbiome-metabolome data, we identify that several biologically-relevant correlations of ECC-associated microbial taxa with carbohydrate metabolites differ between healthy and dental caries-affected participants. In sum, we find that the BZINB model is a useful alternative to Spearman or Pearson correlations for estimating the underlying correlation of zero-inflated bivariate count data and thus is suitable for integrative analyses of multi-omics data such as those encountered in microbiome and metabolome studies.
Identifiants
pubmed: 36778424
doi: 10.1101/2023.01.30.526301
pmc: PMC9915478
pii:
doi:
Types de publication
Preprint
Langues
eng
Commentaires et corrections
Type : UpdateIn
Références
Bioinformatics. 2020 Feb 15;36(4):1159-1166
pubmed: 31501851
J Bacteriol. 2015 Apr 27;197(3):2104-2111
pubmed: 25917902
Caries Res. 2013;47(2):89-102
pubmed: 23207320
J Dent Res. 2015 Dec;94(12):1628-37
pubmed: 26377570
Comput Struct Biotechnol J. 2020 Sep 10;18:2583-2595
pubmed: 33033579
J Dent Res. 2021 Jun;100(6):615-622
pubmed: 33423574
NPJ Syst Biol Appl. 2020 Jun 19;6(1):20
pubmed: 32561750
J Neurochem. 1995 Apr;64(4):1734-41
pubmed: 7891102
Genome Biol. 2019 Nov 28;20(1):257
pubmed: 31779668
Nucleic Acids Res. 2012 Sep 1;40(17):e133
pubmed: 22638577
J Clin Periodontol. 2017 Mar;44 Suppl 18:S23-S38
pubmed: 28266108
Int J Environ Res Public Health. 2020 Nov 01;17(21):
pubmed: 33139633
Front Cell Dev Biol. 2020 Oct 22;8:588041
pubmed: 33195248
Cell Rep Methods. 2021 Oct 25;1(6):100095
pubmed: 35474895
IUBMB Life. 2008 Sep;60(9):605-8
pubmed: 18506840
Anal Chem. 2009 Aug 15;81(16):6656-67
pubmed: 19624122
Nat Microbiol. 2019 Feb;4(2):293-305
pubmed: 30531976
Genet Epidemiol. 2021 Mar;45(2):142-153
pubmed: 32989764
Comput Biol Med. 2021 Nov;138:104933
pubmed: 34655897
Genome Res. 2003 Nov;13(11):2498-504
pubmed: 14597658
Mol Microbiol. 2007 Feb;63(3):872-80
pubmed: 17302806
Nat Commun. 2020 Mar 3;11(1):1169
pubmed: 32127540
Microb Cell. 2018 May 07;5(5):215-219
pubmed: 29796386
Brief Bioinform. 2022 May 13;23(3):
pubmed: 35325048
Nat Methods. 2018 Nov;15(11):962-968
pubmed: 30377376
J Bacteriol. 2010 Oct;192(19):5002-17
pubmed: 20656903
BMC Bioinformatics. 2008 Dec 29;9:559
pubmed: 19114008