Improved Metabolite Prediction Using Microbiome Data-Based Elastic Net Models.


Journal

Frontiers in cellular and infection microbiology
ISSN: 2235-2988
Titre abrégé: Front Cell Infect Microbiol
Pays: Switzerland
ID NLM: 101585359

Informations de publication

Date de publication:
2021
Historique:
received: 01 07 2021
accepted: 22 09 2021
entrez: 11 11 2021
pubmed: 12 11 2021
medline: 25 11 2021
Statut: epublish

Résumé

Microbiome data are becoming increasingly available in large health cohorts, yet metabolomics data are still scant. While many studies generate microbiome data, they lack matched metabolomics data or have considerable missing proportions of metabolites. Since metabolomics is key to understanding microbial and general biological activities, the possibility of imputing individual metabolites or inferring metabolomics pathways from microbial taxonomy or metagenomics is intriguing. Importantly, current metabolomics profiling methods such as the HMP Unified Metabolic Analysis Network (HUMAnN) have unknown accuracy and are limited in their ability to predict individual metabolites. To address this gap, we developed a novel metabolite prediction method, and we present its application and evaluation in an oral microbiome study. The new method for predicting metabolites using microbiome data (ENVIM) is based on the elastic net model (ENM). ENVIM introduces an extra step to ENM to consider variable importance (VI) scores, and thus, achieves better prediction power. We investigate the metabolite prediction performance of ENVIM using metagenomic and metatranscriptomic data in a supragingival biofilm multi-omics dataset of 289 children ages 3-5 who were participants of a community-based study of early childhood oral health (ZOE 2.0) in North Carolina, United States. We further validate ENVIM in two additional publicly available multi-omics datasets generated from studies of gut health. We select gene family sets based on variable importance scores and modify the existing ENM strategy used in the MelonnPan prediction software to accommodate the unique features of microbiome and metabolome data. We evaluate metagenomic and metatranscriptomic predictors and compare the prediction performance of ENVIM to the standard ENM employed in MelonnPan. The newly developed ENVIM method showed superior metabolite predictive accuracy than MelonnPan when trained with metatranscriptomics data only, metagenomics data only, or both. Better metabolite prediction is achieved in the gut microbiome compared with the oral microbiome setting. We report the best-predictable compounds in all these three datasets from two different body sites. For example, the metabolites trehalose, maltose, stachyose, and ribose are all well predicted by the supragingival microbiome.

Identifiants

pubmed: 34760716
doi: 10.3389/fcimb.2021.734416
pmc: PMC8573316
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Pagination

734416

Subventions

Organisme : NIDCR NIH HHS
ID : R03 DE028983
Pays : United States
Organisme : NLM NIH HHS
ID : T15 LM012500
Pays : United States
Organisme : NCI NIH HHS
ID : P30 CA016059
Pays : United States
Organisme : NIDCR NIH HHS
ID : U01 DE025046
Pays : United States

Informations de copyright

Copyright © 2021 Xie, Cho, Lin, Pillai, Heimisdottir, Bandyopadhyay, Zou, Roach, Divaris and Wu.

Déclaration de conflit d'intérêts

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Références

J Stat Softw. 2010;33(1):1-22
pubmed: 20808728
Nat Med. 2019 Apr;25(4):667-678
pubmed: 30936548
Anal Chem. 2009 Aug 15;81(16):6656-67
pubmed: 19624122
Nature. 2019 May;569(7758):655-662
pubmed: 31142855
Nat Microbiol. 2019 Feb;4(2):293-305
pubmed: 30531976
J Dent Res. 2021 Jun;100(6):615-622
pubmed: 33423574
Sci Rep. 2018 Jan 12;8(1):663
pubmed: 29330539
Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50
pubmed: 16199517
J Public Health Dent. 2020 Mar;80 Suppl 1:S31-S36
pubmed: 30566750
Methods Mol Biol. 2019;1922:525-548
pubmed: 30838598
Bioinformatics. 2007 May 15;23(10):1294-6
pubmed: 17384015
Nat Commun. 2019 Jul 17;10(1):3136
pubmed: 31316056
Nat Med. 2021 May;27(5):766-774
pubmed: 33820996
Nat Genet. 2019 Apr;51(4):600-605
pubmed: 30778224
Nat Rev Endocrinol. 2019 May;15(5):261-273
pubmed: 30670819
mSystems. 2016 Jan-Feb;1(1):
pubmed: 27239563
Cell Host Microbe. 2020 Aug 12;28(2):245-257.e6
pubmed: 32544460
Int J Environ Res Public Health. 2020 Nov 01;17(21):
pubmed: 33139633
Nucleic Acids Res. 2014 Jan;42(Database issue):D459-71
pubmed: 24225315
Ann Epidemiol. 2016 May;26(5):330-5
pubmed: 27255738
Methods Mol Biol. 2019;1922:511-523
pubmed: 30838597
Nat Methods. 2018 Nov;15(11):962-968
pubmed: 30377376

Auteurs

Jialiu Xie (J)

Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States.

Hunyong Cho (H)

Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States.

Bridget M Lin (BM)

Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States.

Malvika Pillai (M)

Carolina Health Informatics Program, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States.

Lara H Heimisdottir (LH)

Division of Pediatric and Public Health, Adams School of Dentistry, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States.

Dipankar Bandyopadhyay (D)

Department of Biostatistics, School of Medicine, Virginia Commonwealth University, Richmond, VA, United States.

Fei Zou (F)

Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States.
Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States.

Jeffrey Roach (J)

Research Computing, University of North Carolina, Chapel Hill, NC, United States.

Kimon Divaris (K)

Division of Pediatric and Public Health, Adams School of Dentistry, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States.
Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States.

Di Wu (D)

Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States.
Division of Oral and Craniofacial Health Research, Adams School of Dentistry, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH