Benchmarking Community-Wide Estimates of Growth Potential from Metagenomes Using Codon Usage Statistics.

codon usage bias growth rate metagenomics microbial ecology

Journal

mSystems
ISSN: 2379-5077
Titre abrégé: mSystems
Pays: United States
ID NLM: 101680636

Informations de publication

Date de publication:
26 10 2022
Historique:
pubmed: 4 10 2022
medline: 4 10 2022
entrez: 3 10 2022
Statut: ppublish

Résumé

Trait inference from mixed-species assemblages is a central problem in microbial ecology. Frequently, sequencing information from an environment is available, but phenotypic measurements from individual community members are not. With the increasing availability of molecular data for microbial communities, bioinformatic approaches that map metagenome to (meta)phenotype are needed. Recently, we developed a tool, gRodon, that enables the prediction of the maximum growth rate of an organism from genomic data on the basis of codon usage patterns. Our work and that of other groups suggest that such predictors can be applied to mixed-species communities in order to derive estimates of the average community-wide maximum growth rate. Here, we present an improved maximum growth rate predictor designed for metagenomes that corrects a persistent GC bias in the original gRodon model for metagenomic prediction. We benchmark this predictor with simulated metagenomic data sets to show that it has superior performance on mixed-species communities relative to earlier models. We go on to provide guidance on data preprocessing and show that calling genes from assembled contigs rather than directly from reads dramatically improves performance. Finally, we apply our predictor to large-scale metagenomic data sets from marine and human microbiomes to illustrate how community-wide growth prediction can be a powerful approach for hypothesis generation. Altogether, we provide an updated tool with clear guidelines for users about the uses and pitfalls of metagenomic prediction of the average community-wide maximal growth rate.

Identifiants

pubmed: 36190138
doi: 10.1128/msystems.00745-22
pmc: PMC9600850
doi:

Types de publication

Journal Article Research Support, U.S. Gov't, Non-P.H.S. Research Support, Non-U.S. Gov't

Langues

eng

Pagination

e0074522

Références

Nature. 2012 Jun 13;486(7402):207-14
pubmed: 22699609
Database (Oxford). 2020 Jan 1;2020:
pubmed: 32761142
Nat Microbiol. 2016 Sep 12;1(11):16160
pubmed: 27617693
BMC Bioinformatics. 2021 Jun 7;22(1):306
pubmed: 34098872
Proc Natl Acad Sci U S A. 2021 Mar 23;118(12):
pubmed: 33723043
Cell. 2019 Dec 12;179(7):1623-1635.e11
pubmed: 31835036
Science. 1997 May 2;276(5313):734-40
pubmed: 9115194
Nature. 2018 Aug;560(7717):233-237
pubmed: 30069051
Science. 2015 Nov 6;350(6261):aac9323
pubmed: 26542581
Sci Data. 2018 Sep 04;5:180176
pubmed: 30179232
Sci Data. 2020 Jun 5;7(1):170
pubmed: 32503990
Nucleic Acids Res. 1982 Nov 25;10(22):7055-74
pubmed: 6760125
Nucleic Acids Res. 2016 Aug 19;44(14):6614-24
pubmed: 27342282
Nature. 2017 Nov 23;551(7681):457-463
pubmed: 29088705
Microbiome. 2022 Feb 16;10(1):33
pubmed: 35172890
J Mol Biol. 1996 Aug 2;260(5):649-63
pubmed: 8709146
Bioinformatics. 2015 May 15;31(10):1674-6
pubmed: 25609793
Bioinformatics. 2019 Feb 1;35(3):521-522
pubmed: 30016412
BMC Bioinformatics. 2022 May 28;23(1):198
pubmed: 35643462
Nucleic Acids Res. 2000 Sep 15;28(18):3517-23
pubmed: 10982871
Trends Microbiol. 2019 Oct;27(10):814-823
pubmed: 31296406
ISME J. 2014 Aug;8(8):1553-65
pubmed: 24739623
Bioinformatics. 2018 Sep 1;34(17):i884-i890
pubmed: 30423086
Nat Biotechnol. 2020 Jun;38(6):685-688
pubmed: 32483366
Front Microbiol. 2014 May 27;5:251
pubmed: 24904563
Nat Biotechnol. 2019 Feb;37(2):179-185
pubmed: 30718868
Genome Res. 2018 Apr;28(4):569-580
pubmed: 29496730
BMC Bioinformatics. 2005 Jul 19;6:182
pubmed: 16029499
PLoS Genet. 2010 Jan 15;6(1):e1000808
pubmed: 20090831
Nucleic Acids Res. 2022 Jan 7;50(D1):D785-D794
pubmed: 34520557
J Mol Biol. 1981 Sep 25;151(3):389-409
pubmed: 6175758
Nucleic Acids Res. 2013 Oct;41(19):8842-52
pubmed: 23921637
PLoS Comput Biol. 2012;8(11):e1002779
pubmed: 23209389
Nucleic Acids Res. 2010 Nov;38(20):e191
pubmed: 20805240
Nucleic Acids Res. 2014 Jan;42(Database issue):D553-9
pubmed: 24316578
Science. 2008 May 23;320(5879):1039-43
pubmed: 18497288
Science. 2015 May 22;348(6237):1261359
pubmed: 25999513
Science. 2010 Nov 19;330(6007):1099-102
pubmed: 21097934
Science. 2021 Apr 16;372(6539):287-291
pubmed: 33859034
Bioinformatics. 2014 Jul 15;30(14):2068-9
pubmed: 24642063
BMC Bioinformatics. 2009 Dec 15;10:421
pubmed: 20003500
Ann Rev Mar Sci. 2016;8:285-309
pubmed: 26195108
Proc Natl Acad Sci U S A. 2018 May 22;115(21):E4940-E4949
pubmed: 29735666

Auteurs

J L Weissman (JL)

Department of Biological Sciences-Marine and Environmental Biology, University of Southern Californiagrid.42505.36, Los Angeles, California, USA.

Marie Peras (M)

Trace Genomics, Inc., Redwood City, California, USA.

Tyler P Barnum (TP)

Trace Genomics, Inc., Redwood City, California, USA.

Jed A Fuhrman (JA)

Department of Biological Sciences-Marine and Environmental Biology, University of Southern Californiagrid.42505.36, Los Angeles, California, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH