Standardized multi-omics of Earth's microbiomes reveals microbial and metabolite diversity.
Journal
Nature microbiology
ISSN: 2058-5276
Titre abrégé: Nat Microbiol
Pays: England
ID NLM: 101674869
Informations de publication
Date de publication:
12 2022
12 2022
Historique:
received:
03
11
2021
accepted:
10
10
2022
pubmed:
29
11
2022
medline:
3
12
2022
entrez:
28
11
2022
Statut:
ppublish
Résumé
Despite advances in sequencing, lack of standardization makes comparisons across studies challenging and hampers insights into the structure and function of microbial communities across multiple habitats on a planetary scale. Here we present a multi-omics analysis of a diverse set of 880 microbial community samples collected for the Earth Microbiome Project. We include amplicon (16S, 18S, ITS) and shotgun metagenomic sequence data, and untargeted metabolomics data (liquid chromatography-tandem mass spectrometry and gas chromatography mass spectrometry). We used standardized protocols and analytical methods to characterize microbial communities, focusing on relationships and co-occurrences of microbially related metabolites and microbial taxa across environments, thus allowing us to explore diversity at extraordinary scale. In addition to a reference database for metagenomic and metabolomic data, we provide a framework for incorporating additional studies, enabling the expansion of existing knowledge in the form of an evolving community resource. We demonstrate the utility of this database by testing the hypothesis that every microbe and metabolite is everywhere but the environment selects. Our results show that metabolite diversity exhibits turnover and nestedness related to both microbial communities and the environment, whereas the relative abundances of microbially related metabolites vary and co-occur with specific microbial consortia in a habitat-specific manner. We additionally show the power of certain chemistry, in particular terpenoids, in distinguishing Earth's environments (for example, terrestrial plant surfaces and soils, freshwater and marine animal stool), as well as that of certain microbes including Conexibacter woesei (terrestrial soils), Haloquadratum walsbyi (marine deposits) and Pantoea dispersa (terrestrial plant detritus). This Resource provides insight into the taxa and metabolites within microbial communities from diverse habitats across Earth, informing both microbial and chemical ecology, and provides a foundation and methods for multi-omics microbiome studies of hosts and the environment.
Identifiants
pubmed: 36443458
doi: 10.1038/s41564-022-01266-x
pii: 10.1038/s41564-022-01266-x
pmc: PMC9712116
doi:
Substances chimiques
Soil
0
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, Non-P.H.S.
Langues
eng
Sous-ensembles de citation
IM
Pagination
2128-2150Subventions
Organisme : NCCIH NIH HHS
ID : DP1 AT010885
Pays : United States
Organisme : NIGMS NIH HHS
ID : K12 GM068524
Pays : United States
Investigateurs
Lars T Angenant
(LT)
Alison M Berry
(AM)
Leonora S Bittleston
(LS)
Jennifer L Bowen
(JL)
Max Chavarría
(M)
Don A Cowan
(DA)
Dan Distel
(D)
Peter R Girguis
(PR)
Jaime Huerta-Cepas
(J)
Paul R Jensen
(PR)
Lingjing Jiang
(L)
Gary M King
(GM)
Anton Lavrinienko
(A)
Aurora MacRae-Crerar
(A)
Thulani P Makhalanyane
(TP)
Tapio Mappes
(T)
Ezequiel M Marzinelli
(EM)
Gregory Mayer
(G)
Katherine D McMahon
(KD)
Jessica L Metcalf
(JL)
Sou Miyake
(S)
Timothy A Mousseau
(TA)
Catalina Murillo-Cruz
(C)
David Myrold
(D)
Brian Palenik
(B)
Adrián A Pinto-Tomás
(AA)
Dorota L Porazinska
(DL)
Jean-Baptiste Ramond
(JB)
Forest Rowher
(F)
Taniya RoyChowdhury
(T)
Stuart A Sandin
(SA)
Steven K Schmidt
(SK)
Henning Seedorf
(H)
Ashley Shade
(A)
J Reuben Shipway
(JR)
Jennifer E Smith
(JE)
James Stegen
(J)
Frank J Stewart
(FJ)
Karen Tait
(K)
Torsten Thomas
(T)
Yael Tucker
(Y)
Jana M U'Ren
(JM)
Phillip C Watts
(PC)
Nicole S Webster
(NS)
Jesse R Zaneveld
(JR)
Shan Zhang
(S)
Informations de copyright
© 2022. The Author(s).
Références
Thompson, L. R. et al. A communal catalogue reveals Earth’s multiscale microbial diversity. Nature 551, 457–463 (2017).
pubmed: 29088705
pmcid: 6192678
doi: 10.1038/nature24621
Knight, R. et al. Best practices for analysing microbiomes. Nat. Rev. Microbiol. 16, 410–422 (2018).
pubmed: 29795328
doi: 10.1038/s41579-018-0029-9
Proctor, L. M. et al. The Integrative Human Microbiome Project. Nature 569, 641–648 (2019).
doi: 10.1038/s41586-019-1238-8
Vangay, P. et al. Microbiome metadata standards: report of the National Microbiome Data Collaborative’s workshop and follow-on activities. mSystems 6, e01194-20 (2021).
pubmed: 33622857
pmcid: 8573954
Lozupone, C. A. & Knight, R. Global patterns in bacterial diversity. Proc. Natl Acad. Sci. USA 104, 11436–11440 (2007).
pubmed: 17592124
pmcid: 2040916
doi: 10.1073/pnas.0611525104
Quince, C. et al. Shotgun metagenomics, from sampling to analysis. Nat. Biotechnol. 35, 833–844 (2017).
pubmed: 28898207
doi: 10.1038/nbt.3935
Franzosa, E. A. et al. Species-level functional profiling of metagenomes and metatranscriptomes. Nat. Methods 15, 962–968 (2018).
pubmed: 30377376
pmcid: 6235447
doi: 10.1038/s41592-018-0176-y
Blin, K. et al. antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline. Nucleic Acids Res. 47, W81–W87 (2019).
pubmed: 31032519
pmcid: 6602434
doi: 10.1093/nar/gkz310
Ziemert, N., Alanjary, M. & Weber, T. The evolution of genome mining in microbes - a review. Nat. Prod. Rep. 33, 988–1005 (2016).
pubmed: 27272205
doi: 10.1039/C6NP00025H
Dinsdale, E. A. et al. Functional metagenomic profiling of nine biomes. Nature 452, 629–632 (2008).
pubmed: 18337718
doi: 10.1038/nature06810
Louca, S. et al. Decoupling function and taxonomy in the global ocean microbiome. Science 353, 1272–1277 (2016).
pubmed: 27634532
doi: 10.1126/science.aaf4507
Lloyd-Price, J. et al. Strains, functions and dynamics in the expanded Human Microbiome Project. Nature 550, 61–66 (2017).
pubmed: 28953883
pmcid: 5831082
doi: 10.1038/nature23889
Libis, V. et al. Uncovering the biosynthetic potential of rare metagenomic DNA using co-occurrence network analysis of targeted sequences. Nat. Commun. 10, 3848 (2019).
pubmed: 31451725
pmcid: 6710260
doi: 10.1038/s41467-019-11658-z
Nayfach, S. et al. A genomic catalog of Earth’s microbiomes. Nat. Biotechnol. https://doi.org/10.1038/s41587-020-0718-6 (2020).
Kleiner, M. et al. Metaproteomics of a gutless marine worm and its symbiotic microbial community reveal unusual pathways for carbon and energy use. Proc. Natl Acad. Sci. USA 109, E1173–E1182 (2012).
pubmed: 22517752
pmcid: 3358896
doi: 10.1073/pnas.1121198109
Vogel, C. & Marcotte, E. M. Insights into the regulation of protein abundance from proteomic and transcriptomic analyses. Nat. Rev. Genet. 13, 227–232 (2012).
pubmed: 22411467
pmcid: 3654667
doi: 10.1038/nrg3185
Hultman, J. et al. Multi-omics of permafrost, active layer and thermokarst bog soil microbiomes. Nature 521, 208–212 (2015).
pubmed: 25739499
doi: 10.1038/nature14238
Amos, G. C. A. et al. Comparative transcriptomics as a guide to natural product discovery and biosynthetic gene cluster functionality. Proc. Natl Acad. Sci. USA 114, E11121–E11130 (2017).
pubmed: 29229817
pmcid: 5748202
doi: 10.1073/pnas.1714381115
Aksenov, A. A., da Silva, R., Knight, R., Lopes, N. P. & Dorrestein, P. C. Global chemical analysis of biology by mass spectrometry. Nat. Rev. Chem. 1, 0054 (2017).
doi: 10.1038/s41570-017-0054
Kesnerová, L. et al. Disentangling metabolic functions of bacteria in the honey bee gut. PLoS Biol. 15, e2003467 (2017).
pubmed: 29232373
pmcid: 5726620
doi: 10.1371/journal.pbio.2003467
Williams, A. et al. Metabolomic shifts associated with heat stress in coral holobionts. Sci. Adv. 7, eabd4210 (2021).
pubmed: 33523848
pmcid: 7775768
doi: 10.1126/sciadv.abd4210
Muller, E. et al. A meta-analysis study of the robustness and universality of gut microbiome-metabolome associations. Microbiome 9, 203 (2021).
pubmed: 34641974
pmcid: 8507343
doi: 10.1186/s40168-021-01149-z
Santoro, E. P. et al. Coral microbiome manipulation elicits metabolic and genetic restructuring to mitigate heat stress and evade mortality. Sci. Adv. 7, eabg3088 (2021).
pubmed: 34389536
pmcid: 8363143
doi: 10.1126/sciadv.abg3088
Xu, L. et al. Genome-resolved metagenomics reveals role of iron metabolism in drought-induced rhizosphere microbiome dynamics. Nat. Commun. 12, 3209 (2021).
pubmed: 34050180
pmcid: 8163885
doi: 10.1038/s41467-021-23553-7
Davies, D. G. et al. The involvement of cell-to-cell signals in the development of a bacterial biofilm. Science 280, 295–298 (1998).
pubmed: 9535661
doi: 10.1126/science.280.5361.295
Hibbing, M. E. et al. Bacterial competition: surviving and thriving in a microbial jungle. Nat. Rev. Microbiol. 8, 15–25 (2010).
pubmed: 19946288
pmcid: 2879262
doi: 10.1038/nrmicro2259
Davies, J. Specialized microbial metabolites: functions and origins. J. Antibiot. 66, 361–364 (2013).
doi: 10.1038/ja.2013.61
Gunatilaka, A. A. L. Natural products from plant-associated microorganisms: distribution, structural diversity, bioactivity, and implications of their occurrence. J. Nat. Prod. 69, 509–526 (2006).
pubmed: 16562864
pmcid: 3362121
doi: 10.1021/np058128n
Kelly, C. R. et al. Fecal microbiota transplant for treatment of Clostridium difficile infection in immunocompromised patients. Am. J. Gastroenterol. 109, 1065–1071 (2014).
pubmed: 24890442
pmcid: 5537742
doi: 10.1038/ajg.2014.133
Louis, P. et al. The gut microbiota, bacterial metabolites and colorectal cancer. Nat. Rev. Microbiol. 12, 661–672 (2014).
pubmed: 25198138
doi: 10.1038/nrmicro3344
Bell, T. H. et al. A diverse soil microbiome degrades more crude oil than specialized bacterial assemblages obtained in culture. Appl. Environ. Microbiol. 82, 5530–5541 (2016).
pubmed: 27371586
pmcid: 5007768
doi: 10.1128/AEM.01327-16
Bokulich, N. A. et al. Antibiotics, birth mode, and diet shape microbiome maturation during early life. Sci. Transl. Med 8, 343ra82 (2016).
pubmed: 27306664
pmcid: 5308924
doi: 10.1126/scitranslmed.aad7121
Tang, W. H. W. et al. Gut microbiota in cardiovascular health and disease. Circ. Res. 120, 1183–1196 (2017).
pubmed: 28360349
pmcid: 5390330
doi: 10.1161/CIRCRESAHA.117.309715
Pham, J. V. et al. A review of the microbial production of bioactive natural products and biologics. Front. Microbiol. 10, 1404 (2019).
pubmed: 31281299
pmcid: 6596283
doi: 10.3389/fmicb.2019.01404
Xue, M.-Y. et al. Multi-omics reveals that the rumen microbiome and its metabolome together with the host metabolome contribute to individualized dairy cow performance. Microbiome 8, 64 (2020).
pubmed: 32398126
pmcid: 7218573
doi: 10.1186/s40168-020-00819-8
Hong, Y. et al. Integrated metagenomic and metabolomic analysis of the effect of Astragalus polysaccharides on alleviating high-fat diet-induced metabolic disorders. Front. Pharmacol. 11, 833 (2020).
pubmed: 32587515
pmcid: 7299173
doi: 10.3389/fphar.2020.00833
Ye, X. et al. Effect of host breeds on gut microbiome and serum metabolome in meat rabbits. BMC Vet. Res. 17, 24 (2021).
pubmed: 33413361
pmcid: 7791989
doi: 10.1186/s12917-020-02732-6
Mohanty, I. et al. Multi-omic profiling of Melophlus sponges reveals diverse metabolomic and microbiome architectures that are non-overlapping with ecological neighbors. Mar. Drugs 18, 124 (2020).
pubmed: 32092934
pmcid: 7074536
doi: 10.3390/md18020124
Ganugi, P. et al. Nitrogen use efficiency, rhizosphere bacterial community, and root metabolome reprogramming due to maize seed treatment with microbial biostimulants. Physiol. Plant. 174, e13679 (2022).
pubmed: 35362106
pmcid: 9324912
doi: 10.1111/ppl.13679
Turroni, S. et al. Fecal metabolome of the Hadza hunter-gatherers: a host-microbiome integrative view. Sci. Rep. 6, 32826 (2016).
pubmed: 27624970
pmcid: 5021991
doi: 10.1038/srep32826
Dührkop, K. et al. Systematic classification of unknown metabolites using high-resolution fragmentation mass spectra. Nat. Biotechnol. https://doi.org/10.1038/s41587-020-0740-8 (2020).
Baas Becking, L. G. M. Geobiologie of Inleiding Tot De Milieukunde (in Dutch) (W. P. Van Stockum & Zoon, 1934).
de Wit, R. & Bouvier, T. ‘Everything is everywhere but the environment selects’; what did Baas Becking and Beijerinck really say? Environ. Microbiol. 8, 755–758 (2006).
pubmed: 16584487
doi: 10.1111/j.1462-2920.2006.01017.x
Martiny, J. B. H. et al. Microbial biogeography: putting microorganisms on the map. Nat. Rev. Microbiol. 4, 102–112 (2006).
pubmed: 16415926
doi: 10.1038/nrmicro1341
O’Malley, M. A. ‘Everything is everywhere: but the environment selects’: ubiquitous distribution and ecological determinism in microbial biogeography. Stud. Hist. Phil. Biol. Biomed. Sci. 39, 314–325 (2008).
doi: 10.1016/j.shpsc.2008.06.005
Fondi, M. et al. “Every Gene Is Everywhere but the Environment Selects”: global geolocalization of gene sharing in environmental samples through network analysis. Genome Biol. Evol. 8, 1388–1400 (2016).
pubmed: 27190206
pmcid: 4898794
doi: 10.1093/gbe/evw077
Allison, S. D. & Martiny, J. B. H. Resistance, resilience, and redundancy in microbial communities. Proc. Natl Acad. Sci. USA 105, 11512–11519 (2008).
pubmed: 18695234
pmcid: 2556421
doi: 10.1073/pnas.0801925105
Louca, S. et al. Function and functional redundancy in microbial communities. Nat. Ecol. Evol. 2, 936–943 (2018).
pubmed: 29662222
doi: 10.1038/s41559-018-0519-1
Barnes, E. M. et al. Predicting microbiome function across space is confounded by strain-level differences and functional redundancy across taxa. Front. Microbiol. 11, 101 (2020).
pubmed: 32117131
pmcid: 7018939
doi: 10.3389/fmicb.2020.00101
Thompson, L. et al. EMP sample submission guide v1. https://doi.org/10.17504/protocols.io.pfqdjmw (2018).
Gonzalez, A. et al. Qiita: rapid, web-enabled microbiome meta-analysis. Nat. Methods 15, 796–798 (2018).
pubmed: 30275573
pmcid: 6235622
doi: 10.1038/s41592-018-0141-9
Wang, M. et al. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat. Biotechnol. 34, 828–837 (2016).
pubmed: 27504778
pmcid: 5321674
doi: 10.1038/nbt.3597
Carvalho, J. C. et al. Measuring fractions of beta diversity and their relationships to nestedness: a theoretical and empirical comparison of novel approaches. Oikos 122, 825–834 (2013).
doi: 10.1111/j.1600-0706.2012.20980.x
Monciardini, P. et al. Conexibacter woesei gen. nov. sp. nov., a novel representative of a deep evolutionary line of descent within the class Actinobacteria. Int. J. Syst. Evol. Microbiol. 53, 569–576 (2003).
pubmed: 12710628
doi: 10.1099/ijs.0.02400-0
Sharma, M. P. et al. Deciphering the role of trehalose in tripartite symbiosis among rhizobia, arbuscular mycorrhizal fungi, and legumes for enhancing abiotic stress tolerance in crop plants. Front. Microbiol. 11, 509919 (2020).
pubmed: 33042042
pmcid: 7527417
doi: 10.3389/fmicb.2020.509919
Morton, J. T. et al. Learning representations of microbe-metabolite interactions. Nat. Methods 16, 1306–1314 (2019).
pubmed: 31686038
pmcid: 6884698
doi: 10.1038/s41592-019-0616-3
Antwis, R. E. Fifty important research questions in microbial ecology. FEMS Microbiol. Ecol. 93, fix044 (2017).
doi: 10.1093/femsec/fix044
Avalos, M. et al. Biosynthesis, evolution and ecology of microbial terpenoids. Nat. Prod. Rep. 39, 249 (2022).
pubmed: 34612321
doi: 10.1039/D1NP00047K
Nayfach, S. et al. New insights from uncultivated genomes of the global human gut microbiome. Nature 568, 505–510 (2019).
pubmed: 30867587
pmcid: 6784871
doi: 10.1038/s41586-019-1058-x
Fierer, N. et al. Cross-biome metagenomic analysis of soil microbial communities and their functional attributes. Proc. Natl Acad. Sci. USA 109, 21390–21395 (2012).
pubmed: 23236140
pmcid: 3535587
doi: 10.1073/pnas.1215210110
Bahram, M. et al. Structure and function of the global topsoil microbiome. Nature 560, 233–237 (2018).
pubmed: 30069051
doi: 10.1038/s41586-018-0386-6
Sunagawa, S. et al. Structure and function of the global ocean microbiome. Science 348, 1261359 (2015).
pubmed: 25999513
doi: 10.1126/science.1261359
Erickson, A. R. et al. Integrated metagenomics/metaproteomics reveals human host-microbiota signatures of Crohn’s disease. PLoS ONE 7, e49138 (2012).
pubmed: 23209564
pmcid: 3509130
doi: 10.1371/journal.pone.0049138
Lim, Y. W. et al. Metagenomics and metatranscriptomics: windows on CF-associated viral and microbial communities. J. Cyst. Fibros. 12, 154–164 (2013).
pubmed: 22951208
doi: 10.1016/j.jcf.2012.07.009
Williams, T. J. et al. The role of planktonic Flavobacteria in processing organic matter in coastal East Antarctica revealed using metagenomics and metaproteomics. Environ. Microbiol. 15, 1302–1317 (2013).
pubmed: 23126454
doi: 10.1111/1462-2920.12017
Leary, D. H. et al. Integrated metagenomic and metaproteomic analyses of marine biofilm communities. Biofueling 30, 1211–1223 (2014).
doi: 10.1080/08927014.2014.977267
Lu, K. et al. Arsenic exposure perturbs the gut microbiome and its metabolic profile in mice: an integrated metagenomics and metabolomics analysis. Environ. Health Perspect. 122, 284–291 (2014).
pubmed: 24413286
pmcid: 3948040
doi: 10.1289/ehp.1307429
Bikel, S. et al. Combining metagenomics, metatranscriptomics and viromics to explore novel microbial interactions: towards a system-level understanding of human microbiome. Comput. Struct. Biotechnol. J. 13, 390–401 (2015).
pubmed: 26137199
pmcid: 4484546
doi: 10.1016/j.csbj.2015.06.001
Califf, K. J. et al. Multi-omics analysis of periodontal pocket microbial communities pre- and posttreatment. mSystems 2, e00016–e00017 (2017).
pubmed: 28744486
pmcid: 5513737
doi: 10.1128/mSystems.00016-17
Schirmer, M. et al. Dynamics of metatranscription in the inflammatory bowel disease gut microbiome. Nat. Microbiol. 3, 337–346 (2018).
pubmed: 29311644
pmcid: 6131705
doi: 10.1038/s41564-017-0089-z
Lloyd-Price, J. et al. Multi-omics of the gut microbial ecosystem of inflammatory bowel diseases. Nature 569, 655–662 (2019).
pubmed: 31142855
pmcid: 6650278
doi: 10.1038/s41586-019-1237-9
Xu, L. et al. Genome-resolved metagenomes reveals role of iron metabolism in drought-induced rhizosphere microbiome dynamics. Nat. Commun. 12, 3209 (2021).
pubmed: 34050180
pmcid: 8163885
doi: 10.1038/s41467-021-23553-7
Garza, D. R. et al. Towards predicting the environmental metabolome from metagenomics with a mechanistic model. Nat. Microbiol 3, 456–460 (2018).
pubmed: 29531366
doi: 10.1038/s41564-018-0124-8
Yilmaz, P. et al. Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications. Nat. Biotechnol. 29, 415–420 (2011).
pubmed: 21552244
pmcid: 3367316
doi: 10.1038/nbt.1823
Chambers, M. C. et al. A cross-platform toolkit for mass spectrometry and proteomics. Nat. Biotechnol. 30, 918–920 (2012).
pubmed: 23051804
pmcid: 3471674
doi: 10.1038/nbt.2377
Pluskal, T., Castillo, S., Villar-Briones, A. & Oresic, M. MZmine 2: modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data. BMC Bioinformatics 11, 395 (2010).
pubmed: 20650010
pmcid: 2918584
doi: 10.1186/1471-2105-11-395
Schmid, R. et al. Ion identity molecular networking in the GNPS environment. Nat. Commun. https://doi.org/10.1038/s41467-021-23953-9 (2020).
Du, X., Smirnov, A., Pluskal, T., Jia, W. & Sumner, S. Metabolomics data preprocessing using ADAP and MZmine 2. Methods Mol. Biol. 2104, 25–48 (2020).
pubmed: 31953811
pmcid: 8359540
doi: 10.1007/978-1-0716-0239-3_3
Nothias, L.-F. et al. Feature-based molecular networking in the GNPS analysis environment. Nat. Methods 17, 905–908 (2020).
pubmed: 32839597
pmcid: 7885687
doi: 10.1038/s41592-020-0933-6
Mohimani, H. et al. Dereplication of peptidic natural products through database search of mass spectra. Nat. Chem. Biol. 13, 30–37 (2017).
pubmed: 27820803
doi: 10.1038/nchembio.2219
Mohimani, H. et al. Dereplication of microbial metabolites through database search of mass spectra. Nat. Commun. 9, 4035 (2018).
pubmed: 30279420
pmcid: 6168521
doi: 10.1038/s41467-018-06082-8
Dührkop, K. et al. SIRIUS 4: a rapid tool for turning tandem mass spectra into metabolite structure information. Nat. Methods 16, 299–302 (2019).
pubmed: 30886413
doi: 10.1038/s41592-019-0344-8
Böcker, S., Letzel, M. C., Lipták, Z. & Pervukhin, A. SIRIUS: decomposing isotope patterns for metabolite identification. Bioinformatics 25, 218–224 (2009).
pubmed: 19015140
doi: 10.1093/bioinformatics/btn603
Böcker, S. & Dührkop, K. Fragmentation trees reloaded. J. Cheminformormatics 8, 5 (2016).
doi: 10.1186/s13321-016-0116-8
Ludwig, M. et al. Database-independent molecular formula annotation using Gibbs sampling through ZODIAC. Nat. Mach. Intell. 2, 629–641 (2020).
doi: 10.1038/s42256-020-00234-6
Dührkop, K., Shen, H., Meusel, M., Rousu, J. & Böcker, S. Searching molecular structure databases with tandem mass spectra using CSI:FingerID. Proc. Natl Acad. Sci. USA 112, 12580–12585 (2015).
pubmed: 26392543
pmcid: 4611636
doi: 10.1073/pnas.1509788112
Kim, H. et al. NPClassifier: a deep neural network-based structural classification tool for natural products. J. Nat. Prod. https://doi.org/10.1021/acs.jnatprod.1c00399 (2021).
van Santen, J. A. et al. The Natural Products Atlas: an open access knowledge base for microbial natural products discovery. ACS Cent. Sci. 5, 1824–1833 (2019).
pubmed: 31807684
pmcid: 6891855
doi: 10.1021/acscentsci.9b00806
Kautsar, S. A. et al. MIBiG 2.0: a repository for biosynthetic gene clusters of known function. Nucleic Acids Res. 48, D454–D458 (2020).
pubmed: 31612915
Bolyen, E. et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat. Biotechnol. 37, 852–857 (2019).
pubmed: 31341288
pmcid: 7015180
doi: 10.1038/s41587-019-0209-9
Martino, C. et al. A novel sparse compositional technique reveals microbial perturbations. mSystems 4, e00016-19 (2019).
pubmed: 30801021
pmcid: 6372836
doi: 10.1128/mSystems.00016-19
Morton, J. T. et al. Establishing microbial composition measurement standards with reference frames. Nat. Commun. 10, 2719 (2019).
pubmed: 31222023
pmcid: 6586903
doi: 10.1038/s41467-019-10656-5
Vázquez-Baeza, Y., Pirrung, M., Gonzalez, A. & Knight, R. EMPeror: a tool for visualizing high-throughput microbial community data. Gigascience 2, 16 (2013).
pubmed: 24280061
pmcid: 4076506
doi: 10.1186/2047-217X-2-16
Fedarko, M. W. et al. Visualizing ’omic feature rankings and log-ratios using Qurro. NAR Genom. Bioinform. 2, lqaa023 (2020).
pubmed: 32391521
pmcid: 7194218
doi: 10.1093/nargab/lqaa023
Wilkinson, L. Ggplot2: elegant graphics for data analysis by WICKHAM, H. Biometrics 67, 678–679 (2011).
doi: 10.1111/j.1541-0420.2011.01616.x
R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2013).
Aksenov, A. A. et al. Auto-deconvolution and molecular networking of gas chromatography-mass spectrometry data. Nat. Biotechnol. 39, 169–173 (2021).
pubmed: 33169034
doi: 10.1038/s41587-020-0700-3
Marotz, L. et al. Earth Microbiome Project (EMP) high throughput (HTP) DNA extraction protocol v1. https://doi.org/10.17504/protocols.io.pdmdi46 (2018).
Marotz, C. et al. DNA extraction for streamlined metagenomics of diverse environmental samples. Biotechniques 62, 290–293 (2017).
pubmed: 28625159
doi: 10.2144/000114559
Minich, J. J. et al. KatharoSeq enables high-throughput microbiome analysis from low-biomass samples. mSystems 3, e00218-17 (2018).
pubmed: 29577086
pmcid: 5864415
doi: 10.1128/mSystems.00218-17
Minich, J. J. et al. Quantifying and understanding well-to-well contamination in microbiome research. mSystems 4, e00186-19 (2019).
pubmed: 31239396
pmcid: 6593221
doi: 10.1128/mSystems.00186-19
Shaffer, J. P. et al. A comparison of DNA/RNA extraction protocols for high-throughput sequencing of microbial communities. Biotechniques 70, 149–159 (2021).
pubmed: 33512248
pmcid: 7931620
doi: 10.2144/btn-2020-0153
Minich, J. J. et al. High-throughput miniaturized 16S rRNA amplicon library preparation reduces costs while preserving microbiome integrity. mSystems 3, e00166-18 (2018).
pubmed: 30417111
pmcid: 6222042
doi: 10.1128/mSystems.00166-18
Karst, S. M. et al. High-accuracy long-read amplicon sequences using unique molecular identifiers with Nanopore or PacBio sequencing. Nat. Methods 18, 165–169 (2021).
pubmed: 33432244
doi: 10.1038/s41592-020-01041-y
Greg, J. et al. EMP 16S Illumina amplicon protocol v1. https://doi.org/10.17504/protocols.io.nuudeww (2018).
Caporaso, J. G. et al. Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proc. Natl Acad. Sci. USA 108, 4516–4522 (2011).
pubmed: 20534432
doi: 10.1073/pnas.1000080107
Caporaso, J. G. et al. Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms. ISME J. 6, 1621–1624 (2012).
pubmed: 22402401
pmcid: 3400413
doi: 10.1038/ismej.2012.8
Parada, A. E., Needham, D. M. & Fuhrman, J. A. Every base matters: assessing small subunit rRNA primers for marine microbiomes with mock communities, time series and global field samples. Environ. Microbiol. 18, 1403–1414 (2016).
pubmed: 26271760
doi: 10.1111/1462-2920.13023
Apprill, A., McNally, S., Parsons, R. & Weber, L. Minor revision to V4 region SSU rRNA 806R gene primer greatly increases detection of SAR11 bacterioplankton. Aquat. Microb. Ecol. 75, 129–137 (2015).
doi: 10.3354/ame01753
Quince, C., Lanzen, A., Davenport, R. J. & Turnbaugh, P. J. Removing noise from pyrosequenced amplicons. BMC Bioinformatics 12, 38 (2011).
pubmed: 21276213
pmcid: 3045300
doi: 10.1186/1471-2105-12-38
Walters, W. et al. Improved bacterial 16S rRNA gene (V4 and V4-5) and fungal internal transcribed spacer marker gene primers for microbial community surveys. mSystems 1, e00009-15 (2016).
pubmed: 27822518
doi: 10.1128/mSystems.00009-15
Linda et al. EMP 18S Illumina amplicon protocol v1. https://doi.org/10.17504/protocols.io.nuvdew6 (2018).
Amaral-Zettler, L. A., McCliment, E. A., Ducklow, H. W. & Huse, S. M. A method for studying protistan diversity using massively parallel sequencing of V9 hypervariable regions of small-subunit ribosomal RNA genes. PLoS ONE 4, e6372 (2009).
pubmed: 19633714
pmcid: 2711349
doi: 10.1371/journal.pone.0006372
Stoeck, T. et al. Multiple marker parallel tag environmental DNA sequencing reveals a highly complex eukaryotic community in marine anoxic water. Mol. Ecol. 19, 21–31 (2010).
pubmed: 20331767
doi: 10.1111/j.1365-294X.2009.04480.x
Vestheim, H. & Jarman, S. N. Blocking primers to enhance PCR amplification of rare sequences in mixed samples - a case study on prey DNA in Antarctic krill stomachs. Front. Zool. 5, 12 (2008).
pubmed: 18638418
pmcid: 2517594
doi: 10.1186/1742-9994-5-12
Dylan, P. S. et al. EMP ITS Illumina amplicon protocol v1. https://doi.org/10.17504/protocols.io.pa7dihn (2018).
White, T. et al. Amplification and direct sequencing of fungal ribosomal RNA genes for phylogenetics. In PCR Protocols: A Guide to Methods and Applications, Innis, M.A., D.H. Gelfand, J.J. Sninsky, and T.J. White (eds). 315–322. New Jersey, Academic Press (1990) https://www.sciencedirect.com/book/9780123721808/pcr-protocols
Hoggard, M. et al. Characterizing the human mycobiota: a comparison of small subunit rRNA, ITS1, ITS2, and large subunit rRNA genomic targets. Front. Microbiol. 9, 2208 (2018).
pubmed: 30283425
pmcid: 6157398
doi: 10.3389/fmicb.2018.02208
Bokulich, N. A. & Mills, D. A. Improved selection of internal transcribed spacer-specific primers enables quantitative, ultra-high-throughput profiling of fungal communities. Appl. Environ. Microbiol. 79, 2519–2526 (2013).
pubmed: 23377949
pmcid: 3623200
doi: 10.1128/AEM.03870-12
Klindworth, A. et al. Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic Acids Res. 41, e1 (2012).
pubmed: 22933715
pmcid: 3592464
doi: 10.1093/nar/gks808
Hunt, D. E. et al. Evaluation of 23S rRNA PCR primers for use in phylogenetic studies of bacterial diversity. Appl. Environ. Microbiol. 72, 2221–2225 (2006).
pubmed: 16517676
pmcid: 1393206
doi: 10.1128/AEM.72.3.2221-2225.2006
Amir, A. et al. Deblur rapidly resolves single-nucleotide community sequence patterns. mSystems 2, e00191-16 (2017).
pubmed: 28289731
pmcid: 5340863
doi: 10.1128/mSystems.00191-16
McDonald, D. et al. An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analysis of bacteria and archaea. ISME J. 6, 610–618 (2012).
pubmed: 22134646
doi: 10.1038/ismej.2011.139
Janssen, S. et al. Phylogenetic placement of exact amplicon sequences improves associations with clinical information. mSystems 3, e00021-18 (2018).
pubmed: 29719869
pmcid: 5904434
doi: 10.1128/mSystems.00021-18
Hamady, M. & Knight, R. Microbial community profiling for human microbiome projects: Tools techniques and challenges. Genome Res. 19, 1141–1152 (2009).
pubmed: 19383763
pmcid: 3776646
doi: 10.1101/gr.085464.108
Hamady, M., Walker, J., Harris, J., Gold, J. K. & Knight, R. Error-correcting barcoded primers for pyrosequencing hundreds of samples in multiplex. Nat. Methods 5, 235–237 (2008).
pubmed: 18264105
pmcid: 3439997
doi: 10.1038/nmeth.1184
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal 17, https://doi.org/10.14806/ej.17.1.200 (2011).
Callahan, B. J., McMurdie, P. J. & Holmes, S. P. Exact sequence variants should replace operational taxonomic units in marker-gene data analysis. ISME J. 11, 2639–2643 (2017).
pubmed: 28731476
pmcid: 5702726
doi: 10.1038/ismej.2017.119
Bokulich, N. A. et al. Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2’s q2-feature-classifier plugin. Microbiome 6, 90 (2018).
pubmed: 29773078
pmcid: 5956843
doi: 10.1186/s40168-018-0470-z
Yilmaz, P. et al. The SILVA and “All-species Living Tree Project (LPT)” taxonomic frameworks. Nucleic Acids Res. 42, D643–D648 (2014).
pubmed: 24293649
doi: 10.1093/nar/gkt1209
Nilsson, R. H. et al. The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications. Nucleic Acids Res. 47, D259–D264 (2018).
pmcid: 6324048
doi: 10.1093/nar/gky1022
Rognes, T., Flouri, T., Nichols, B., Quince, C. & Mahé, F. Vsearch: a versatile open source tool for metagenomics. PeerJ 4, e2584 (2016).
pubmed: 27781170
pmcid: 5075697
doi: 10.7717/peerj.2584
Sanders, J. G. et al. Optimizing sequencing protocols for leaderboard metagenomics by combining long and short reads. Genome Biol. 20, 226 (2019).
pubmed: 31672156
pmcid: 6822431
doi: 10.1186/s13059-019-1834-9
Glenn, T. C. et al. Adapterama I: universal stubs and primers for 384 unique dual-indexed or 147,456 combinatorially-indexed Illumina libraries (iTru & iNext). PeerJ 7, e7755 (2019).
pubmed: 31616586
pmcid: 6791352
doi: 10.7717/peerj.7755
Didion, J. P., Martin, M. & Collins, F. S. Atropos: specific, sensitive, and speedy trimming of sequencing reads. PeerJ 5, e3720 (2017).
pubmed: 28875074
pmcid: 5581536
doi: 10.7717/peerj.3720
Zhu, Q. et al. Phylogenomics of 10,575 genomes reveals evolutionary proximity between domains Bacteria and Archaea. Nat. Commun. 10, 5477 (2019).
pubmed: 31792218
pmcid: 6889312
doi: 10.1038/s41467-019-13443-4
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie2. Nat. Methods 9, 357–359 (2012).
pubmed: 22388286
pmcid: 3322381
doi: 10.1038/nmeth.1923
Sczyrba, A. et al. Critical assessment of metagenome interpretation: a benchmark of metagenomics software. Nat. Methods 14, 1063–1071 (2017).
pubmed: 28967888
pmcid: 5903868
doi: 10.1038/nmeth.4458
Meyer, F. et al. Critical assessment of metagenome interpretation: the second round of challenges. Nat. Methods 19, 429–440 (2022).
pubmed: 35396482
pmcid: 9007738
doi: 10.1038/s41592-022-01431-4
Zhu, Q. et al. Phylogeny-aware analysis of metagenome community ecology based on matched reference genomes while bypassing taxonomy. mSystems 7, e00167-22 (2022).
pubmed: 35369727
pmcid: 9040630
doi: 10.1128/msystems.00167-22
Pennell, M. W. et al. geiger v2.0: an expanded suite of methods for fitting macroevolutionary models to phylogenetic trees. Bioinformatics 30, 2216–2218 (2014).
pubmed: 24728855
doi: 10.1093/bioinformatics/btu181
Swenson, N. G. Functional and Phylogenetic Ecology in R (Springer, 2014).
Cantrell, K. et al. EMPress enables tree-guided, interactive, and exploratory analysis of multi-omic data sets. mSystems 6, e01216–e01220 (2021).
pubmed: 33727399
pmcid: 8546999
doi: 10.1128/mSystems.01216-20
Kanehisa, M. & Goto, S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 28, 27–30 (2000).
pubmed: 10592173
pmcid: 102409
doi: 10.1093/nar/28.1.27
Kanehisa, M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 28, 1947–1951 (2019).
pubmed: 31441146
pmcid: 6798127
doi: 10.1002/pro.3715
Kanehisa, M. et al. KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 49, D545–D551 (2021).
pubmed: 33125081
doi: 10.1093/nar/gkaa970
Utro, F. et al. Hierarchically labeled database indexing allows scalable characterization of microbiomes. iScience 23, 100988 (2020).
pubmed: 32248063
pmcid: 7125348
doi: 10.1016/j.isci.2020.100988
Seabolt, E. E. et al. Functional genomics platform, a cloud-based platform for studying microbial life at scale. IEEE/ACM Trans. Comput. Biol. Bioinf. 19, 940–952 (2022).
doi: 10.1109/TCBB.2020.3021231
Haiminen, N. et al. Functional profiling of COVID-19 respiratory tract microbiomes. Sci. Rep. 11, 6433 (2021).
pubmed: 33742096
pmcid: 7979704
doi: 10.1038/s41598-021-85750-0
Almeida-Neto, M. et al. A consistent metric for nestedness analysis in ecological systems: reconciling concept and measurement. Oikos 117, 1227–1239 (2008).
doi: 10.1111/j.0030-1299.2008.16644.x
Makowski, D. et al. Methods and algorithms for correlation analysis in R. J. Open Source Softw. 5, 2306 (2019).
doi: 10.21105/joss.02306
Carrieri, A. P. et al. Explainable AI reveals changes in skin microbiome composition linked to phenotypic differences. Sci. Rep. 11, 4565 (2021).
pubmed: 33633172
pmcid: 7907326
doi: 10.1038/s41598-021-83922-6
Lundberg, S. & Lee, S.-I. A unified approach to interpreting model predictions. In Proc. 31st International Conference on Neural Information Processing Systems, Guyon, I., U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (eds). 4768–4777 (Curran Associates, 2017). https://doi.org/10.5555/3295222.3295230
Chen, T. & Guestrin, C. XGBoost: a scalable tree boosting system. In Proc. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785–794 (ACM, 2016). https://doi.org/10.1145/2939672.2939785
Allaband, C. et al. Intermittent hypoxia and hypercapnia alter diurnal rhythms of luminal gut microbiome and metabolome. mSystems 6, e00116–e00121 (2021).
pubmed: 34184915
pmcid: 8269208
doi: 10.1128/mSystems.00116-21
Fierer, N. et al. Global patterns in belowground communities. Ecol. Lett. 12, 1238–1249 (2009).
pubmed: 19674041
doi: 10.1111/j.1461-0248.2009.01360.x