From soil to sequence: filling the critical gap in genome-resolved metagenomics is essential to the future of soil microbial ecology.
FAIR Data Principles
Genome Resolved Metagenomics
Hybrid Assembly
Microbiome Assembled Genomes
Soil Microbiome
Journal
Environmental microbiome
ISSN: 2524-6372
Titre abrégé: Environ Microbiome
Pays: England
ID NLM: 101768168
Informations de publication
Date de publication:
02 Aug 2024
02 Aug 2024
Historique:
received:
12
05
2024
accepted:
22
07
2024
medline:
3
8
2024
pubmed:
3
8
2024
entrez:
2
8
2024
Statut:
epublish
Résumé
Soil microbiomes are heterogeneous, complex microbial communities. Metagenomic analysis is generating vast amounts of data, creating immense challenges in sequence assembly and analysis. Although advances in technology have resulted in the ability to easily collect large amounts of sequence data, soil samples containing thousands of unique taxa are often poorly characterized. These challenges reduce the usefulness of genome-resolved metagenomic (GRM) analysis seen in other fields of microbiology, such as the creation of high quality metagenomic assembled genomes and the adoption of genome scale modeling approaches. The absence of these resources restricts the scale of future research, limiting hypothesis generation and the predictive modeling of microbial communities. Creating publicly available databases of soil MAGs, similar to databases produced for other microbiomes, has the potential to transform scientific insights about soil microbiomes without requiring the computational resources and domain expertise for assembly and binning.
Identifiants
pubmed: 39095861
doi: 10.1186/s40793-024-00599-w
pii: 10.1186/s40793-024-00599-w
doi:
Types de publication
Letter
Langues
eng
Pagination
56Subventions
Organisme : U.S. Department of Energy
ID : DE-AC05-76RL01830
Organisme : U.S. Department of Energy
ID : DE-AC02-05CH11231
Organisme : U.S. Department of Energy
ID : DE-AC02-05CH11231
Organisme : U.S. Department of Energy
ID : 89233218CNA000001
Organisme : U.S. Department of Energy
ID : DE-AC02-05CH11231
Informations de copyright
© 2024. The Author(s).
Références
Banerjee S, van der Heijden MGA. Soil microbiomes and one health. Nat Rev Microbiol. 2023;21:6–20.
pubmed: 35999468
doi: 10.1038/s41579-022-00779-w
Scharlemann JP, Tanner EV, Hiederer R, Kapos V. Global soil carbon: understanding and managing the largest terrestrial carbon pool. Carbon Manag. 2014;5:81–91.
doi: 10.4155/cmt.13.77
Wu R, et al. Moisture modulates soil reservoirs of active DNA and RNA viruses. Commun Biol. 2021;4:992.
pubmed: 34446837
pmcid: 8390657
doi: 10.1038/s42003-021-02514-2
Fierer N. Embracing the unknown: disentangling the complexities of the soil microbiome. Nat Rev Microbiol. 2017;15:579–90.
pubmed: 28824177
doi: 10.1038/nrmicro.2017.87
Wagg C, Schlaeppi K, Banerjee S, Kuramae EE, van der Heijden MG. A. Fungal-bacterial diversity and microbiome complexity predict ecosystem functioning. Nat Commun. 2019;10:4841.
pubmed: 31649246
pmcid: 6813331
doi: 10.1038/s41467-019-12798-y
Bastida F, et al. Soil microbial diversity-biomass relationships are driven by soil carbon content across global biomes. ISME J. 2021;15:2081–91.
pubmed: 33564112
pmcid: 8245509
doi: 10.1038/s41396-021-00906-0
Quince C, Walker AW, Simpson JT, Loman NJ, Segata N. Shotgun metagenomics, from sampling to analysis. Nat Biotechnol. 2017;35:833–44.
pubmed: 28898207
doi: 10.1038/nbt.3935
Huttenhower C, et al. Structure, function and diversity of the healthy human microbiome. Nature. 2012;486:207–14.
doi: 10.1038/nature11234
Nayfach S, et al. A genomic catalog of Earth’s microbiomes. Nat Biotechnol. 2021;39:499–509.
pubmed: 33169036
doi: 10.1038/s41587-020-0718-6
Segata N, et al. Computational meta’omics for microbial community studies. Mol Syst Biol. 2013;9:666.
pubmed: 23670539
pmcid: 4039370
doi: 10.1038/msb.2013.22
Beghini F, et al. Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery 3. eLife. 2021;10:e65088.
pubmed: 33944776
pmcid: 8096432
doi: 10.7554/eLife.65088
Tyson GW, et al. Community structure and metabolism through reconstruction of microbial genomes from the environment. Nature. 2004;428:37–43.
pubmed: 14961025
doi: 10.1038/nature02340
Setubal JC. Metagenome-assembled genomes: concepts, analogies, and challenges. Biophys Rev. 2021;13:905–9.
pubmed: 35059016
pmcid: 8724365
doi: 10.1007/s12551-021-00865-y
Sangwan N, Xia F, Gilbert JA. Recovering complete and draft population genomes from metagenome datasets. Microbiome. 2016;4:8.
pubmed: 26951112
pmcid: 4782286
doi: 10.1186/s40168-016-0154-5
Singleton CM et al. Connecting structure to function with the recovery of over 1000 high-quality metagenome-assembled genomes from activated sludge using long-read sequencing. Nat. Commun 12, 2009 (2021).
Nelson WC, et al. Terabase Metagenome sequencing of Grassland Soil Microbiomes. Microbiol Resour Announc. 2020;9. https://doi.org/10.1128/mra.00718 .
Hofmeyr S, et al. Terabase-scale metagenome coassembly with MetaHipMer. Sci Rep. 2020;10:10689.
pubmed: 32612216
pmcid: 7329831
doi: 10.1038/s41598-020-67416-5
Delgado LF, Andersson AF. Evaluating metagenomic assembly approaches for biome-specific gene catalogues. Microbiome. 2022;10:1–11.
doi: 10.1186/s40168-022-01259-2
Uritskiy GV, DiRuggiero J, Taylor J. MetaWRAP—a flexible pipeline for genome-resolved metagenomic data analysis. Microbiome. 2018;6:158.
pubmed: 30219103
pmcid: 6138922
doi: 10.1186/s40168-018-0541-1
Schulz F, et al. Hidden diversity of soil giant viruses. Nat Commun. 2018;9:4881.
pubmed: 30451857
pmcid: 6243002
doi: 10.1038/s41467-018-07335-2
Riley R et al. Terabase-Scale Coassembly of a Tropical Soil Microbiome. Microbiol Spectr e00200–23 (2023).
Davison AJ. Journal of General Virology – introduction to ‘ICTV Virus Taxonomy profiles’. J Gen Virol. 2017;98:1–1.
pubmed: 28218575
pmcid: 5370390
doi: 10.1099/jgv.0.000686
Sayers EW, et al. Database resources of the national center for biotechnology information. Nucleic Acids Res. 2022;50:D20–6.
pubmed: 34850941
doi: 10.1093/nar/gkab1112
Tully BJ, Graham ED, Heidelberg JF. The reconstruction of 2,631 draft metagenome-assembled genomes from the global oceans. Sci Data. 2018;5:1–8.
doi: 10.1038/sdata.2017.203
Borton MA et al. A functional microbiome catalog crowdsourced from North American rivers. bioRxiv 2023–07 (2023).
Sunagawa S, et al. Tara Oceans: towards global ocean ecosystems biology. Nat Rev Microbiol. 2020;18:428–45.
pubmed: 32398798
doi: 10.1038/s41579-020-0364-5
Gurbich TA, et al. MGnify genomes: a resource for Biome-specific Microbial Genome catalogues. Comput Resour Mol Biol. 2023;435:168016.
Eloe-Fadrosh EA, et al. The National Microbiome Data Collaborative Data Portal: an integrated multi-omics microbiome data resource. Nucleic Acids Res. 2022;50:D828–36.
Choi J, et al. Strategies to improve reference databases for soil microbiomes. ISME J. 2017;11:829–34.
pubmed: 27935589
doi: 10.1038/ismej.2016.168
Ma B, et al. A genomic catalogue of soil microbiomes boosts mining of biodiversity and genetic resources. Nat Commun. 2023;14:7318.
pubmed: 37951952
pmcid: 10640626
doi: 10.1038/s41467-023-43000-z
Carini P, et al. Relic DNA is abundant in soil and obscures estimates of soil microbial diversity. Nat Microbiol. 2016;2:1–6.
doi: 10.1038/nmicrobiol.2016.242
Lennon JT, Muscarella ME, Placella SA, Lehmkuhl BK. How, when, and where Relic DNA affects Microbial Diversity. mBio. 2018;9. https://doi.org/10.1128/mbio.00637-18 .
Carini P, et al. Effects of spatial variability and Relic DNA removal on the detection of temporal dynamics in Soil Microbial communities. mBio. 2020;11. https://doi.org/10.1128/mbio.02776-19 .
Zaikova E et al. Antarctic Relic Microbial Mat Community revealed by Metagenomics and Metatranscriptomics. Front Ecol Evol 7, (2019).
Marx V. Method of the year: long-read sequencing. Nat Methods. 2023;20:6–11.
pubmed: 36635542
doi: 10.1038/s41592-022-01730-w
Chen Z, Erickson DL, Meng J. Benchmarking hybrid assembly approaches for genomic analyses of bacterial pathogens using Illumina and Oxford Nanopore sequencing. BMC Genomics. 2020;21:631.
pubmed: 32928108
pmcid: 7490894
doi: 10.1186/s12864-020-07041-8
Antipov D, Korobeynikov A, McLean JS, Pevzner PA. hybridSPAdes: an algorithm for hybrid assembly of short and long reads. Bioinformatics. 2016;32:1009–15.
pubmed: 26589280
doi: 10.1093/bioinformatics/btv688
Wick RR, Judd LM, Gorrie CL, Holt KE. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLOS Comput Biol. 2017;13:e1005595.
pubmed: 28594827
pmcid: 5481147
doi: 10.1371/journal.pcbi.1005595
Tao Y et al. Improved Assembly of Metagenome-assembled genomes and viruses in tibetan saline Lake Sediment by HiFi Metagenomic sequencing. Microbiol Spectr 11, e03328–22.
Orellana LH, Krüger K, Sidhu C, Amann R. Comparing genomes recovered from time-series metagenomes using long- and short-read sequencing technologies. Microbiome. 2023;11:105.
pubmed: 37179340
pmcid: 10182627
doi: 10.1186/s40168-023-01557-3
Gehrig JL, et al. Finding the right fit: evaluation of short-read and long-read sequencing approaches to maximize the utility of clinical microbiome data. Microb Genomics. 2022;8:000794.
Zhang Z, Yang C, Veldsman WP, Fang X, Zhang L. Benchmarking genome assembly methods on metagenomic sequencing data. Brief Bioinform. 2023;24:bbad087.
pubmed: 36917471
doi: 10.1093/bib/bbad087
Eisenhofer R, et al. A comparison of short-read, HiFi long-read, and hybrid strategies for genome-resolved metagenomics. Microbiol Spectr. 2024;12:e03590–23.
pubmed: 38451230
pmcid: 10986573
doi: 10.1128/spectrum.03590-23
Zhang H, Jain C, Aluru S. A comprehensive evaluation of long read error correction methods. BMC Genomics. 2020;21:889.
pubmed: 33349243
pmcid: 7751105
doi: 10.1186/s12864-020-07227-0
Bonetta R, Valentino G. Machine learning techniques for protein function prediction. Proteins Struct Funct Bioinforma. 2020;88:397–413.
doi: 10.1002/prot.25832
Sanderson T, Bileschi ML, Belanger D, Colwell LJ. ProteInfer: deep networks for protein functional inference. Biorxiv 2021–09 (2021).
de Padovani K, et al. Machine learning meets genome assembly. Brief Bioinform. 2019;20:2116–29.
doi: 10.1093/bib/bby072
Wu R, et al. Hi-C metagenome sequencing reveals soil phage–host interactions. Nat Commun. 2023;14:7666.
pubmed: 37996432
pmcid: 10667309
doi: 10.1038/s41467-023-42967-z
Belton J-M, et al. Hi–C: a comprehensive technique to capture the conformation of genomes. Methods. 2012;58:268–76.
pubmed: 22652625
doi: 10.1016/j.ymeth.2012.05.001
Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–55.
pubmed: 25977477
pmcid: 4484387
doi: 10.1101/gr.186072.114
Orakov A, et al. GUNC: detection of chimerism and contamination in prokaryotic genomes. Genome Biol. 2021;22:178.
pubmed: 34120611
pmcid: 8201837
doi: 10.1186/s13059-021-02393-0
Bonner WA, Hulett HR, Sweet RG, Herzenberg LA. Fluorescence activated cell sorting. Rev Sci Instrum. 2003;43:404–9.
doi: 10.1063/1.1685647
Nuccio EE, et al. HT-SIP: a semi-automated stable isotope probing pipeline identifies cross-kingdom interactions in the hyphosphere of arbuscular mycorrhizal fungi. Microbiome. 2022;10:199.
pubmed: 36434737
pmcid: 9700909
doi: 10.1186/s40168-022-01391-z
Priest SJ, Yadav V, Heitman. J. Advances in understanding the evolution of fungal genome architecture. F1000Research 9, F1000 Faculty Rev-776 (2020).
Peng X, et al. Genomic and functional analyses of fungal and bacterial consortia that enable lignocellulose breakdown in goat gut microbiomes. Nat Microbiol. 2021;6:499–511.
pubmed: 33526884
pmcid: 8007473
doi: 10.1038/s41564-020-00861-0
Singh NK, et al. Characterization of metagenome-assembled genomes from the International Space Station. Microbiome. 2023;11:125.
pubmed: 37264385
pmcid: 10233975
doi: 10.1186/s40168-023-01545-7
Stajich JE, et al. FungiDB: an integrated functional genomics database for fungi. Nucleic Acids Res. 2012;40:D675–81.
pubmed: 22064857
doi: 10.1093/nar/gkr918
Wilkinson MD, et al. The FAIR Guiding principles for scientific data management and stewardship. Sci Data. 2016;3:160018.
pubmed: 26978244
pmcid: 4792175
doi: 10.1038/sdata.2016.18
Yilmaz P, et al. Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications. Nat Biotechnol. 2011;29:415–20.
pubmed: 21552244
pmcid: 3367316
doi: 10.1038/nbt.1823
Zeng S, et al. A compendium of 32,277 metagenome-assembled genomes and over 80 million genes from the early-life human gut microbiome. Nat Commun. 2022;13:5139.
pubmed: 36050292
pmcid: 9437082
doi: 10.1038/s41467-022-32805-z
Qin J, et al. A human gut microbial gene catalog established by metagenomic sequencing. Nature. 2010;464:59–65.
pubmed: 20203603
pmcid: 3779803
doi: 10.1038/nature08821
Molecular Observation Network (MONet). | Environmental Molecular Sciences Laboratory. https://www.emsl.pnnl.gov/monet .
Arkin AP, et al. KBase: the United States department of energy systems biology knowledgebase. Nat Biotechnol. 2018;36:566–9.
pubmed: 29979655
pmcid: 6870991
doi: 10.1038/nbt.4163
The Galaxy Community. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2022 update. Nucleic Acids Res. 2022;50:W345–51.
doi: 10.1093/nar/gkac247