From soil to sequence: filling the critical gap in genome-resolved metagenomics is essential to the future of soil microbial ecology.

FAIR Data Principles Genome Resolved Metagenomics Hybrid Assembly Microbiome Assembled Genomes Soil Microbiome

Journal

Environmental microbiome
ISSN: 2524-6372
Titre abrégé: Environ Microbiome
Pays: England
ID NLM: 101768168

Informations de publication

Date de publication:
02 Aug 2024
Historique:
received: 12 05 2024
accepted: 22 07 2024
medline: 3 8 2024
pubmed: 3 8 2024
entrez: 2 8 2024
Statut: epublish

Résumé

Soil microbiomes are heterogeneous, complex microbial communities. Metagenomic analysis is generating vast amounts of data, creating immense challenges in sequence assembly and analysis. Although advances in technology have resulted in the ability to easily collect large amounts of sequence data, soil samples containing thousands of unique taxa are often poorly characterized. These challenges reduce the usefulness of genome-resolved metagenomic (GRM) analysis seen in other fields of microbiology, such as the creation of high quality metagenomic assembled genomes and the adoption of genome scale modeling approaches. The absence of these resources restricts the scale of future research, limiting hypothesis generation and the predictive modeling of microbial communities. Creating publicly available databases of soil MAGs, similar to databases produced for other microbiomes, has the potential to transform scientific insights about soil microbiomes without requiring the computational resources and domain expertise for assembly and binning.

Identifiants

pubmed: 39095861
doi: 10.1186/s40793-024-00599-w
pii: 10.1186/s40793-024-00599-w
doi:

Types de publication

Letter

Langues

eng

Pagination

56

Subventions

Organisme : U.S. Department of Energy
ID : DE-AC05-76RL01830
Organisme : U.S. Department of Energy
ID : DE-AC02-05CH11231
Organisme : U.S. Department of Energy
ID : DE-AC02-05CH11231
Organisme : U.S. Department of Energy
ID : 89233218CNA000001
Organisme : U.S. Department of Energy
ID : DE-AC02-05CH11231

Informations de copyright

© 2024. The Author(s).

Références

Banerjee S, van der Heijden MGA. Soil microbiomes and one health. Nat Rev Microbiol. 2023;21:6–20.
pubmed: 35999468 doi: 10.1038/s41579-022-00779-w
Scharlemann JP, Tanner EV, Hiederer R, Kapos V. Global soil carbon: understanding and managing the largest terrestrial carbon pool. Carbon Manag. 2014;5:81–91.
doi: 10.4155/cmt.13.77
Wu R, et al. Moisture modulates soil reservoirs of active DNA and RNA viruses. Commun Biol. 2021;4:992.
pubmed: 34446837 pmcid: 8390657 doi: 10.1038/s42003-021-02514-2
Fierer N. Embracing the unknown: disentangling the complexities of the soil microbiome. Nat Rev Microbiol. 2017;15:579–90.
pubmed: 28824177 doi: 10.1038/nrmicro.2017.87
Wagg C, Schlaeppi K, Banerjee S, Kuramae EE, van der Heijden MG. A. Fungal-bacterial diversity and microbiome complexity predict ecosystem functioning. Nat Commun. 2019;10:4841.
pubmed: 31649246 pmcid: 6813331 doi: 10.1038/s41467-019-12798-y
Bastida F, et al. Soil microbial diversity-biomass relationships are driven by soil carbon content across global biomes. ISME J. 2021;15:2081–91.
pubmed: 33564112 pmcid: 8245509 doi: 10.1038/s41396-021-00906-0
Quince C, Walker AW, Simpson JT, Loman NJ, Segata N. Shotgun metagenomics, from sampling to analysis. Nat Biotechnol. 2017;35:833–44.
pubmed: 28898207 doi: 10.1038/nbt.3935
Huttenhower C, et al. Structure, function and diversity of the healthy human microbiome. Nature. 2012;486:207–14.
doi: 10.1038/nature11234
Nayfach S, et al. A genomic catalog of Earth’s microbiomes. Nat Biotechnol. 2021;39:499–509.
pubmed: 33169036 doi: 10.1038/s41587-020-0718-6
Segata N, et al. Computational meta’omics for microbial community studies. Mol Syst Biol. 2013;9:666.
pubmed: 23670539 pmcid: 4039370 doi: 10.1038/msb.2013.22
Beghini F, et al. Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery 3. eLife. 2021;10:e65088.
pubmed: 33944776 pmcid: 8096432 doi: 10.7554/eLife.65088
Tyson GW, et al. Community structure and metabolism through reconstruction of microbial genomes from the environment. Nature. 2004;428:37–43.
pubmed: 14961025 doi: 10.1038/nature02340
Setubal JC. Metagenome-assembled genomes: concepts, analogies, and challenges. Biophys Rev. 2021;13:905–9.
pubmed: 35059016 pmcid: 8724365 doi: 10.1007/s12551-021-00865-y
Sangwan N, Xia F, Gilbert JA. Recovering complete and draft population genomes from metagenome datasets. Microbiome. 2016;4:8.
pubmed: 26951112 pmcid: 4782286 doi: 10.1186/s40168-016-0154-5
Singleton CM et al. Connecting structure to function with the recovery of over 1000 high-quality metagenome-assembled genomes from activated sludge using long-read sequencing. Nat. Commun 12, 2009 (2021).
Nelson WC, et al. Terabase Metagenome sequencing of Grassland Soil Microbiomes. Microbiol Resour Announc. 2020;9. https://doi.org/10.1128/mra.00718 .
Hofmeyr S, et al. Terabase-scale metagenome coassembly with MetaHipMer. Sci Rep. 2020;10:10689.
pubmed: 32612216 pmcid: 7329831 doi: 10.1038/s41598-020-67416-5
Delgado LF, Andersson AF. Evaluating metagenomic assembly approaches for biome-specific gene catalogues. Microbiome. 2022;10:1–11.
doi: 10.1186/s40168-022-01259-2
Uritskiy GV, DiRuggiero J, Taylor J. MetaWRAP—a flexible pipeline for genome-resolved metagenomic data analysis. Microbiome. 2018;6:158.
pubmed: 30219103 pmcid: 6138922 doi: 10.1186/s40168-018-0541-1
Schulz F, et al. Hidden diversity of soil giant viruses. Nat Commun. 2018;9:4881.
pubmed: 30451857 pmcid: 6243002 doi: 10.1038/s41467-018-07335-2
Riley R et al. Terabase-Scale Coassembly of a Tropical Soil Microbiome. Microbiol Spectr e00200–23 (2023).
Davison AJ. Journal of General Virology – introduction to ‘ICTV Virus Taxonomy profiles’. J Gen Virol. 2017;98:1–1.
pubmed: 28218575 pmcid: 5370390 doi: 10.1099/jgv.0.000686
Sayers EW, et al. Database resources of the national center for biotechnology information. Nucleic Acids Res. 2022;50:D20–6.
pubmed: 34850941 doi: 10.1093/nar/gkab1112
Tully BJ, Graham ED, Heidelberg JF. The reconstruction of 2,631 draft metagenome-assembled genomes from the global oceans. Sci Data. 2018;5:1–8.
doi: 10.1038/sdata.2017.203
Borton MA et al. A functional microbiome catalog crowdsourced from North American rivers. bioRxiv 2023–07 (2023).
Sunagawa S, et al. Tara Oceans: towards global ocean ecosystems biology. Nat Rev Microbiol. 2020;18:428–45.
pubmed: 32398798 doi: 10.1038/s41579-020-0364-5
Gurbich TA, et al. MGnify genomes: a resource for Biome-specific Microbial Genome catalogues. Comput Resour Mol Biol. 2023;435:168016.
Eloe-Fadrosh EA, et al. The National Microbiome Data Collaborative Data Portal: an integrated multi-omics microbiome data resource. Nucleic Acids Res. 2022;50:D828–36.
Choi J, et al. Strategies to improve reference databases for soil microbiomes. ISME J. 2017;11:829–34.
pubmed: 27935589 doi: 10.1038/ismej.2016.168
Ma B, et al. A genomic catalogue of soil microbiomes boosts mining of biodiversity and genetic resources. Nat Commun. 2023;14:7318.
pubmed: 37951952 pmcid: 10640626 doi: 10.1038/s41467-023-43000-z
Carini P, et al. Relic DNA is abundant in soil and obscures estimates of soil microbial diversity. Nat Microbiol. 2016;2:1–6.
doi: 10.1038/nmicrobiol.2016.242
Lennon JT, Muscarella ME, Placella SA, Lehmkuhl BK. How, when, and where Relic DNA affects Microbial Diversity. mBio. 2018;9. https://doi.org/10.1128/mbio.00637-18 .
Carini P, et al. Effects of spatial variability and Relic DNA removal on the detection of temporal dynamics in Soil Microbial communities. mBio. 2020;11. https://doi.org/10.1128/mbio.02776-19 .
Zaikova E et al. Antarctic Relic Microbial Mat Community revealed by Metagenomics and Metatranscriptomics. Front Ecol Evol 7, (2019).
Marx V. Method of the year: long-read sequencing. Nat Methods. 2023;20:6–11.
pubmed: 36635542 doi: 10.1038/s41592-022-01730-w
Chen Z, Erickson DL, Meng J. Benchmarking hybrid assembly approaches for genomic analyses of bacterial pathogens using Illumina and Oxford Nanopore sequencing. BMC Genomics. 2020;21:631.
pubmed: 32928108 pmcid: 7490894 doi: 10.1186/s12864-020-07041-8
Antipov D, Korobeynikov A, McLean JS, Pevzner PA. hybridSPAdes: an algorithm for hybrid assembly of short and long reads. Bioinformatics. 2016;32:1009–15.
pubmed: 26589280 doi: 10.1093/bioinformatics/btv688
Wick RR, Judd LM, Gorrie CL, Holt KE. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLOS Comput Biol. 2017;13:e1005595.
pubmed: 28594827 pmcid: 5481147 doi: 10.1371/journal.pcbi.1005595
Tao Y et al. Improved Assembly of Metagenome-assembled genomes and viruses in tibetan saline Lake Sediment by HiFi Metagenomic sequencing. Microbiol Spectr 11, e03328–22.
Orellana LH, Krüger K, Sidhu C, Amann R. Comparing genomes recovered from time-series metagenomes using long- and short-read sequencing technologies. Microbiome. 2023;11:105.
pubmed: 37179340 pmcid: 10182627 doi: 10.1186/s40168-023-01557-3
Gehrig JL, et al. Finding the right fit: evaluation of short-read and long-read sequencing approaches to maximize the utility of clinical microbiome data. Microb Genomics. 2022;8:000794.
Zhang Z, Yang C, Veldsman WP, Fang X, Zhang L. Benchmarking genome assembly methods on metagenomic sequencing data. Brief Bioinform. 2023;24:bbad087.
pubmed: 36917471 doi: 10.1093/bib/bbad087
Eisenhofer R, et al. A comparison of short-read, HiFi long-read, and hybrid strategies for genome-resolved metagenomics. Microbiol Spectr. 2024;12:e03590–23.
pubmed: 38451230 pmcid: 10986573 doi: 10.1128/spectrum.03590-23
Zhang H, Jain C, Aluru S. A comprehensive evaluation of long read error correction methods. BMC Genomics. 2020;21:889.
pubmed: 33349243 pmcid: 7751105 doi: 10.1186/s12864-020-07227-0
Bonetta R, Valentino G. Machine learning techniques for protein function prediction. Proteins Struct Funct Bioinforma. 2020;88:397–413.
doi: 10.1002/prot.25832
Sanderson T, Bileschi ML, Belanger D, Colwell LJ. ProteInfer: deep networks for protein functional inference. Biorxiv 2021–09 (2021).
de Padovani K, et al. Machine learning meets genome assembly. Brief Bioinform. 2019;20:2116–29.
doi: 10.1093/bib/bby072
Wu R, et al. Hi-C metagenome sequencing reveals soil phage–host interactions. Nat Commun. 2023;14:7666.
pubmed: 37996432 pmcid: 10667309 doi: 10.1038/s41467-023-42967-z
Belton J-M, et al. Hi–C: a comprehensive technique to capture the conformation of genomes. Methods. 2012;58:268–76.
pubmed: 22652625 doi: 10.1016/j.ymeth.2012.05.001
Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–55.
pubmed: 25977477 pmcid: 4484387 doi: 10.1101/gr.186072.114
Orakov A, et al. GUNC: detection of chimerism and contamination in prokaryotic genomes. Genome Biol. 2021;22:178.
pubmed: 34120611 pmcid: 8201837 doi: 10.1186/s13059-021-02393-0
Bonner WA, Hulett HR, Sweet RG, Herzenberg LA. Fluorescence activated cell sorting. Rev Sci Instrum. 2003;43:404–9.
doi: 10.1063/1.1685647
Nuccio EE, et al. HT-SIP: a semi-automated stable isotope probing pipeline identifies cross-kingdom interactions in the hyphosphere of arbuscular mycorrhizal fungi. Microbiome. 2022;10:199.
pubmed: 36434737 pmcid: 9700909 doi: 10.1186/s40168-022-01391-z
Priest SJ, Yadav V, Heitman. J. Advances in understanding the evolution of fungal genome architecture. F1000Research 9, F1000 Faculty Rev-776 (2020).
Peng X, et al. Genomic and functional analyses of fungal and bacterial consortia that enable lignocellulose breakdown in goat gut microbiomes. Nat Microbiol. 2021;6:499–511.
pubmed: 33526884 pmcid: 8007473 doi: 10.1038/s41564-020-00861-0
Singh NK, et al. Characterization of metagenome-assembled genomes from the International Space Station. Microbiome. 2023;11:125.
pubmed: 37264385 pmcid: 10233975 doi: 10.1186/s40168-023-01545-7
Stajich JE, et al. FungiDB: an integrated functional genomics database for fungi. Nucleic Acids Res. 2012;40:D675–81.
pubmed: 22064857 doi: 10.1093/nar/gkr918
Wilkinson MD, et al. The FAIR Guiding principles for scientific data management and stewardship. Sci Data. 2016;3:160018.
pubmed: 26978244 pmcid: 4792175 doi: 10.1038/sdata.2016.18
Yilmaz P, et al. Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications. Nat Biotechnol. 2011;29:415–20.
pubmed: 21552244 pmcid: 3367316 doi: 10.1038/nbt.1823
Zeng S, et al. A compendium of 32,277 metagenome-assembled genomes and over 80 million genes from the early-life human gut microbiome. Nat Commun. 2022;13:5139.
pubmed: 36050292 pmcid: 9437082 doi: 10.1038/s41467-022-32805-z
Qin J, et al. A human gut microbial gene catalog established by metagenomic sequencing. Nature. 2010;464:59–65.
pubmed: 20203603 pmcid: 3779803 doi: 10.1038/nature08821
Molecular Observation Network (MONet). | Environmental Molecular Sciences Laboratory. https://www.emsl.pnnl.gov/monet .
Arkin AP, et al. KBase: the United States department of energy systems biology knowledgebase. Nat Biotechnol. 2018;36:566–9.
pubmed: 29979655 pmcid: 6870991 doi: 10.1038/nbt.4163
The Galaxy Community. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2022 update. Nucleic Acids Res. 2022;50:W345–51.
doi: 10.1093/nar/gkac247

Auteurs

Winston E Anthony (WE)

Pacific Northwest National Laboratory, Richland, WA, 99354, USA. winston.anthony@pnnl.gov.

Steven D Allison (SD)

University of California Irvine, Irvine, CA, USA.
Department of Earth System Science, University of California, Irvine, CA, USA.

Caitlin M Broderick (CM)

W.K. Kellogg Biological Station, Michigan State University, Hickory Corners, MI, USA.

Luciana Chavez Rodriguez (L)

University of California Irvine, Irvine, CA, USA.

Alicia Clum (A)

Lawrence Berkeley National Laboratory, Berkeley, CA, USA.

Hugh Cross (H)

National Ecological Observatory Network - Battelle, Boulder, CO, USA.

Emiley Eloe-Fadrosh (E)

Lawrence Berkeley National Laboratory, Berkeley, CA, USA.

Sarah Evans (S)

W.K. Kellogg Biological Station, Michigan State University, Hickory Corners, MI, USA.

Dawson Fairbanks (D)

University of California Riverside, Riverside, CA, USA.
The University of Arizona, Tucson, AZ, USA.

Rachel Gallery (R)

The University of Arizona, Tucson, AZ, USA.

Júlia Brandão Gontijo (JB)

University of California Davis, Davis, CA, USA.

Jennifer Jones (J)

W.K. Kellogg Biological Station, Michigan State University, Hickory Corners, MI, USA.

Jason McDermott (J)

Pacific Northwest National Laboratory, Richland, WA, 99354, USA.

Jennifer Pett-Ridge (J)

Lawrence Livermore National Laboratory, Livermore, CA, USA.
Life & Environmental Sciences Department, University of California Merced, Merced, CA, 95343, USA.

Sydne Record (S)

University of Maine, Orono, ME, USA.

Jorge Luiz Mazza Rodrigues (JLM)

Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
University of California Davis, Davis, CA, USA.

William Rodriguez-Reillo (W)

Harvard Medical School, Boston, MA, USA.

Katherine L Shek (KL)

University of New Hampshire, Durham, NH, USA.

Tina Takacs-Vesbach (T)

University of New Mexico, Albuquerque, NM, USA.

Jeffrey L Blanchard (JL)

University of Massachusetts Amherst, Amherst, MA, USA. jlb@umass.edu.

Classifications MeSH