Metagenomic analysis of soybean endosphere microbiome to reveal signatures of microbes for health and disease.

Endospheric mg-rast Function Metagenomics Omicbox Taxonomy

Journal

Journal, genetic engineering & biotechnology
ISSN: 2090-5920
Titre abrégé: J Genet Eng Biotechnol
Pays: Netherlands
ID NLM: 101317150

Informations de publication

Date de publication:
16 Aug 2023
Historique:
received: 02 02 2023
accepted: 27 07 2023
medline: 16 8 2023
pubmed: 16 8 2023
entrez: 16 8 2023
Statut: epublish

Résumé

Soil metagenomics is a cultivation-independent molecular strategy for investigating and exploiting the diversity of soil microbial communities. Soil microbial diversity is essential because it is critical to sustaining soil health for agricultural productivity and protection against harmful organisms. This study aimed to perform a metagenomic analysis of the soybean endosphere (all microbial communities found in plant leaves) to reveal signatures of microbes for health and disease. The dataset is based on the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) release "microbial diversity in soybean". The quality control process rejected 21 of the evaluated sequences (0.03% of the total sequences). Dereplication determined that 68,994 sequences were artificial duplicate readings, and removed them from consideration. Ribosomal Ribonucleic acid (RNA) genes were present in 72,747 sequences that successfully passed quality control (QC). Finally, we found that hierarchical classification for taxonomic assignment was conducted using MG-RAST, and the considered dataset of the metagenome domain of bacteria (99.68%) dominated the other groups. In Eukaryotes (0.31%) and unclassified sequence 2 (0.00%) in the taxonomic classification of bacteria in the genus group, Streptomyces, Chryseobacterium, Ppaenibacillus, Bacillus, and Mitsuaria were found. We also found some biological pathways, such as CMP-KDO biosynthesis II (from D-arabinose 5-phosphate), tricarboxylic acid cycle (TCA) cycle (plant), citrate cycle (TCA cycle), fatty acid biosynthesis, and glyoxylate and dicarboxylate metabolism. Gene prediction uncovered 1,180 sequences, 15,172 of which included gene products, with the shortest sequence being 131 bases and maximum length 3829 base pairs. The gene list was additionally annotated using Integrated Microbial Genomes and Microbiomes. The annotation process yielded a total of 240 genes found in 177 bacterial strains. These gene products were found in the genome of strain 7598. Large volumes of data are generated using modern sequencing technology to sample all genes in all species present in a given complex sample. These data revealed that it is a rich source of potential biomarkers for soybean plants. The results of this study will help us to understand the role of the endosphere microbiome in plant health and identify the microbial signatures of health and disease. The MG-RAST is a public resource for the automated phylogenetic and functional study of metagenomes. This is a powerful tool for investigating the diversity and function of microbial communities.

Sections du résumé

BACKGROUND BACKGROUND
Soil metagenomics is a cultivation-independent molecular strategy for investigating and exploiting the diversity of soil microbial communities. Soil microbial diversity is essential because it is critical to sustaining soil health for agricultural productivity and protection against harmful organisms. This study aimed to perform a metagenomic analysis of the soybean endosphere (all microbial communities found in plant leaves) to reveal signatures of microbes for health and disease.
RESULTS RESULTS
The dataset is based on the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) release "microbial diversity in soybean". The quality control process rejected 21 of the evaluated sequences (0.03% of the total sequences). Dereplication determined that 68,994 sequences were artificial duplicate readings, and removed them from consideration. Ribosomal Ribonucleic acid (RNA) genes were present in 72,747 sequences that successfully passed quality control (QC). Finally, we found that hierarchical classification for taxonomic assignment was conducted using MG-RAST, and the considered dataset of the metagenome domain of bacteria (99.68%) dominated the other groups. In Eukaryotes (0.31%) and unclassified sequence 2 (0.00%) in the taxonomic classification of bacteria in the genus group, Streptomyces, Chryseobacterium, Ppaenibacillus, Bacillus, and Mitsuaria were found. We also found some biological pathways, such as CMP-KDO biosynthesis II (from D-arabinose 5-phosphate), tricarboxylic acid cycle (TCA) cycle (plant), citrate cycle (TCA cycle), fatty acid biosynthesis, and glyoxylate and dicarboxylate metabolism. Gene prediction uncovered 1,180 sequences, 15,172 of which included gene products, with the shortest sequence being 131 bases and maximum length 3829 base pairs. The gene list was additionally annotated using Integrated Microbial Genomes and Microbiomes. The annotation process yielded a total of 240 genes found in 177 bacterial strains. These gene products were found in the genome of strain 7598. Large volumes of data are generated using modern sequencing technology to sample all genes in all species present in a given complex sample.
CONCLUSIONS CONCLUSIONS
These data revealed that it is a rich source of potential biomarkers for soybean plants. The results of this study will help us to understand the role of the endosphere microbiome in plant health and identify the microbial signatures of health and disease. The MG-RAST is a public resource for the automated phylogenetic and functional study of metagenomes. This is a powerful tool for investigating the diversity and function of microbial communities.

Identifiants

pubmed: 37584775
doi: 10.1186/s43141-023-00535-4
pii: 10.1186/s43141-023-00535-4
pmc: PMC10429481
doi:

Types de publication

Journal Article

Langues

eng

Pagination

84

Informations de copyright

© 2023. Academy of Scientific Research and Technology.

Références

Arch Med Res. 2002 Sep-Oct;33(5):439-47
pubmed: 12459313
Int J Mol Sci. 2022 Oct 10;23(19):
pubmed: 36233333
Mol Cell. 2003 Aug;12(2):321-8
pubmed: 14536072
Front Microbiol. 2021 Feb 11;12:632280
pubmed: 33643263
FEMS Microbiol Rev. 2013 Sep;37(5):634-63
pubmed: 23790204
Nat Commun. 2017 Aug 9;8(1):215
pubmed: 28790312
Front Microbiol. 2019 Aug 02;10:1744
pubmed: 31428064
Front Microbiol. 2018 Feb 22;9:244
pubmed: 29520256
Front Microbiol. 2014 Sep 17;5:491
pubmed: 25278934
J Bacteriol. 2000 Nov;182(22):6499-502
pubmed: 11053396
Chem Biol. 1998 Oct;5(10):R245-9
pubmed: 9818143
Cell Mol Life Sci. 2017 Sep;74(18):3293-3303
pubmed: 28409190
Genome Biol. 2009;10(3):R25
pubmed: 19261174
Scand J Clin Lab Invest. 2000 Nov;60(7):509-40
pubmed: 11202048
Curr Biol. 2019 Aug 5;29(15):R724-R725
pubmed: 31386843
Int J Mol Sci. 2018 Mar 22;19(4):
pubmed: 29565834
Microb Cell Fact. 2005 Mar 25;4(1):8
pubmed: 15790425
Microbiol Resour Announc. 2020 Oct 8;9(41):
pubmed: 33033124
Nat Biotechnol. 2012 Oct;30(10):961-2
pubmed: 23051815
Braz J Microbiol. 2020 Dec;51(4):1861-1871
pubmed: 32529561
Prog Lipid Res. 2010 Apr;49(2):97-107
pubmed: 19815028
J Bacteriol. 2017 Aug 8;199(17):
pubmed: 28630128
Appl Environ Microbiol. 2009 Feb;75(4):915-24
pubmed: 19088312
Nucleic Acids Res. 2017 Jan 4;45(D1):D507-D516
pubmed: 27738135
J Biol Chem. 2018 May 18;293(20):7592-7605
pubmed: 29535185
Nucleic Acids Res. 2023 Jan 6;51(D1):D723-D732
pubmed: 36382399
Infect Genet Evol. 2021 Sep;93:104928
pubmed: 34022437
Nucleic Acids Res. 1996 Jan 1;24(1):21-5
pubmed: 8594581
Front Plant Sci. 2016 Dec 12;7:1879
pubmed: 28018410
Microbiome. 2018 Mar 27;6(1):58
pubmed: 29587885
Nucleic Acids Res. 2016 Jan 4;44(D1):D590-4
pubmed: 26656948
Appl Environ Microbiol. 2011 Sep;77(17):5934-44
pubmed: 21764952
Annu Rev Biochem. 2020 Jun 20;89:605-636
pubmed: 32569521
Ann Microbiol. 2021;71(1):30
pubmed: 34305497
Nucleic Acids Res. 2016 Jan 4;44(D1):D733-45
pubmed: 26553804
Microb Cell Fact. 2016 Dec 1;15(1):203
pubmed: 27905924
Curr Opin Chem Biol. 2008 Apr;12(2):118-25
pubmed: 18314013
Int J Mol Sci. 2023 Mar 09;24(6):
pubmed: 36982348
FEMS Microbiol Rev. 2021 Aug 17;45(4):
pubmed: 33428728
Compr Physiol. 2013 Jan;3(1):365-401
pubmed: 23720291

Auteurs

Usha Chouhan (U)

Department of Mathematics, Bioinformatics & Computer Applications, Maulana Azad National Institute of Technology, Bhopal, 462051, MP, India.

Umesh Gamad (U)

School of Biotechnology, Devi Ahilya Vishwavidyalaya, Indore, MP, 452001, India.

Jyoti Kant Choudhari (JK)

Department of Mathematics, Bioinformatics & Computer Applications, Maulana Azad National Institute of Technology, Bhopal, 462051, MP, India. jtchoudhary27@gmail.com.

Classifications MeSH