METAnnotatorX2: a Comprehensive Tool for Deep and Shallow Metagenomic Data Set Analyses.

deep functional profiling metagenomics shallow taxonomy

Journal

mSystems
ISSN: 2379-5077
Titre abrégé: mSystems
Pays: United States
ID NLM: 101680636

Informations de publication

Date de publication:
29 Jun 2021
Historique:
entrez: 29 6 2021
pubmed: 30 6 2021
medline: 30 6 2021
Statut: aheadofprint

Résumé

The use of bioinformatic tools for read-based taxonomic and functional analyses of metagenomic data sets, including their assembly and management, is rather fragmentary due to the absence of an accepted gold standard. Moreover, most currently available software tools need input of millions of reads and rely on approximations in data analysis in order to reduce computing times. These issues result in suboptimal results in terms of accuracy, sensitivity, and specificity when used either for the reconstruction of taxonomic or functional profiles through read analysis or analysis of genomes reconstructed by metagenomic assembly. Moreover, the recent introduction of novel DNA sequencing technologies that generate long reads, such as Nanopore and PacBio, represent a valuable data resource that still suffers from a lack of dedicated tools to perform integrated hybrid analysis alongside short read data. In order to overcome these limitations, here we describe a comprehensive bioinformatic platform, METAnnotatorX2, aimed at providing an optimized user-friendly resource which maximizes output quality, while also allowing user-specific adaptation of the pipeline and straightforward integrated analysis of both short and long read data. To further improve performance quality and accuracy of taxonomic assignment of reads and contigs, custom preprocessed and taxonomically revised genomic databases for viruses, prokaryotes, and various eukaryotes were developed. The performance of METAnnotatorX2 was tested by analysis of artificial data sets encompassing viral, archaeal, bacterial, and eukaryotic (fungal) sequence reads that simulate different biological matrices. Moreover, real biological samples were employed to validate

Identifiants

pubmed: 34184911
doi: 10.1128/mSystems.00583-21
pmc: PMC8269244
doi:

Types de publication

Journal Article

Langues

eng

Pagination

e0058321

Références

Front Public Health. 2018 Aug 30;6:235
pubmed: 30214898
Bioinformatics. 2012 Feb 15;28(4):464-9
pubmed: 22199388
BMC Bioinformatics. 2008 Sep 19;9:386
pubmed: 18803844
BMC Genomics. 2017 Jul 10;18(1):521
pubmed: 28693474
Microbiol Mol Biol Rev. 2017 Nov 8;81(4):
pubmed: 29118049
Nucleic Acids Res. 2012 Jan;40(Database issue):D742-53
pubmed: 22102576
FEMS Microbiol Lett. 2016 Apr;363(7):
pubmed: 26936607
Nucleic Acids Res. 2016 Jan 4;44(D1):D471-80
pubmed: 26527732
Front Microbiol. 2019 Feb 21;10:243
pubmed: 30967843
ISME J. 2018 Jun;12(6):1520-1531
pubmed: 29588495
Yale J Biol Med. 2016 Sep 30;89(3):353-362
pubmed: 27698619
Nat Biotechnol. 2019 Aug;37(8):937-944
pubmed: 31359005
Bioinformatics. 2016 Mar 15;32(6):929-31
pubmed: 26576653
Proc Natl Acad Sci U S A. 2009 Nov 10;106(45):19126-31
pubmed: 19855009
Environ Microbiol Rep. 2019 Dec;11(6):840-847
pubmed: 31668006
Nat Methods. 2012 Jun 10;9(8):811-4
pubmed: 22688413
Brief Bioinform. 2020 Mar 23;21(2):584-594
pubmed: 30815668
Elife. 2021 May 04;10:
pubmed: 33944776
Genome Res. 2020 Mar;30(3):315-333
pubmed: 32188701
Nat Commun. 2018 Nov 30;9(1):5114
pubmed: 30504855
Appl Environ Microbiol. 2009 Dec;75(23):7537-41
pubmed: 19801464
mSystems. 2018 Nov 13;3(6):
pubmed: 30443602
Front Microbiol. 2017 May 10;8:827
pubmed: 28539920
Nat Rev Microbiol. 2017 Oct;15(10):579-590
pubmed: 28824177
Cell. 2019 Aug 8;178(4):779-794
pubmed: 31398336
Nucleic Acids Res. 2018 Jan 4;46(D1):D633-D639
pubmed: 29059334
PLoS One. 2020 Feb 13;15(2):e0228899
pubmed: 32053657
Microbiome. 2018 Aug 20;6(1):145
pubmed: 30126456
Int J Syst Evol Microbiol. 2018 Jul;68(7):2386-2392
pubmed: 29792589
Genome Biol. 2014 Mar 03;15(3):R46
pubmed: 24580807
Microbiome. 2019 Feb 26;7(1):31
pubmed: 30808411
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
Philos Trans R Soc Lond B Biol Sci. 2006 Nov 29;361(1475):1929-40
pubmed: 17062412
Genome Biol. 2019 Nov 28;20(1):257
pubmed: 31779668
Brief Bioinform. 2019 Jul 19;20(4):1125-1136
pubmed: 29028872
Nat Biotechnol. 2019 Aug;37(8):852-857
pubmed: 31341288
Glycobiology. 2018 Dec 1;28(1):3-8
pubmed: 29040563
Nucleic Acids Res. 2015 Sep 18;43(16):7762-8
pubmed: 26250111
Nucleic Acids Res. 2014 Jan;42(Database issue):D490-5
pubmed: 24270786

Auteurs

Christian Milani (C)

Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental Sustainability, University of Parmagrid.10383.39, Parma, Italy.
Microbiome Research Hub, University of Parmagrid.10383.39, Parma, Italy.

Gabriele Andrea Lugli (GA)

Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental Sustainability, University of Parmagrid.10383.39, Parma, Italy.

Federico Fontana (F)

Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental Sustainability, University of Parmagrid.10383.39, Parma, Italy.
GenProbio srl, Parma, Italy.

Leonardo Mancabelli (L)

Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental Sustainability, University of Parmagrid.10383.39, Parma, Italy.

Giulia Alessandri (G)

Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental Sustainability, University of Parmagrid.10383.39, Parma, Italy.

Giulia Longhi (G)

Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental Sustainability, University of Parmagrid.10383.39, Parma, Italy.
GenProbio srl, Parma, Italy.

Rosaria Anzalone (R)

GenProbio srl, Parma, Italy.

Alice Viappiani (A)

GenProbio srl, Parma, Italy.

Francesca Turroni (F)

Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental Sustainability, University of Parmagrid.10383.39, Parma, Italy.
Microbiome Research Hub, University of Parmagrid.10383.39, Parma, Italy.

Douwe van Sinderen (D)

APC Microbiome Ireland and School of Microbiology, Bioscience Institute, National University of Irelandgrid.9344.a, Cork, Ireland.

Marco Ventura (M)

Laboratory of Probiogenomics, Department of Chemistry, Life Sciences, and Environmental Sustainability, University of Parmagrid.10383.39, Parma, Italy.
Microbiome Research Hub, University of Parmagrid.10383.39, Parma, Italy.

Classifications MeSH