Music of metagenomics-a review of its applications, analysis pipeline, and associated tools.

Afiahayati, Sato K, Sakakibara Y (2015) MetaVelvet-SL: an extension of the Velvet assembler to a de novo metagenomic assembler utilizing supervised learning. DNA Res 22:69-77. https://doi.org/10.1093/dnares/dsu041

Aggio RBM, Ruggiero K, Villas-Bôas SG (2010) Pathway Activity Profiling (PAPi): from the metabolite profile to the metabolic pathway activity. Bioinformatics 26:2969–2976. https://doi.org/10.1093/bioinformatics/btq567

doi: 10.1093/bioinformatics/btq567 pubmed: 20929912

Ainsworth D, Sternberg MJE, Raczy C, Butcher SA (2017) k-SLAM: accurate and ultra-fast taxonomic classification and gene identification for large metagenomic data sets. Nucleic Acids Res 45:1649–1656. https://doi.org/10.1093/nar/gkw1248

doi: 10.1093/nar/gkw1248 pubmed: 27965413

Alic AS, Blanquer I (2016) MuffinInfo: HTML5-Based Statistics Extractor from Next-Generation Sequencing Data. J Comput Biol 23:750–755. https://doi.org/10.1089/cmb.2016.0031

doi: 10.1089/cmb.2016.0031 pubmed: 27606794

Alkhateeb A, Rueda L (2017) Zseq: an approach for preprocessing next-generation sequencing data. J Comput Biol 24:746–755. https://doi.org/10.1089/cmb.2017.0021

doi: 10.1089/cmb.2017.0021 pubmed: 28414515 pmcid: 5563921

Alneberg J, Bjarnason BS, de Bruijn I, Schirmer M, Quick J, Ijaz UZ, Lahti L, Loman NJ, Andersson AF, Quince C (2014) Binning metagenomic contigs by coverage and composition. Nat Methods 11:1144–1146. https://doi.org/10.1038/nmeth.3103

doi: 10.1038/nmeth.3103 pubmed: 25218180

Alonso A, Lasseigne BN, Williams K, Nielsen J, Ramaker RC, Hardigan AA, Johnston B, Roberts BS, Cooper SJ, Marsal S, Myers RM (2017) aRNApipe: a balanced, efficient and distributed pipeline for processing RNA-seq data in high-performance computing environments. Bioinformatics 33:1727–1729. https://doi.org/10.1093/bioinformatics/btx023

doi: 10.1093/bioinformatics/btx023 pubmed: 28108448 pmcid: 5447234

Alshawaqfeh M, Bashaireh A, Serpedin E, Suchodolski J (2017a) Reliable Biomarker discovery from Metagenomic data via RegLRSD algorithm. BMC Bioinformatics 18:328. https://doi.org/10.1186/s12859-017-1738-1

doi: 10.1186/s12859-017-1738-1 pubmed: 28693478 pmcid: 5504766

AlShawaqfeh M, Wajid B, Minamoto Y, Markel M, Lidbury J, Steiner J, Serpedin E, Suchodolski J (2017b) A dysbiosis index to assess microbial changes in fecal samples of dogs with chronic inflammatory enteropathy. J FEMS Microbiol Ecol 93:fix136. https://doi.org/10.1093/femsec/fix136

doi: 10.1093/femsec/fix136

Alshawaqfeh M, Gharaibeh A, Wajid B (2019) A Hybrid Feature Selection Method for Classifying Metagenomic Data in Relation to Inflammatory Bowel DiseaseICAAI 2019: Proceedings of the 2019 3rd International Conference on Advances in Artificial Intelligence 86–89. 10.1145/ 3369114.3371675

Ames SK, Hysom DA, Gardner SN, Lloyd GS, Gokhale MB, Allen JE (2013) Scalable metagenomic taxonomy classification using a reference genome database. Bioinformatics 29:2253–2260. https://doi.org/10.1093/bioinformatics/btt389

doi: 10.1093/bioinformatics/btt389 pubmed: 23828782 pmcid: 3753567

Anand G, Zarrinpar A, Loomba R (2016) Targeting dysbiosis for the treatment of liver disease. Semin Liver Dis 36:37–47. https://doi.org/10.1055/s-0035-1571276

doi: 10.1055/s-0035-1571276 pubmed: 26870931 pmcid: 6658186

Anders S, Pyl PT, Huber W (2015) HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31:166–169. https://doi.org/10.1093/bioinformatics/btu638

doi: 10.1093/bioinformatics/btu638

Andreas B, McHardy AC (2018) Critical assessment of metagenome interpretation enters the second round. mSystems 3:e00103-e118. https://doi.org/10.1128/mSystems.00103-18

doi: 10.1128/mSystems.00103-18

Andrés-León E, Núñez-Torres R, Rojas AM (2016) miARma-Seq: a comprehensive tool for miRNA, mRNA and circRNA analysis. Sci Rep 6:1–8. https://doi.org/10.1038/srep25749

doi: 10.1038/srep25749

Andrews S (2010) FastQC: a quality control tool for high throughput sequence data. https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ . Accessed 13 Aug 2021

Arango-Argoty G, Singh G, Heath LS, Pruden A, Xiao W, Zhang L (2016) MetaStorm: A Public Resource for Customizable Metagenomics Annotation. PLoS ONE 11:e0162442. https://doi.org/10.1371/journal.pone.0162442

doi: 10.1371/journal.pone.0162442 pubmed: 27632579 pmcid: 5025195

Asnicar F, Weingart G, Tickle TL, Huttenhower C, Segata N (2015) Compact graphical representation of phylogenetic data and metadata with GraPhlAn. PeerJ 3:e1029. https://doi.org/10.7717/peerj.1029

doi: 10.7717/peerj.1029 pubmed: 26157614 pmcid: 4476132

Attwood TK, Coletta A, Muirhead G, Pavlopoulou A, Philippou PB et al (2012) The PRINTS database: a fine- grained protein sequence annotation and analysis resource—its status in 2012. Database 2012: bas019. https://doi.org/10.1093/database/bas019

Ayyala DN, Lin S (2015) GrammR: graphical representation and modeling of count data with application in metagenomics. Bioinformatics 31:1648–1654. https://doi.org/10.1093/bioinformatics/btv032

doi: 10.1093/bioinformatics/btv032 pubmed: 25609792

Bacci G, Bazzicalupo M, Benedetti A, Mengoni A (2014) StreamingTrim 1.0: a Java software for dynamic trimming of 16S rRNA sequence data from metagenetic studies. Mol Ecol Resour 14:426–434. https://doi.org/10.1111/1755-0998.12187

doi: 10.1111/1755-0998.12187 pubmed: 24128146

Banerjee J, Mishra N, Dhas YJMg (2015) Metagenomics: A new horizon in cancer research. J Meta Gene 5:84–89. https://doi.org/10.1016/j.mgene.2015.05.005

doi: 10.1016/j.mgene.2015.05.005

Bankevich A, Nurk S, Antipov D et al (2012) SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. https://doi.org/10.1089/cmb.2012.0021

doi: 10.1089/cmb.2012.0021 pubmed: 22506599 pmcid: 3342519

Basler G, Nikoloski Z (2011) JMassBalance: mass-balanced randomization and analysis of metabolic networks. Bioinformatics 27:2761–2762. https://doi.org/10.1093/bioinformatics/btr448

doi: 10.1093/bioinformatics/btr448 pubmed: 21803803 pmcid: 3179655

Benoit G, Peterlongo P, Mariadassou M, Drezen E, Schbath S, Lavenier D, Lemaitre C (2016) Multiple comparative metagenomics using multiset k-mer counting. PeerJ Comput Sci 2:e94. https://doi.org/10.7717/peerj-cs.94

doi: 10.7717/peerj-cs.94

Berendzen J, Bruno WJ, Cohn JD, Hengartner NW, Kuske CR, McMahon BH, Wolinsky MA, Xie G (2012) Rapid phylogenetic and functional classification of short genomic fragments with signature peptides. BMC Res Notes 5:460. https://doi.org/10.1186/1756-0500-5-460

doi: 10.1186/1756-0500-5-460 pubmed: 22925230 pmcid: 3772700

Bergmann EA, Chen BJ, Arora K, Vacic V, Zody MC (2016) Conpair: concordance and contamination estimator for matched tumor-normal pairs. Bioinformatics 32:3196–3198. https://doi.org/10.1093/bioinformatics/btw389

doi: 10.1093/bioinformatics/btw389 pubmed: 27354699 pmcid: 5048070

Berini F, Casciello C, Marcone GL, Marinelli F (2017) Metagenomics: novel enzymes from non-culturable microbes. FEMS Microbiol Lett 364:fnx211. https://doi.org/10.1093/femsle/fnx211

doi: 10.1093/femsle/fnx211

Bertrand D, Shaw J, Kalathiyappan M et al (2019) Hybrid metagenomic assembly enables high-resolution analysis of resistance determinants and mobile elements in human microbiomes. Nat Biotechnol 37:937–944. https://doi.org/10.1038/s41587-019-0191-2

doi: 10.1038/s41587-019-0191-2 pubmed: 31359005

Blin K, Pascal Andreu V, de los Santos ELC, Del Carratore F, Lee SY, Medema MH, Weber T (2019a) The antiSMASH database version 2: a comprehensive resource on secondary metabolite biosynthetic gene clusters. Nucleic Acids Res 47:D625-D630. https://doi.org/10.1093/nar/gky1060

Blin K, Shaw S, Steinke K, Villebro R, Ziemert N, Lee SY, Medema MH, Weber T (2019b) antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline. Nucleic Acids Res 47:W81–W87. https://doi.org/10.1093/nar/gkz310

doi: 10.1093/nar/gkz310 pubmed: 31032519 pmcid: 6602434

Blom J, Kreis J, Spanig S, Juhre T et al (2016) EDGAR 2.0: an enhanced software platform for comparative gene content analyses. Nucleic Acids Res 44:W22–W28. https://doi.org/10.1093/nar/gkw255

doi: 10.1093/nar/gkw255 pubmed: 27098043 pmcid: 4987874

Boisvert S, Raymond F, Godzaridis É, Laviolette F, Corbeil J (2012) Ray Meta: scalable de novo metagenome assembly and profiling. Genome Biol 13:R122. https://doi.org/10.1186/gb-2012-13-12-r122

doi: 10.1186/gb-2012-13-12-r122 pubmed: 23259615 pmcid: 4056372

Booth SC, Weljie AM, Turner RJ (2013) Comput Struct Biotechnol J 4:e201301003. https://doi.org/10.5936/csbj.201301003

doi: 10.5936/csbj.201301003 pubmed: 24688685 pmcid: 3962093

Borozan I, Ferretti V (2016) CSSSCL: a python package that uses combined sequence similarity scores for accurate taxonomic classification of long and short sequence reads. Bioinformatics 32:453–455. https://doi.org/10.1093/bioinformatics/btv587

doi: 10.1093/bioinformatics/btv587 pubmed: 26454281

Brandt BW, Bonder MJ, Huse SM, Zaura E (2012) TaxMan: a server to trim rRNA reference databases and inspect taxonomic coverage. Nucleic Acids Res 40:W82–W87. https://doi.org/10.1093/nar/gks418

doi: 10.1093/nar/gks418 pubmed: 22618877 pmcid: 3394339

Brown J, Pirrung M, McCue LA (2017) FQC Dashboard: integrates FastQC results into a web-based, interactive, and extensible FASTQ quality control tool. Bioinformatics 33:3137–3139. https://doi.org/10.1093/bioinformatics/btx373

doi: 10.1093/bioinformatics/btx373 pubmed: 28605449 pmcid: 5870778

Buchfink B, Xie C, Huson DH (2015) Fast and sensitive protein alignment using DIAMOND. Nat Methods 12:59–60. https://doi.org/10.1038/nmeth.3176

doi: 10.1038/nmeth.3176 pubmed: 25402007 pmcid: 25402007

Bushnell B (2014) BBMap: A Fast, Accurate, Splice-Aware Aligner. Berkeley: Lawrence Berkeley National Lab. (LBNL)

Cabanski CR, Cavin K, Bizon C, Wilkerson MD, Parker JS, Wilhelmsen KC, Perou CM, Marron J, Hayes DN (2012) ReQON: a Bioconductor package for recalibrating quality scores from next-generation sequencing data. BMC Bioinformatics 13:1–10. https://doi.org/10.1186/1471-2105-13-221

doi: 10.1186/1471-2105-13-221

Caboche S, Even G, Loywick A, Audebert C, Hot D (2017) MICRA: an automatic pipeline for fast characterization of microbial genomes from high-throughput sequencing data. Genome Biol 18:233. https://doi.org/10.1186/s13059-017-1367-z

doi: 10.1186/s13059-017-1367-z pubmed: 29258574 pmcid: 5738152

Cantor M, Nordberg H, Smirnova T, Hess M, Tringe S, Dubchak I (2015) Elviz – exploration of metagenome assemblies with an interactive visualization tool. BMC Bioinf 16:130. https://doi.org/10.1186/s12859-015-0566-4

doi: 10.1186/s12859-015-0566-4

Cao R, Freitas C, Chan L, Sun M, Jiang H, Chen Z (2017) ProLanGO: Protein Function Prediction Using Neural Machine Translation Based on a Recurrent Neural Network. Molecules 22. https://doi.org/10.3390/molecules22101732

Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD et al (2010) QIIME allows analysis of high-throughput community sequencing data. Nat Methods 7:335–336. https://doi.org/10.1038/nmeth.f.303

doi: 10.1038/nmeth.f.303 pubmed: 20383131 pmcid: 20383131

Caspi R, Altman T, Dale JM, Dreher K et al (2010) The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res 38:D473–D479. https://doi.org/10.1093/nar/gkp875

doi: 10.1093/nar/gkp875 pubmed: 19850718

Cédric Cabau FE, Djari A, Guiguen Y, Bobe J, Klopp C (2017) Compacting and correcting Trinity and Oases RNA-Seq de novo assemblies. PeerJ 5:e2988. https://doi.org/10.7717/peerj.2988

doi: 10.7717/peerj.2988 pubmed: 28224052 pmcid: 5316280

Cepeda V, Liu B, Almeida M, Hill CM, Koren S, Treangen TJ, Pop M (2017) MetaCompass: reference-guided assembly of metagenomes. BioRxiv 212506. https://doi.org/10.1101/212506

Chaves I, Costa BV, Rodrigues AS, Bohn A, Miguel CM (2017) mi RP ursuit—a pipeline for automated analyses of small RNA s in model and nonmodel plants. FEBS Lett 591:2261–2268. https://doi.org/10.1002/1873-3468.12746

doi: 10.1002/1873-3468.12746 pubmed: 28686301

Chen S, Huang T, Zhou Y, Han Y, Xu M, Gu J (2017) AfterQC: automatic filtering, trimming, error removing and quality control for fastq data. BMC Bioinformatics 18:91–100. https://doi.org/10.1186/s12859-017-1469-3

doi: 10.1186/s12859-017-1469-3

Chen Y, Chen Y, Shi C, Huang Z, Zhang Y, Li S, Li Y, Ye J, Yu C, Li Z (2018) SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience 7:gix120. https://doi.org/10.1093/gigascience/gix120

doi: 10.1093/gigascience/gix120

Chiara M, Gioiosa S, Chillemi G, D’Antonio M, Flati T et al (2018a) CoVaCS: a consensus variant calling system. BMC Genomics 19:1–9. https://doi.org/10.1186/s12864-018-4508-1

doi: 10.1186/s12864-018-4508-1

Chiara M, Placido A, Picardi E, Ceci LR, Horner DS, Pesole G (2018b) A-GAME: improving the assembly of pooled functional metagenomics sequence data. BMC Genomics 19:1–10. https://doi.org/10.1186/s12864-017-4369-z

doi: 10.1186/s12864-017-4369-z

Chikhi R, Rizk G (2013) Space-efficient and exact de Bruijn graph representation based on a Bloom filter. Algorithms Mol Biol 8:22. https://doi.org/10.1186/1748-7188-8-22

doi: 10.1186/1748-7188-8-22 pubmed: 24040893 pmcid: 3848682

Chiu CY, Miller SA (2019) Clinical metagenomics. Nat Rev Genet 20:341–355. https://doi.org/10.1038/s41576-019-0113-7

doi: 10.1038/s41576-019-0113-7 pubmed: 30918369 pmcid: 6858796

Choi K, Smith LP, Medley JK, Sauro HM (2016) phraSED-ML: A paraphrased, human-readable adaptation of SED-ML. J Bioinform Comput Biol 14:1650035. https://doi.org/10.1142/s0219720016500359

doi: 10.1142/s0219720016500359 pubmed: 27774871 pmcid: 5313123

Chu J, Sadeghi S, Raymond A, Jackman SD, Nip KM, Mar R, Mohamadi H, Butterfield YS, Robertson AG, Birol I (2014) BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters. Bioinformatics 30:3402–3404. https://doi.org/10.1093/bioinformatics/btu558

doi: 10.1093/bioinformatics/btu558 pubmed: 25143290 pmcid: 4816029

Cibulskis K, McKenna A, Fennell T, Banks E, DePristo M, Getz G (2011) ContEst: estimating cross-contamination of human samples in next-generation sequencing data. Bioinformatics 27:2601–2602. https://doi.org/10.1093/bioinformatics/btr446

doi: 10.1093/bioinformatics/btr446 pubmed: 21803805 pmcid: 3167057

Clark K, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW (2016) GenBank. Nucleic Acids Res 44:D67–D72. https://doi.org/10.1093/nar/gkv1276

doi: 10.1093/nar/gkv1276 pubmed: 26590407

Clark SC, Egan R, Frazier PI, Wang Z (2013) ALE: a generic assembly likelihood evaluation framework for assessing the accuracy of genome and metagenome assemblies. Bioinformatics 29:435–443. https://doi.org/10.1093/bioinformatics/bts723

doi: 10.1093/bioinformatics/bts723 pubmed: 23303509

Clos-Garcia M, Garcia K, Alonso C et al (2020) Integrative Analysis of Fecal Metagenomics and Metabolomics in Colorectal Cancer. Cancers (basel) 12(5):1142. https://doi.org/10.3390/cancers12051142

doi: 10.3390/cancers12051142

Correia D, Doppelt-Azeroual O, Denis J-B, Vandenbogaert M, Caro V (2015) MetaGenSense: A web-application for analysis and exploration of high throughput sequencing metagenomic data. F1000Research 4:86. https://doi.org/10.12688/f1000research.6139.3

doi: 10.12688/f1000research.6139.3 pubmed: 28451381

Cole JR, Wang Q, Fish JA, Chai B, McGarrell DM et al (2014) Ribosomal Database Project: data and tools for high throughput rRNA analysis. Nucleic Acids Res 42:D633–D642. https://doi.org/10.1093/nar/gkt1244

doi: 10.1093/nar/gkt1244 pubmed: 24288368

Compeau PEC, Pevzner PA, Tesler G (2011) How to apply de Bruijn graphs to genome assembly. Nat Biotechnol 29:987–991. https://doi.org/10.1038/nbt.2023

doi: 10.1038/nbt.2023 pubmed: 22068540 pmcid: 5531759

Cox MP, Peterson DA, Biggs PJ (2010) SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinformatics 11:1–6. https://doi.org/10.1186/1471-2105-11-485

doi: 10.1186/1471-2105-11-485

Crispatzu G, Kulkarni P, Toliat MR, Nürnberg P, Herling M, Herling CD, Frommolt P (2017) Semi-automated cancer genome analysis using high-performance computing. Hum Mutat 38:1325–1335. https://doi.org/10.1002/humu.23275

doi: 10.1002/humu.23275 pubmed: 28598576

Cuccuru G, Orsini M, Pinna A, Sbardellati A, Soranzo N, Travaglione A, Uva P, Zanetti G, Fotia G (2014) Orione, a web-based framework for NGS analysis in microbiology. Bioinformatics 30:1928–1929. https://doi.org/10.1093/bioinformatics/btu135

doi: 10.1093/bioinformatics/btu135 pubmed: 24618473 pmcid: 4071203

D’Antonio M, D’Onorio De Meo P, Pallocca M, Picardi E, D’Erchia AM, Calogero RA, Castrignanò T, Pesole G (2015) RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application. BMC Genomics 16:S3. https://doi.org/10.1186/1471-2164-16-S6-S3

doi: 10.1186/1471-2164-16-S6-S3 pubmed: 26046471 pmcid: 4461013

Darling AE, Jospin G, Lowe E, Matsen FA IV, Bik HM, Eisen JA (2014) PhyloSift: phylogenetic analysis of genomes and metagenomes. PeerJ 2:e243. https://doi.org/10.7717/peerj.243

doi: 10.7717/peerj.243 pubmed: 24482762 pmcid: 3897386

Davenport CF, Neugebauer J, Beckmann N, Friedrich B et al (2012) Genometa - A Fast and Accurate Classifier for Short Metagenomic Shotgun Reads. PLoS ONE 7:e41224. https://doi.org/10.1371/journal.pone.0041224

doi: 10.1371/journal.pone.0041224 pubmed: 22927906 pmcid: 3424124

Davis MP, van Dongen S, Abreu-Goodger C, Bartonicek N, Enright AJ (2013) Kraken: a set of tools for quality control and analysis of high-throughput sequence data. Methods 63:41–49. https://doi.org/10.1016/j.ymeth.2013.06.027

doi: 10.1016/j.ymeth.2013.06.027 pubmed: 23816787 pmcid: 3991327

Davis NM, Proctor DM, Holmes SP, Relman DA, Callahan BJ (2018) Simple statistical identification and removal of contaminant sequences in marker-gene and metagenomics data. bioRxiv 221499. https://doi.org/10.1101/221499

De Anda V, Zapata-Peñasco I, Poot-Hernandez AC, Eguiarte LE, Contreras-Moreira B, Souza V (2017) MEBS, a software platform to evaluate large (meta)genomic collections according to their metabolic machinery: unraveling the sulfur cycle. GigaScience 6. https://doi.org/10.1093/gigascience/gix096

de Oliveira GLV, Leite AZ, Higuchi BS, Gonzaga MI, Mariano VS (2017) Intestinal dysbiosis and probiotic applications in autoimmune diseases. J Immunology 152:1–12. https://doi.org/10.1111/imm.12765

doi: 10.1111/imm.12765

DeSantis TZ, Hugenholtz P, Larsen N, Rojas M et al (2020) Greengenes, a Chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol 72:5069–5072. https://doi.org/10.1128/AEM.03006-05

doi: 10.1128/AEM.03006-05

Deutsch EW (2010) The PeptideAtlas Project. In: Hubbard S, Jones A (eds) Proteome Bioinformatics. Methods in Molecular Biology™ (Methods and Protocols), vol 604. Humana Press. 10.1007/978-1-60761-444-9_19

Dhawan A, Barberis A, Cheng W-C, Domingo E et al (2017) sigQC: A procedural approach for standardising the evaluation of gene signatures. bioRxiv 203729. https://doi.org/10.1101/203729

Ding X, Cheng F, Cao C, Sun X (2015) DectICO: an alignment-free supervised metagenomic classification method based on feature extraction and dynamic selection. BMC Bioinformatics 16:323. https://doi.org/10.1186/s12859-015-0753-3

doi: 10.1186/s12859-015-0753-3 pubmed: 26446672 pmcid: 4596415

Dong X, Kleiner M, Sharp CE, Thorson E, Li C, Liu D, Strous M (2017) Fast and Simple Analysis of MiSeq Amplicon Sequencing Data with MetaAmp. Front Microbiol 8:1461. https://doi.org/10.3389/fmicb.2017.01461

doi: 10.3389/fmicb.2017.01461 pubmed: 28824589 pmcid: 5540949

Douglas-Klotz N (2005) The Sufi book of life: 99 pathways of the heart for the modern dervish. Penguin

Drost H-G, Paszkowski J (2017) Biomartr: genomic data retrieval with R. Bioinformatics 33:1216–1217. https://doi.org/10.1093/bioinformatics/btw821

doi: 10.1093/bioinformatics/btw821 pubmed: 28110292 pmcid: 5408848

Dutilh BE, Schmieder R, Nulton J, Felts B, Salamon P, Edwards RA, Mokili JL (2012) Reference-independent comparative metagenomics using cross-assembly: crAss. Bioinformatics 28:3225–3231. https://doi.org/10.1093/bioinformatics/bts613

doi: 10.1093/bioinformatics/bts613 pubmed: 23074261 pmcid: 3519457

Edwards RA, Olson R, Disz T, Pusch GD, Vonstein V, Stevens R, Overbeek R (2012) Real Time Metagenomics: Using k-mers to annotate metagenomes. Bioinformatics 28:3316–3317. https://doi.org/10.1093/bioinformatics/bts599

doi: 10.1093/bioinformatics/bts599 pubmed: 23047562 pmcid: 3519453

Edgar RC (2013) UPARSE: highly accurate OTU sequences from microbial amplicon reads. Nat Methods 10:996–998. https://doi.org/10.1038/nmeth.2604

doi: 10.1038/nmeth.2604 pubmed: 23955772

Escudié F, Auer L, Bernard M, Mariadassou M, Cauquil L, Vidal K, Maman S, Hernandez-Raquet G, Combes S, Pascal G (2018) FROGS: Find, Rapidly, OTUs with Galaxy Solution. Bioinformatics 34:1287–1294. https://doi.org/10.1093/bioinformatics/btx791

doi: 10.1093/bioinformatics/btx791 pubmed: 29228191

Esfandyarpour H, Parizi KB, Barmi MR, Rategh H, Wang L et al (2019) High accuracy DNA sequencing on a small, scalable platform via electrical detection of single base incorporations. bioRxiv 604553. https://doi.org/10.1101/604553

Esling P, Lejzerowicz F, Pawlowski J (2015) Accurate multiplexing and filtering for high-throughput amplicon-sequencing. Nucleic Acids Res 43:2513–2524. https://doi.org/10.1093/nar/gkv107

doi: 10.1093/nar/gkv107 pubmed: 25690897 pmcid: 4357712

Ewels P, Magnusson M, Lundin S, Käller M (2016) MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32:3047–3048. https://doi.org/10.1093/bioinformatics/btw354

doi: 10.1093/bioinformatics/btw354 pubmed: 5039924 pmcid: 5039924

Fabregat A, Sidiropoulos K, Viteri G, Forner O, Marin-Garcia P et al (2017) Reactome pathway analysis: a high-performance in-memory approach. BMC Bioinformatics 18:142. https://doi.org/10.1186/s12859-017-1559-2

doi: 10.1186/s12859-017-1559-2 pubmed: 28249561 pmcid: 5333408

Fadrosh DW, Ma B, Gajer P, Sengamalay N, Ott S, Brotman RM, Ravel J (2014) An improved dual-indexing approach for multiplexed 16S rRNA gene sequencing on the Illumina MiSeq platform. Microbiome 2:6. https://doi.org/10.1186/2049-2618-2-6

doi: 10.1186/2049-2618-2-6 pubmed: 3940169 pmcid: 3940169

Fazekas D, Koltai M, Türei D, Módos D et al (2013) SignaLink 2 – a signaling pathway resource with multi-layered regulatory networks. BMC Syst Biol 7:7. https://doi.org/10.1186/1752-0509-7-7

doi: 10.1186/1752-0509-7-7 pubmed: 23331499 pmcid: 3599410

Fierst JL, Murdock DA (2017) Decontaminating eukaryotic genome assemblies with machine learning. BMC Bioinformatics 18:533. https://doi.org/10.1186/s12859-017-1941-0

doi: 10.1186/s12859-017-1941-0 pubmed: 29191179 pmcid: 5709863

Finn RD, Mistry J, Tate J, Coggill P, Heger A (2014) Pfam: the protein families database. Nucleic Acids Res. https://doi.org/10.1093/nar/gkt1223

Firtina C, Bar-Joseph Z, Alkan C, Cicek AE (2018) Hercules: a profile HMM-based hybrid error correction algorithm for long reads. Nucleic Acids Res 46:e125–e125. https://doi.org/10.1093/nar/gky724

doi: 10.1093/nar/gky724 pubmed: 30124947 pmcid: 6265270

Flygare S, Simmon K, Miller C, Qiao Y, Kennedy B et al (2016) Taxonomer: an interactive metagenomics analysis portal for universal pathogen detection and host mRNA expression profiling. Genome Biol 17:111. https://doi.org/10.1186/s13059-016-0969-1

doi: 10.1186/s13059-016-0969-1 pubmed: 27224977 pmcid: 4880956

Fotouhi A, Majidi M, Külekci MO (2018) Quality Assessment of High-Throughput DNA Sequencing Data via Range Analysis. In: Rojas I., Ortuño F. (eds) Bioinformatics and Biomedical Engineering. IWBBIO 2018. Lecture Notes in Computer Science, vol 10813. Cham: Springer. https://doi.org/10.1007/978-3-319-78723-7_37

Foster ZSL, Sharpton TJ, Grünwald NJ (2017) Metacoder: An R package for visualization and manipulation of community taxonomic diversity data. PLoS Comput Biol 13:e1005404. https://doi.org/10.1371/journal.pcbi.1005404

doi: 10.1371/journal.pcbi.1005404 pubmed: 28222096 pmcid: 5340466

Freese NH, Norris DC, Loraine AE (2016) Integrated genome browser: visual analytics platform for genomics. Bioinformatics 32:2089–2095. https://doi.org/10.1093/bioinformatics/btw069

doi: 10.1093/bioinformatics/btw069 pubmed: 27153568 pmcid: 4937187

French KE (2017) Engineering Mycorrhizal Symbioses to Alter Plant Metabolism and Improve Crop Health. Front Microbiol 8:1403. https://doi.org/10.3389/fmicb.2017.01403

doi: 10.3389/fmicb.2017.01403 pubmed: 28785256 pmcid: 5519612

Galanti L, Shasha D, Gunsalus KC (2017) Pheniqs: Fast and flexible quality-aware sequence demultiplexing. bioRxiv 128512. https://doi.org/10.1101/128512

Genovo AD, Buena-Atienza E, Ossowski S, Sagot MF (2019) WENGAN: Efficient and high quality hybrid de novo assembly of human genomes bioRxiv 840447. https://doi.org/10.1101/840447

Gieg LM, Toth CR (2016) Anaerobic biodegradation of hydrocarbons: metagenomics and metabolomics. Springer

Gillespie JJ, Wattam AR, Cammer SA, Gabbard JL et al (2011) PATRIC: the comprehensive bacterial bioinformatics resource with a focus on human pathogenic species. Infect Immun 79:4286–4298. https://doi.org/10.1128/IAI.00207-11

doi: 10.1128/IAI.00207-11 pubmed: 21896772 pmcid: 3257917

Giraldo-Calderón GI, Emrich SJ, MacCallum RM, Maslen G, Dialynas E, Topalis P et al (2015) VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases. Nucleic Acids Res 43:D707–D713. https://doi.org/10.1093/nar/gku1117

doi: 10.1093/nar/gku1117 pubmed: 25510499

Girotto S, Pizzi C, Comin M (2016) MetaProb: accurate metagenomic reads binning based on probabilistic sequence signatures. Bioinformatics 32:i567–i575. https://doi.org/10.1093/bioinformatics/btw466

doi: 10.1093/bioinformatics/btw466 pubmed: 27587676

Gori F, Folino G, Jetten MSM, Marchiori E (2011) MTR: taxonomic annotation of short metagenomic reads using clustering at multiple taxonomic ranks. Bioinformatics 27:196–203. https://doi.org/10.1093/bioinformatics/btq649

doi: 10.1093/bioinformatics/btq649 pubmed: 21127032

Goswami M, Chakraborty P, Mukherjee K, Mitra G, Bhattacharyya P, Dey S, Tribedi P (2018) Bioaugmentation and biostimulation: a potential strategy for environmental remediation. J Microbiol Exp 6:223–231. https://doi.org/10.15406/jmen.2018.06.00219

doi: 10.15406/jmen.2018.06.00219

Graham EDHJ, Tully BJ (2017) BinSanity: unsupervised clustering of environmental microbial assemblies using coverage and affinity propagation. PeerJ 5:e3035. https://doi.org/10.7717/peerj.3035

doi: 10.7717/peerj.3035 pubmed: 28289564 pmcid: 5345454

Gregor I, Schönhuth A, McHardy AC (2016) Snowball: strain aware gene assembly of metagenomes. Bioinformatics 32:i649–i657. https://doi.org/10.1093/bioinformatics/btw426

doi: 10.1093/bioinformatics/btw426 pubmed: 27587685

Guan D, Liu B, Wang Y (2018) deSPI: efficient classification of metagenomics reads with lightweight de Bruijn graph-based reference indexing2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 265–269. https://doi.org/10.1101/080200

Guo X, Yu N, Ding X, Wang J, Pan Y (2015) DIME: A Novel Framework for De Novo Metagenomic Sequence Assembly. J Comput Biol 22:159–177. https://doi.org/10.1089/cmb.2014.0251

doi: 10.1089/cmb.2014.0251 pubmed: 25684202 pmcid: 4326031

Haft DH, Selengut JD, Richter RA, Harkins D, Basu MK, Beck E (2003) TIGRFAMs and genome properties in 2013. Nucleic Acids Res 41:D387–D395. https://doi.org/10.1093/nar/gks1234

doi: 10.1093/nar/gks1234

Haider B, Ahn T-H, Bushnell B, Chai J, Copeland A, Pan C (2014) Omega: an overlap-graph de novo assembler for metagenomics. Bioinformatics 30:2717–2722. https://doi.org/10.1093/bioinformatics/btu395

doi: 10.1093/bioinformatics/btu395 pubmed: 24947750

Hamilton JJ, Reed JL (2012) Identification of Functional Differences in Metabolic Networks Using Comparative Genomics and Constraint-Based Models. PLoS ONE 7:e34670. https://doi.org/10.1371/journal.pone.0034670

doi: 10.1371/journal.pone.0034670 pubmed: 22666308 pmcid: 3359066

Hanson NW, Konwar KM, Hallam SJ (2016) LCA*: an entropy-based measure for taxonomic assignment within assembled metagenomes. Bioinformatics 32:3535–3542. https://doi.org/10.1093/bioinformatics/btw400

doi: 10.1093/bioinformatics/btw400 pubmed: 27515739 pmcid: 5181528

Harismah K, Mirzaei M, Ghasemi N, Nejati M (2018) Non-Covalent Functionalisation of C30 Fullerene by Pyrrole-n-Carboxylic Acid (n=2, 3): Density Functional Theory Studies. Z Nat Forsch A J Phys Sci 73:51–56. https://doi.org/10.1515/zna-2017-0233

doi: 10.1515/zna-2017-0233

Hatzopoulos T, Watkins SC, Putonti C (2016) PhagePhisher: a pipeline for the discovery of covert viral sequences in complex genomic datasets. Microb Genom 2:e000053. https://doi.org/10.1099/mgen.0.000053

doi: 10.1099/mgen.0.000053 pubmed: 28348848 pmcid: 5320576

Hitch TCA, Creevey CJ (2018) Spherical: an iterative workflow for assembling metagenomic datasets. BMC Bioinformatics 19:20. https://doi.org/10.1186/s12859-018-2028-2

doi: 10.1186/s12859-018-2028-2 pubmed: 29361904 pmcid: 5781261

Hong C, Manimaran S, Shen Y, Perez-Rogers JF, Byrd AL, Castro-Nallar E, Crandall KA, Johnson WE (2014) PathoScope 2.0: a complete computational framework for strain identification in environmental or clinical sequencing samples. Microbiome 2:1–15. https://doi.org/10.1186/2049-2618-2-33

doi: 10.1186/2049-2618-2-33

Howe KL, Bolt BJ, Shafie M, Kersey P, Berriman M (2017) WormBase ParaSite − a comprehensive resource for helminth genomics. Mol Biochem Parasitol 215:2–10. https://doi.org/10.1016/j.molbiopara.2016.11.005

doi: 10.1016/j.molbiopara.2016.11.005 pubmed: 27899279 pmcid: 5486357

Huse SM, Mark Welch DB, Voorhis A, Shipunova A, Morrison HG, Eren AM, Sogin ML (2014) VAMPS: a website for visualization and analysis of microbial population structures. BMC Bioinformatics 15:41. https://doi.org/10.1186/1471-2105-15-41

doi: 10.1186/1471-2105-15-41 pubmed: 24499292 pmcid: 3922339

Huson DH, Weber N (2013) Chapter Twenty-One - Microbial Community Analysis Using MEGAN. In: DeLong EF (ed) Methods Enzymol. Academic Press 465–485.

Hyatt D, LoCascio PF, Hauser LJ, Uberbacher EC (2012) Gene and translation initiation site prediction in metagenomic sequences. Bioinformatics 28:2223–2230. https://doi.org/10.1093/bioinformatics/bts429

doi: 10.1093/bioinformatics/bts429 pubmed: 22796954

Icay K, Chen P, Cervera A, Rantanen V, Lehtonen R, Hautaniemi S (2016) SePIA: RNA and small RNA sequence processing, integration, and analysis. BioData Min 9:20. https://doi.org/10.1186/s13040-016-0099-z

doi: 10.1186/s13040-016-0099-z pubmed: 27213017 pmcid: 4875694

Imelfort M, Parks D, Woodcroft BJ, Dennis P, Hugenholtz P, Tyson GW (2014) GroopM: an automated tool for the recovery of population genomes from related metagenomes. PeerJ 2:e603. https://doi.org/10.7717/peerj.603

doi: 10.7717/peerj.603 pubmed: 25289188 pmcid: 4183954

Ismail WM, Ye Y, Tang H (2014) Gene finding in metatranscriptomic sequences. BMC Bioinformatics 15:S8. https://doi.org/10.1186/1471-2105-15-S9-S8

doi: 10.1186/1471-2105-15-S9-S8 pubmed: 25253067 pmcid: 4168707

Iyer S, Bouzek H, Deng W, Larsen B, Casey E, Mullins JI (2013) Quality score based identification and correction of pyrosequencing errors. PLoS ONE 8:e73015. https://doi.org/10.1371/journal.pone.0073015

doi: 10.1371/journal.pone.0073015 pubmed: 24039850 pmcid: 3764156

Jain M, Olsen HE, Paten B, Akeson M (2016) The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community. Genome Biol 17:239. https://doi.org/10.1186/s13059-016-1103-0

doi: 10.1186/s13059-016-1103-0 pubmed: 27887629 pmcid: 5124260

Jadeja NB, Purohit HJ, Kapley A (2019) Decoding microbial community intelligence through metagenomics for efficient wastewater treatment. Funct Integr Genomics 19:839–851. https://doi.org/10.1007/s10142-019-00681-4

doi: 10.1007/s10142-019-00681-4 pubmed: 31111267

Ji P, Zhang Y, Wang J, Zhao F (2017) MetaSort untangles metagenome assembly by reducing microbial community complexity. Nat Commun 8:14306. https://doi.org/10.1038/ncomms14306

doi: 10.1038/ncomms14306 pubmed: 28112173 pmcid: 5264255

Jia P, Xuan L, Liu L, Wei C (2011) MetaBinG: Using GPUs to Accelerate Metagenomic Sequence Classification. PLoS ONE 6:e25353. https://doi.org/10.1371/journal.pone.0025353

doi: 10.1371/journal.pone.0025353 pubmed: 22132069 pmcid: 3223155

Jiang H, An L, Lin SM, Feng G, Qiu Y (2012) A Statistical Framework for Accurate Taxonomic Assignment of Metagenomic Sequencing Reads. PLoS ONE 7:e46450. https://doi.org/10.1371/journal.pone.0046450

doi: 10.1371/journal.pone.0046450 pubmed: 23049702 pmcid: 3462201

Jonathan B, Puritz CMH, Gold JR (2014) dDocent: a RADseq, variant-calling pipeline designed for population genomics of non-model organisms. PeerJ 2:e431. https://doi.org/10.7717/peerj.431

doi: 10.7717/peerj.431

Jost L, DeVries P, Walla T, Greeney H, Chao A, Ricotta C (2010) Partitioning diversity for conservation analyses. Divers Distrib 16:65–76. https://doi.org/10.1111/j.1472-4642.2009.00626.x

doi: 10.1111/j.1472-4642.2009.00626.x

Jourdren L, Bernard M, Dillies M-A, Le Crom S (2012) Eoulsan: a cloud computing-based framework facilitating high throughput sequencing analyses. Bioinformatics 28:1542–1543. https://doi.org/10.1093/bioinformatics/bts165

doi: 10.1093/bioinformatics/bts165 pubmed: 22492314

Kamath GM, Shomorony I, Xia F, Courtade TA, Tse DN (2017) HINGE: long-read assembly achieves optimal repeat resolution. Genome Res 27:747–756. https://doi.org/10.1101/gr.216465.116

doi: 10.1101/gr.216465.116 pubmed: 28320918 pmcid: 5411769

Kamneva OK (2017) Genome composition and phylogeny of microbes predict their co-occurrence in the environment. PLoS Comput Biol 13:e1005366. https://doi.org/10.1371/journal.pcbi.1005366

doi: 10.1371/journal.pcbi.1005366 pubmed: 28152007 pmcid: 5313232

Kanehisa M, Sato Y, Morishima K (2016) BlastKOALA and GhostKOALA: KEGG Tools for Functional Characterization of Genome and Metagenome Sequences. J Mol Biol 428:726–731. https://doi.org/10.1016/j.jmb.2015.11.006

doi: 10.1016/j.jmb.2015.11.006 pubmed: 26585406 pmcid: 26585406

Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K (2017) KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res 45:D353–D361. https://doi.org/10.1093/nar/gkw1092

doi: 10.1093/nar/gkw1092 pubmed: 27899662

Kang DD, Froula J, Egan R, Wang Z (2015) MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ 3:e1165. https://doi.org/10.7717/peerj.1165

doi: 10.7717/peerj.1165 pubmed: 26336640 pmcid: 4556158

Kawulok J, Deorowicz S (2015) CoMeta: Classification of Metagenomes Using k-mers. PLoS ONE 10:e0121453. https://doi.org/10.1371/journal.pone.0121453

doi: 10.1371/journal.pone.0121453 pubmed: 25884504 pmcid: 4401624

Kelley DR, Liu B, Delcher AL, Pop M, Salzberg SL (2012) Gene prediction with Glimmer for metagenomic sequences augmented by classification and clustering. Nucleic Acids Res 40:e9–e9. https://doi.org/10.1093/nar/gkr1067

doi: 10.1093/nar/gkr1067 pubmed: 22102569

Kelley DR, Salzberg SL (2010) Clustering metagenomic sequences with interpolated Markov models. BMC Bioinformatics 11:544. https://doi.org/10.1186/1471-2105-11-544

doi: 10.1186/1471-2105-11-544 pubmed: 21044341 pmcid: 3098094

Kerepesi C, Szalkai B, Grolmusz V (2015) Visual analysis of the quantitative composition of metagenomic communities: the AmphoraVizu webserver. Microb Ecol 69:695–697. https://doi.org/10.1007/s00248-014-0502-6

doi: 10.1007/s00248-014-0502-6 pubmed: 25296554

Kolmogorov M, Yuan J, Lin Y et al (2019) Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. https://doi.org/10.1038/s41587-019-0072-8

doi: 10.1038/s41587-019-0072-8 pubmed: 30936562

Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM (2017) Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res 27:722–736. https://doi.org/10.1101/gr.215087.116

doi: 10.1101/gr.215087.116 pubmed: 28298431 pmcid: 5411767

Koringa PG, Thakkar JR, Pandit RJ, Hinsu AT et al (2018) Metagenomic characterization of ruminal bacterial diversity in buffaloes from birth to adulthood using 16S rRNA gene amplicon sequencing. Funct Integr Genomics 19:237–247. https://doi.org/10.1007/s10142-018-0640-x

doi: 10.1007/s10142-018-0640-x pubmed: 30357583

Kho ZY, Lal SK (2018) The Human gut microbiome- A potential controller of wellness and disease. Front Microbiol 9:1835. https://doi.org/10.3389/fmicb.2018.01835

doi: 10.3389/fmicb.2018.01835 pubmed: 6102370 pmcid: 6102370

Kim D, Song L, Breitwieser FP, Salzberg SL (2016) Centrifuge: rapid and sensitive classification of metagenomic sequences. Genome Res 26:1721–1729. https://doi.org/10.1101/gr.210641.116

doi: 10.1101/gr.210641.116 pubmed: 27852649 pmcid: 5131823

Kobus R, Hundt C, Müller A, Schmidt B (2017) Accelerating metagenomic read classification on CUDA-enabled GPUs. BMC Bioinformatics 18:11. https://doi.org/10.1186/s12859-016-1434-6

doi: 10.1186/s12859-016-1434-6 pubmed: 28049411 pmcid: 5209836

Kornobis E, Cabellos L, Aguilar F, Frías-López C, Rozas J, Marco J, Zardoya R (2015) TRUFA: a user-friendly web server for de novo RNA-seq analysis using cluster computing. Evol Bioinforma 11:97–104. https://doi.org/10.4137/EBO.S23873

doi: 10.4137/EBO.S23873

Koslicki D, Foucart S, Rosen G (2014) WGSQuikr: Fast Whole-Genome Shotgun Metagenomic Classification. PLoS ONE 9:e91784. https://doi.org/10.1371/journal.pone.0091784

doi: 10.1371/journal.pone.0091784 pubmed: 24626336 pmcid: 3953531

Kozlov AM, Zhang J, Yilmaz P, Glöckner FO, Stamatakis A (2016) Phylogeny-aware identification and correction of taxonomically mislabeled sequences. Nucleic Acids Res 44:5022–5033. https://doi.org/10.1093/nar/gkw396

doi: 10.1093/nar/gkw396 pubmed: 27166378 pmcid: 4914121

Kroll KW, Mokaram NE, Pelletier AR, Frankhouser DE, Westphal MS, Stump PA, Stump CL, Bundschuh R, Blachly JS, Yan P (2014) Quality Control for RNA-Seq (QuaCRS): an integrated quality control pipeline. Cancer Inform 13:17–17. https://doi.org/10.4137/CIN.S14022

doi: 10.4137/CIN.S14022

Kultima JR, Coelho LP, Forslund K, Huerta-Cepas J, Li SS, Driessen M, Voigt AY, Zeller G, Sunagawa S, Bork P (2016) MOCAT2: a metagenomic assembly, annotation and profiling framework. Bioinformatics 32:2520–2523. https://doi.org/10.1093/bioinformatics/btw183

doi: 10.1093/bioinformatics/btw183 pubmed: 27153620 pmcid: 4978931

Kumar S, Jones M, Koutsovoulos G, Clarke M, Blaxter M (2013) Blobology: exploring raw genome data for contaminants, symbionts and parasites using taxon-annotated GC-coverage plots. Front Genet 4. https://doi.org/10.3389/fgene.2013.00237

Laczny CC, Sternal T, Plugaru V, Gawron P, Atashpendar A, Margossian HH, Coronado S, der Maaten LV, Vlassis N, Wilmes P (2015) VizBin - an application for reference-independent visualization and human-augmented binning of metagenomic data. Microbiome 3:1. https://doi.org/10.1186/s40168-014-0066-1

doi: 10.1186/s40168-014-0066-1 pubmed: 25621171 pmcid: 4305225

Lafond-Lapalme J, Duceppe M-O, Wang S, Moffett P, Mimee B (2016) A new method for decontamination of de novo transcriptomes using a hierarchical clustering algorithm. Bioinformatics 33:1293–1300. https://doi.org/10.1093/bioinformatics/btw793%JBioinformatics

doi: 10.1093/bioinformatics/btw793%JBioinformatics

Lai B, Wang F, Wang X, Duan L, Zhu H (2015) InteMAP: Integrated metagenomic assembly pipeline for NGS short reads. BMC Bioinformatics 16:244. https://doi.org/10.1186/s12859-015-0686-x

doi: 10.1186/s12859-015-0686-x pubmed: 26250558 pmcid: 4545859

Lam K-K, Hall R, Clum A, Rao S (2016) BIGMAC : breaking inaccurate genomes and merging assembled contigs for long read metagenomic assembly. BMC Bioinformatics 17:435. https://doi.org/10.1186/s12859-016-1288-y

doi: 10.1186/s12859-016-1288-y pubmed: 27793084 pmcid: 5084376

Land M, Hauser L, Jun SR, Nookaew I et al (2015) Insights from 20 years of bacterial genome sequencing. Funct Integr Genomics 15:141–161. https://doi.org/10.1007/s10142-015-0433-4

doi: 10.1007/s10142-015-0433-4 pubmed: 25722247 pmcid: 4361730

Laserson J, Jojic V, Koller D (2011) Genovo: De Novo Assembly for Metagenomes. J Comput Biol 18:429–443. https://doi.org/10.1089/cmb.2010.0244

doi: 10.1089/cmb.2010.0244 pubmed: 21385045

Lassmann T, Hayashizaki Y, Daub CO (2011) SAMStat: monitoring biases in next generation sequencing data. Bioinformatics 27:130–131. https://doi.org/10.1093/bioinformatics/btq614

doi: 10.1093/bioinformatics/btq614 pubmed: 21088025

Le Boulch M, Déhais P, Combes S, Pascal G (2019) The MACADAM database: a MetAboliC pAthways DAtabase for Microbial taxonomic groups for mining potential metabolic capacities of archaeal and bacterial taxonomic groups. Database 2019. https://doi.org/10.1093/database/baz049

Le VV, Tran LV, Tran HV (2016) A novel semi-supervised algorithm for the taxonomic assignment of metagenomic reads. BMC Bioinformatics 17:22. https://doi.org/10.1186/s12859-015-0872-x

Lechat P, Souche E, Moszer I (2013) SynTView — an interactive multi-view genome browser for next-generation comparative microorganism genomics. BMC Bioinformatics 14:277. https://doi.org/10.1186/1471-2105-14-277

doi: 10.1186/1471-2105-14-277 pubmed: 24053737 pmcid: 3849071

Lees JG, Lee D, Studer RA, Dawson NL, Sillitoe I et al (2014) Gene3D: Multi-domain annotations for protein sequence and comparative genome analysis. Nucleic Acids Res 42:D240–D245. https://doi.org/10.1093/nar/gkt1205

doi: 10.1093/nar/gkt1205 pubmed: 24270792

Leggett RM, Clavijo BJ, Clissold L, Clark MD, Caccamo M (2014) NextClip: an analysis and read preparation tool for Nextera Long Mate Pair libraries. Bioinformatics 30:566–568. https://doi.org/10.1093/bioinformatics/btt702

doi: 10.1093/bioinformatics/btt702 pubmed: 24297520

Leinonen R, Akhtar R, Birney E, Bower L, Cerdeno-Tarraga A et al (2010) The European Nucleotide Archive. Nucleic Acids Res 39:D28–D31. https://doi.org/10.1093/nar/gkq967

doi: 10.1093/nar/gkq967 pubmed: 20972220 pmcid: 3013801

Leinonen R, Sugawara H, Shumway M, on behalf of the International Nucleotide Sequence Database C (2011) The Sequence Read Archive. Nucleic Acids Res 39:D19–D21. https://doi.org/10.1093/nar/gkq1019

doi: 10.1093/nar/gkq1019 pubmed: 21062823

Li R, Zhu H, Ruan J, Qian W et al (2010) (2010) De novo assembly of human genomes with massively parallel short read sequencing. Genome Res 20(2):265–272. https://doi.org/10.1101/gr.097261.109

doi: 10.1101/gr.097261.109 pubmed: 20019144 pmcid: 2813482

Li D, Huang Y, Leung C-M, Luo R, Ting H-F, Lam T-W (2017) MegaGTA: a sensitive and accurate metagenomic gene-targeted assembler using iterative de Bruijn graphs. BMC Bioinformatics 18:408. https://doi.org/10.1186/s12859-017-1825-3

doi: 10.1186/s12859-017-1825-3 pubmed: 29072142 pmcid: 5657035

Li D, Liu C-M, Luo R, Sadakane K, Lam T-W (2015) MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 31:1674–1676. https://doi.org/10.1093/bioinformatics/btv033

doi: 10.1093/bioinformatics/btv033

Li Z, Chen Y, Mu D, Yuan J, Shi Y (2016) Comparison of the two major classes of assembly algorithms: overlap-layout consensus and de-bruijn-graph. Brief Funct Genomics 11:25–37. https://doi.org/10.1093/bfgp/elr035

doi: 10.1093/bfgp/elr035

Lin H-H, Liao Y-C (2016) Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes. Sci Rep 6:24175. https://doi.org/10.1038/srep24175

doi: 10.1038/srep24175 pubmed: 27067514 pmcid: 4828714

Lin Y-Y, Hsieh C-H, Chen J-H, Lu X, Kao J-H, Chen P-J, Chen D-S, Wang H-Y (2017) De novo assembly of highly polymorphic metagenomic data using in situ generated reference sequences and a novel BLAST-based assembly pipeline. BMC Bioinformatics 18:223. https://doi.org/10.1186/s12859-017-1630-z

doi: 10.1186/s12859-017-1630-z pubmed: 28446139 pmcid: 5406902

Lindner MS, Kollock M, Zickmann F, Renard BY (2013) Analyzing genome coverage profiles with applications to quality control in metagenomics. Bioinformatics 29:1260–1267. https://doi.org/10.1093/bioinformatics/btt147

doi: 10.1093/bioinformatics/btt147 pubmed: 23589648

Liu B, Gibbons T, Ghodsi M, Treangen T, Pop M (2011) Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences. Genome Biol 12:P11. https://doi.org/10.1186/1465-6906-12-S1-P11

doi: 10.1186/1465-6906-12-S1-P11 pmcid: 3439023

Liu J, Wang H, Yang H, Zhang Y, Wang J, Zhao F, Qi J (2013) Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms. Nucleic Acids Res 41:e3–e3. https://doi.org/10.1093/nar/gks828

doi: 10.1093/nar/gks828 pubmed: 22941634

Liu Y, Ripp F, Koeppel R, Schmidt H, Hellmann SL, Weber M, Krombholz CF, Schmidt B, Hankeln T (2017) AFS: identification and quantification of species composition by metagenomic sequencing. Bioinformatics 33:1396–1398. https://doi.org/10.1093/bioinformatics/btw822

doi: 10.1093/bioinformatics/btw822 pubmed: 28453677

Lo C-C, Chain PS (2014) Rapid evaluation and quality control of next generation sequencing data with FaQCs. BMC Bioinformatics 15:1–8. https://doi.org/10.1186/s12859-014-0366-2

doi: 10.1186/s12859-014-0366-2

Lohse M, Bolger AM, Nagel A, Fernie AR, Lunn JE, Stitt M, Usadel B (2012) R obi NA: A user-friendly, integrated software solution for RNA-Seq-based transcriptomics. Nucleic Acids Res 40:W622–W627. https://doi.org/10.1093/nar/gks540

doi: 10.1093/nar/gks540 pubmed: 22684630 pmcid: 3394330

Loman T (2017) A Novel Method for Predicting Ribosomal RNA Genes in Prokaryotic Genomes. http://lup.lub.lu.se/student-papers/record/8914064

Lu YY, Chen T, Fuhrman JA, Sun F (2017) COCACOLA: binning metagenomic contigs using sequence COmposition, read CoverAge, CO-alignment and paired-end read LinkAge. Bioinformatics 33:791–798. https://doi.org/10.1093/bioinformatics/btw290

doi: 10.1093/bioinformatics/btw290 pubmed: 27256312

Luo C, Rodriguez-R LM, Konstantinidis KT (2014) MyTaxa: an advanced taxonomic classifier for genomic and metagenomic sequences. Nucleic Acids Res 42:e73–e73. https://doi.org/10.1093/nar/gku169

doi: 10.1093/nar/gku169 pubmed: 24589583 pmcid: 4005636

Lux M, Krüger J, Rinke C, Maus I, Schlüter A, Woyke T, Sczyrba A, Hammer B (2016) acdc – Automated Contamination Detection and Confidence estimation for single-cell genome data. BMC Bioinformatics 17:543. https://doi.org/10.1186/s12859-016-1397-7

doi: 10.1186/s12859-016-1397-7 pubmed: 27998267 pmcid: 5168860

MacDonald NJ, Parks DH, Beiko RG (2012) Rapid identification of high-confidence taxonomic assignments for metagenomic data. Nucleic Acids Res 40:e111–e111. https://doi.org/10.1093/nar/gks335

doi: 10.1093/nar/gks335 pubmed: 22532608 pmcid: 3413139

Maglott D, Ostell J, Pruitt KD, Tatusova T (2011) Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res 39:D52–D57. https://doi.org/10.1093/nar/gkq1237

doi: 10.1093/nar/gkq1237 pubmed: 21115458

Mallet L, Bitard-Feildel T, Cerutti F, Chiapello H (2017) PhylOligo: a package to identify contaminant or untargeted organism sequences in genome assemblies. Bioinformatics 33:3283–3285. https://doi.org/10.1093/bioinformatics/btx396

doi: 10.1093/bioinformatics/btx396 pubmed: 28637232 pmcid: 5860033

Manconi A, Manca E, Moscatelli M, Gnocchi M, Orro A, Armano G, Milanesi L, Biotechnology (2015) G-CNV: a GPU-based tool for preparing data to detect CNVs with read-depth methods. Front Bioeng Biotechnol 3:28. https://doi.org/10.3389/fbioe.2015.00028

doi: 10.3389/fbioe.2015.00028 pubmed: 25806367 pmcid: 4354384

Mapleson D, Garcia Accinelli G, Kettleborough G, Wright J, Clavijo BJ (2017) KAT: a K-mer analysis toolkit to quality control NGS datasets and genome assemblies. Bioinformatics 33:574–576. https://doi.org/10.1093/bioinformatics/btw663

doi: 10.1093/bioinformatics/btw663 pubmed: 27797770

Mariette J, Noirot C, Klopp C (2011) Assessment of replicate bias in 454 pyrosequencing and a multi-purpose read-filtering tool. BMC Res Notes 4:1–4. https://doi.org/10.1186/1756-0500-4-149

doi: 10.1186/1756-0500-4-149

Martin J, Bruno VM, Fang Z, Meng X, Blow M, Zhang T et al (2010) Rnnotator: an automated de novo transcriptome assembly pipeline from stranded RNA-Seq reads. BMC Genomics 11:1–8. https://doi.org/10.1186/1471-2164-11-663

doi: 10.1186/1471-2164-11-663

Markowitz VM, Chen IMA, Palaniappan K, Chu K et al (2012) IMG: the integrated microbial genomes database and comparative analysis system. Nucleic Acids Res 40:D115–D122

doi: 10.1093/nar/gkr1044

Masella AP, Bartram AK, Truszkowski JM, Brown DG, Neufeld JD (2012) PANDAseq: paired-end assembler for illumina sequences. BMC Bioinformatics 13:1–7. https://doi.org/10.1186/1471-2105-13-31

doi: 10.1186/1471-2105-13-31

Matsen FA, Kodner RB, Armbrust EV (2010) pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree. BMC Bioinformatics 11:538. https://doi.org/10.1186/1471-2105-11-538

doi: 10.1186/1471-2105-11-538 pubmed: 21034504 pmcid: 3098090

May A, Abeln S, Buijs MJ, Heringa J, Crielaard W, Brandt BW (2015) NGS-eval: NGS Error analysis and novel sequence VAriant detection tooL. Nucleic Acids Res 43:W301–W305. https://doi.org/10.1093/nar/gkv346

doi: 10.1093/nar/gkv346 pubmed: 25878034 pmcid: 4489229

McNally CP, Eng A, Noecker C, Gagne-Maynard WC, Borenstein E (2018) BURRITO: An Interactive Multi-Omic Tool for Visualizing Taxa-Function Relationships in Microbiome Data. Front Microbiol 9:365. https://doi.org/10.3389/fmicb.2018.00365

doi: 10.3389/fmicb.2018.00365 pubmed: 29545787 pmcid: 5837987

Meinicke P (2015) UProC: tools for ultra-fast protein domain classification. Bioinformatics 31:1382–1388. https://doi.org/10.1093/bioinformatics/btu843

doi: 10.1093/bioinformatics/btu843 pubmed: 25540185

Meißner T, Fisch KM, Gioia L, Su AI (2015) OncoRep: an n-of-1 reporting tool to support genome-guided treatment for breast cancer patients using RNA-sequencing. BMC Med Genomics 8:1–8. https://doi.org/10.1186/s12920-015-0095-z

doi: 10.1186/s12920-015-0095-z

Mendoza-Parra MA, Saleem M-AM, Blum M, Cholley P-E, Gronemeyer H (2016) NGS-QC generator: a quality control system for ChIP-Seq and related deep sequencing-generated datasetsStatistical Genomics. Springer 243–265

Menzel P, Ng KL, Krogh A (2016) Fast and sensitive taxonomic classification for metagenomics with Kaiju. Nat Commun 7:11257. https://doi.org/10.1038/ncomms11257

doi: 10.1038/ncomms11257 pubmed: 27071849 pmcid: 4833860

Merriman B, Rothberg JM (2012) Progress in ion torrent semiconductor chip based sequencing. Electrophoresis 33:3397–3417. https://doi.org/10.1002/elps.201200424

doi: 10.1002/elps.201200424 pubmed: 23208921

Metwally AA, Dai Y, Finn PW, Perkins DL (2016) WEVOTE: Weighted Voting Taxonomic Identification Method of Microbial Sequences. PLoS ONE 11:e0163527. https://doi.org/10.1371/journal.pone.0163527

doi: 10.1371/journal.pone.0163527 pubmed: 27683082 pmcid: 5040256

Meyer F, Hofmann P, Belmann P, Garrido-Oter R, Fritz A, Sczyrba A, McHardy AC (2018) AMBER: Assessment of Metagenome BinnERs. GigaScience 7. https://doi.org/10.1093/gigascience/giy069

Mikheenko A, Saveliev V, Gurevich A (2016) MetaQUAST: evaluation of metagenome assemblies. Bioinformatics 32:1088–1090. https://doi.org/10.1093/bioinformatics/btv697

doi: 10.1093/bioinformatics/btv697 pubmed: 26614127

Miller CS, Baker BJ, Thomas BC, Singer SW, Banfield JF (2011) EMIRGE: reconstruction of full-length ribosomal genes from microbial community short read sequencing data. Genome Biol 12:R44. https://doi.org/10.1186/gb-2011-12-5-r44

doi: 10.1186/gb-2011-12-5-r44 pubmed: 21595876 pmcid: 3219967

Minot SS, Krumm N, Greenfield NB (2015) One codex: a sensitive and accurate data platform for genomic microbial identification. BioRxiv 027607. https://doi.org/10.1101/027607

Mitchell A, Chang H-Y, Daugherty L, Fraser M, Hunter S et al (2015) The InterPro protein families database: the classification resource after 15 years. Nucleic Acids Res 43:D213–D221. https://doi.org/10.1093/nar/gku1243

doi: 10.1093/nar/gku1243 pubmed: 25428371

Morgat A, Coissac E, Coudert E, Axelsen KB, Keller G, Bairoch A, Bridge A, Bougueleret L, Xenarios I, Viari A (2012) UniPathway: a resource for the exploration and annotation of metabolic pathways. Nucleic Acids Res 40:D761–D769. https://doi.org/10.1093/nar/gkr1023

doi: 10.1093/nar/gkr1023 pubmed: 22102589

Moss EL, Bishara A, Tkachenko E, Kang JB, Andermann TM, Wood C, Handy C, Ji H, Batzoglou S, Bhatt AS (2017) De novo assembly of microbial genomes from human gut metagenomes using barcoded short read sequences. bioRxiv 125211. https://doi.org/10.1101/125211

Nakano Y, Takeshita T, Yasui M, Yamashita Y (2010) Prediction of plausible bacterial composition based on terminal restriction fragment length polymorphisms using a Monte Carlo method. Microb Ecol 60:364–372. https://doi.org/10.1007/s00248-010-9703-9

doi: 10.1007/s00248-010-9703-9 pubmed: 20574825

Namiki T, Hachiya T, Tanaka H, Sakakibara Y (2012) MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads. Nucleic Acids Res 40:e155–e155. https://doi.org/10.1093/nar/gks678

doi: 10.1093/nar/gks678 pubmed: 22821567 pmcid: 3488206

Navarro JF, Sjöstrand J, Salmén F, Lundeberg J, Ståhl PL (2017) ST Pipeline: an automated pipeline for spatial mapping of unique transcripts. Bioinformatics 33:2591–2593. https://doi.org/10.1093/bioinformatics/btx211

doi: 10.1093/bioinformatics/btx211 pubmed: 28398467

Nayfach S, Rodriguez-Mueller B, Garud N, Pollard KS (2016) An integrated metagenomics pipeline for strain profiling reveals novel patterns of bacterial transmission and biogeography. Genome Res 26:1612–1625. https://doi.org/10.1101/gr.201863.115

doi: 10.1101/gr.201863.115 pubmed: 27803195 pmcid: 5088602

Nazir A (2016) Review on metagenomics and its applications. J Imp J Intersd Res 2:10

Ng C, Li H, Wu WKK, Wong SH, Yu J (2019) Genomics and metagenomics of colorectal cancer. J Gastrointest Oncol 10:1164–1170. https://doi.org/10.21037/jgo.2019.06.04

doi: 10.21037/jgo.2019.06.04 pubmed: 31949936 pmcid: 6955013

Nipperess DA, Matsen FA IV (2013) The mean and variance of phylogenetic diversity under rarefaction. Methods Ecol Evol 4:566–572. https://doi.org/10.1111/2041-210X.12042

doi: 10.1111/2041-210X.12042 pubmed: 23833701 pmcid: 3699894

O’Halloran DM (2017) fastQ_brew: module for analysis, preprocessing, and reformatting of FASTQ sequence data. BMC Res Notes 10:1–4. https://doi.org/10.1186/s13104-017-2616-7

doi: 10.1186/s13104-017-2616-7

Ogasawara O, Kodama Y, Mashima J, Kosuge T, Fujisawa T (2020) DDBJ Database updates and computational infrastructure enhancement. Nucleic Acids Res 48:D45–D50. https://doi.org/10.1093/nar/gkz982

doi: 10.1093/nar/gkz982 pubmed: 31724722

Oh J, Kim BK, Cho W-S, Hong SG, Kim KM (2012) PyroTrimmer: a software with GUI for pre-processing 454 amplicon sequences. J Microbiol 50:766–769. https://doi.org/10.1007/s12275-012-2494-6

doi: 10.1007/s12275-012-2494-6 pubmed: 23124743

Okuda S, Tsuchiya Y, Kiriyama C, Itoh M, Morisaki H (2012) Virtual metagenome reconstruction from 16S rRNA gene sequences. Nat Commun 3:1203. https://doi.org/10.1038/ncomms2203

doi: 10.1038/ncomms2203 pubmed: 23149747

Ondov BD, Bergman NH, Phillippy AM (2011) Interactive metagenomic visualization in a Web browser. BMC Bioinformatics 12:385. https://doi.org/10.1186/1471-2105-12-385

doi: 10.1186/1471-2105-12-385 pubmed: 21961884 pmcid: 3190407

Orakov AN, Sakenova NK, Sorokin A, Goryanin II (2018) ASAR: visual analysis of metagenomes in R. Bioinformatics 34:1404–1405. https://doi.org/10.1093/bioinformatics/btx775

doi: 10.1093/bioinformatics/btx775 pubmed: 29211828

Orellana LH, Rodriguez RL, Konstantinidis KT (2017) ROCker: accurate detection and quantification of target genes in short-read metagenomic data sets by modeling sliding-window bitscores. Nucleic Acids Res 45:e14. https://doi.org/10.1093/nar/gkw900

doi: 10.1093/nar/gkw900 pubmed: 28180325

Pandey RV, Pabinger S, Kriegner A, Weinhäusel A (2016) ClinQC: a tool for quality control and cleaning of Sanger and NGS data in clinical research. BMC Bioinformatics 17:1–9. https://doi.org/10.1186/s12859-016-0915-y

doi: 10.1186/s12859-016-0915-y

Parida S, Sharma D (2019) The power of small changes: Comprehensive analyses of microbial dysbiosis in breast cancer. J Biochim Biophys Acta Rev Cancer 1871:392–405. https://doi.org/10.1016/j.bbcan.2019.04.001

doi: 10.1016/j.bbcan.2019.04.001

Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW (2015) CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. https://doi.org/10.1101/gr.186072.114

doi: 10.1101/gr.186072.114 pubmed: 25977477 pmcid: 4484387

Patel RK, Jain M (2012) NGS QC Toolkit: a toolkit for quality control of next generation sequencing data. PLoS ONE 7:e30619. https://doi.org/10.1371/journal.pone.0030619

doi: 10.1371/journal.pone.0030619 pubmed: 22312429 pmcid: 3270013

Pati A, Heath LS, Kyrpides NC, Ivanova N (2011) ClaMS: A Classifier for Metagenomic Sequences. Stand Genom Sci 5:248–253. https://doi.org/10.4056/sigs.2075298

doi: 10.4056/sigs.2075298

Patil KR, Roune L, McHardy AC (2012) The PhyloPythiaS Web Server for Taxonomic Assignment of Metagenome Sequences. PLoS ONE 7:e38581. https://doi.org/10.1371/journal.pone.0038581

doi: 10.1371/journal.pone.0038581 pubmed: 22745671 pmcid: 3380018

Pehrsson EC, Tsukayama P, Patel S, Mejía-Bautista M, Sosa-Soto G, Navarrete KM, Calderon M, Cabrera L, Hoyos-Arango W, Bertoli MT, Berg DE, Gilman RH, Dantas G (2016) Interconnected microbiomes and resistomes in low-income human habitats. Nature 533:212–216. https://doi.org/10.1038/nature17672

doi: 10.1038/nature17672 pubmed: 27172044 pmcid: 4869995

Peng Y, Leung HCM, Yiu SM, Chin FYL (2011) Meta-IDBA: a de Novo assembler for metagenomic data. Bioinformatics 27:i94–i101. https://doi.org/10.1093/bioinformatics/btr216

doi: 10.1093/bioinformatics/btr216 pubmed: 21685107 pmcid: 3117360

Peng Y, Leung HCM, Yiu SM, Chin FYL (2012) IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 28:1420–1428. https://doi.org/10.1093/bioinformatics/bts174

doi: 10.1093/bioinformatics/bts174

Peng Y, Maxwell AS, Barker ND, Laird JG, Kennedy AJ, Wang N, Zhang C, Gong P (2014) SeqAssist: a novel toolkit for preliminary analysis of next-generation sequencing dataBMC Bioinformatics. Springer 1–11

Perez-Riverol Y, Csordas A, Bai J, Bernal-Llinares M, Hewapathirana S et al (2019) The PRIDE database and related tools and resources in 2019: improving support for quantification data. Nucleic Acids Res 47:D442–D450. https://doi.org/10.1093/nar/gky1106

doi: 10.1093/nar/gky1106 pubmed: 30395289 pmcid: 30395289

Pericard P, Dufresne Y, Couderc L, Blanquart S, Touzet H (2018) MATAM: reconstruction of phylogenetic marker genes from short sequencing reads in metagenomes. Bioinformatics 34:585–591. https://doi.org/10.1093/bioinformatics/btx644

doi: 10.1093/bioinformatics/btx644 pubmed: 29040406

Peterlongo P, Chikhi R (2012) Mapsembler, targeted and micro assembly of large NGS datasets on a desktop computer. BMC Bioinformatics 13:48. https://doi.org/10.1186/1471-2105-13-48

doi: 10.1186/1471-2105-13-48 pubmed: 22443449 pmcid: 3514201

Petersen TN, Lukjancenko O, Thomsen MCF, Maddalena Sperotto M, Lund O, Møller Aarestrup F, Sicheritz-Pontén T (2017) MGmapper: Reference based mapping and taxonomy annotation of metagenomics sequence reads. PLoS ONE 12:e0176469. https://doi.org/10.1371/journal.pone.0176469

doi: 10.1371/journal.pone.0176469 pubmed: 28467460 pmcid: 5415185

Piro VC, Lindner MS, Renard BY (2016) DUDes: a top-down taxonomic profiler for metagenomics. Bioinformatics 32:2272–2280. https://doi.org/10.1093/bioinformatics/btw150

doi: 10.1093/bioinformatics/btw150 pubmed: 27153591

Porter MS, Beiko RG (2013) SPANNER: taxonomic assignment of sequences using pyramid matching of similarity profiles. Bioinformatics 29:1858–1864. https://doi.org/10.1093/bioinformatics/btt313

doi: 10.1093/bioinformatics/btt313 pubmed: 23732273 pmcid: 3712219

Potter SC, Luciani A, Eddy SR, Park Y, Lopez R, Finn RD (2018) HMMER web server: 2018 update. Nucleic Acids Res 46:W200–W204. https://doi.org/10.1093/nar/gky448

doi: 10.1093/nar/gky448 pubmed: 29905871 pmcid: 6030962

Pujar S, O’Leary NA, Farrell CM, Loveland JE et al (2018) Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation. Nucleic Acids Res 46:D221–D228. https://doi.org/10.1093/nar/gkx1031

doi: 10.1093/nar/gkx1031 pubmed: 29126148

Qiu Y, Tian X, Zhang S (2015) Infer Metagenomic Abundance and Reveal Homologous Genomes Based on the Structure of Taxonomy Tree. IEEE/ACM Trans Comput Biol Bioinform 12:1112–1122. https://doi.org/10.1109/TCBB.2015.2415814

doi: 10.1109/TCBB.2015.2415814 pubmed: 26451823

Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T et al (2013) The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res 41:D590–D596. https://doi.org/10.1093/nar/gks1219

doi: 10.1093/nar/gks1219 pubmed: 23193283

Ramanan R, Kim B-H, Cho D-H, Oh H-M, Kim H-S (2016) Algae–bacteria interactions: Evolution, ecology and emerging applications. Biotechnol Adv 34:14–29. https://doi.org/10.1016/j.biotechadv.2015.12.003

doi: 10.1016/j.biotechadv.2015.12.003 pubmed: 26657897

Ramirez-Gonzales RH, Leggett RM, Waite D et al (2013) StatsDB: platform-agnostic storage and understanding of next generation sequencing run metrics. F1000Research 2:248. https://doi.org/10.12688/f1000research.2-248.v2

doi: 10.12688/f1000research.2-248.v2

Ramos RT, Carneiro AR, Baumbach J, Azevedo V, Schneider MP, Silva A (2011) Analysis of quality raw data of second generation sequencers with Quality Assessment Software. BMC Res Notes 4:1–6. https://doi.org/10.1186/1756-0500-4-130

doi: 10.1186/1756-0500-4-130

Rappoport N, Linial N, Linial M (2013) ProtoNet: charting the expanding universe of protein sequences. Nat Biotechnol 31:290–292. https://doi.org/10.1038/nbt.2553

doi: 10.1038/nbt.2553 pubmed: 23563419

Rasheed Z, Rangwala H (2012) Metagenomic taxonomic classification using extreme learning machines. J Bioinform Comput Biol 10:1250015. https://doi.org/10.1142/S0219720012500151

doi: 10.1142/S0219720012500151 pubmed: 22849369

Rho M, Tang H, Ye Y (2010) FragGeneScan: predicting genes in short and error-prone reads. Nucleic Acids Res 38:e191–e191. https://doi.org/10.1093/nar/gkq747

doi: 10.1093/nar/gkq747 pubmed: 20805240 pmcid: 2978382

Rhoads A, Au KF (2015) PacBio Sequencing and Its Application. Genom Proteom Bioinf 13:278–289. https://doi.org/10.1016/j.gpb.2015.08.002

doi: 10.1016/j.gpb.2015.08.002

Rineh A, Kelso MJ, Vatansever F, Tegos GP, Hamblin MR (2014) Clostridium difficile infection: molecular pathogenesis and novel therapeutics. Expert Rev Anti Infect Ther 12:131–150. https://doi.org/10.1586/14787210.2014.866515

doi: 10.1586/14787210.2014.866515 pubmed: 24410618 pmcid: 4306399

Robertson CE, Harris JK, Wagner BD, Granger D, Browne K, Tatem B, Feazel LM, Park K, Pace NR, Frank DN (2013) Explicet: graphical user interface software for metadata-driven management, analysis and visualization of microbiome data. Bioinformatics 29:3100–3101. https://doi.org/10.1093/bioinformatics/btt526

doi: 10.1093/bioinformatics/btt526 pubmed: 24021386 pmcid: 3834795

Rodrigue S, Materna AC, Timberlake SC, Blackburn MC, Malmstrom RR, Alm EJ, Chisholm SW (2010) Unlocking Short Read Sequencing for Metagenomics. PLoS ONE 5:e11840. https://doi.org/10.1371/journal.pone.0011840

doi: 10.1371/journal.pone.0011840 pubmed: 20676378 pmcid: 2911387

Rodriguez-Martinez A, Ayala R, Posma JM, Harvey N et al (2019) pJRES Binning Algorithm (JBA): a new method to facilitate the recovery of metabolic information from pJRES 1H NMR spectra. Bioinformatics 35:1916–1922. https://doi.org/10.1093/bioinformatics/bty837

doi: 10.1093/bioinformatics/bty837 pubmed: 30351417

Rodriguez-Martinez A, Ayala R, Posma JM et al (2017) MetaboSignal: a network-based approach for topological analysis of metabotype regulation via metabolic and signaling pathways. Bioinformatics 33:773–775. https://doi.org/10.1093/bioinformatics/btw697

doi: 10.1093/bioinformatics/btw697 pubmed: 28011775

Rodriguez-Martinez A, Posma JM, Ayala R et al (2018) MWASTools: an R/bioconductor package for metabolome-wide association studies. Bioinformatics 34:890–892. https://doi.org/10.1093/bioinformatics/btx477

doi: 10.1093/bioinformatics/btx477 pubmed: 28961702

Rodriguez-r LM, Konstantinidis KT (2014) Estimating coverage in metagenomic data sets and why it matters. ISME J 8:2349–2351. https://doi.org/10.1038/ismej.2014.76

doi: 10.1038/ismej.2014.76 pubmed: 24824669 pmcid: 4992084

Rosenbloom KR, Armstrong J, Barber GP, Casper J, Clawson H et al (2015) The UCSC Genome Browser database: 2015 update. Nucleic Acids Res 43:D670–D681. https://doi.org/10.1093/nar/gku1177

doi: 10.1093/nar/gku1177 pubmed: 25428374

Rosen GL, Reichenberger ER, Rosenfeld AM (2011) NBC: the Naïve Bayes Classification tool webserver for taxonomic classification of metagenomic reads. Bioinformatics 27:127–129. https://doi.org/10.1093/bioinformatics/btq619

doi: 10.1093/bioinformatics/btq619 pubmed: 21062764

Rozov R, Goldshlager G, Halperin E, Shamir R (2018) Faucet: streaming de novo assembly graph construction. Bioinformatics 34:147–154. https://doi.org/10.1093/bioinformatics/btx471

doi: 10.1093/bioinformatics/btx471 pubmed: 29036597

Ruan J, Li H (2020) Fast and accurate long-read assembly with wtdbg2. Nat Methods 17:155–158. https://doi.org/10.1038/s41592-019-0669-3

doi: 10.1038/s41592-019-0669-3 pubmed: 31819265

Ruby JG, Bellare P, DeRisi JL (2013) PRICE: Software for the Targeted Assembly of Components of (Meta) Genomic Sequence Data. G3 Genes Genomes Genet 3:865–880. https://doi.org/10.1534/g3.113.005967

doi: 10.1534/g3.113.005967

Samaras P, Schmidt T, Frejno M, Gessulat S et al (2020) ProteomicsDB: a multi-omics and multi-organism resource for life science research. Nucleic Acids Res 48:D1153–D1163. https://doi.org/10.1093/nar/gkz974

doi: 10.1093/nar/gkz974 pubmed: 31665479

Sato K, Sakakibara Y (2013) An extended genovo metagenomic assembler by incorporating paired-end information. PeerJ 1:e196. https://doi.org/10.7717/peerj.196

doi: 10.7717/peerj.196 pubmed: 24281688 pmcid: 3817583

Sato Y, Kojima K, Nariai N, Yamaguchi-Kabata Y et al (2014) SUGAR: graphical user interface-based data refiner for high-throughput DNA sequencing. BMC Genomics 15:1–5. https://doi.org/10.1186/1471-2164-15-664

doi: 10.1186/1471-2164-15-664

Schaab C, Geiger T, Stoehr G, Cox J, Mann M (2012) Analysis of High Accuracy, Quantitative Proteomics Data in the MaxQB Database*. Mol Cell Proteomics 11(M111):014068. https://doi.org/10.1074/mcp.M111.014068

doi: 10.1074/mcp.M111.014068 pubmed: 22301388

Scheuch M, Höper D, Beer M (2015) RIEMS: a software pipeline for sensitive and comprehensive taxonomic classification of reads from metagenomics datasets. BMC Bioinformatics 16:69. https://doi.org/10.1186/s12859-015-0503-6

doi: 10.1186/s12859-015-0503-6 pubmed: 25886935 pmcid: 4351923

Schmieder R, Edwards R (2011a) Fast identification and removal of sequence contamination from genomic and metagenomic datasets. PLoS ONE 6:e17288. https://doi.org/10.1371/journal.pone.0017288

doi: 10.1371/journal.pone.0017288 pubmed: 21408061 pmcid: 3052304

Schmieder R, Edwards R (2011b) Quality control and preprocessing of metagenomic datasets. Bioinformatics 27:863–864. https://doi.org/10.1093/bioinformatics/btr026

doi: 10.1093/bioinformatics/btr026 pubmed: 21278185 pmcid: 3051327

Schreiber F, Gumrich P, Daniel R, Meinicke P (2010) Treephyler: fast taxonomic profiling of metagenomes. Bioinformatics 26:960–961. https://doi.org/10.1093/bioinformatics/btq070

doi: 10.1093/bioinformatics/btq070 pubmed: 20172941

Schröder J, Corbin V, Papenfuss AT (2016) HYSYS: have you swapped your samples? Bioinformatics 33:596–598. https://doi.org/10.1093/bioinformatics/btw685

doi: 10.1093/bioinformatics/btw685 pmcid: 5408803

Schroeder CM, Hilke FJ, Löffler MW, Bitzer M, Lenz F, Sturm M (2017) A comprehensive quality control workflow for paired tumor-normal NGS experiments. Bioinformatics 33:1721–1722. https://doi.org/10.1093/bioinformatics/btx032

doi: 10.1093/bioinformatics/btx032 pubmed: 28130233

Segata N, Waldron L, Ballarini A, Narasimhan V, Jousson O, Huttenhower C (2012) Metagenomic microbial community profiling using unique clade-specific marker genes. Nat Methods 9:811–814. https://doi.org/10.1038/nmeth.2066

doi: 10.1038/nmeth.2066 pubmed: 22688413 pmcid: 22688413

Sharma AK, Gupta A, Kumar S, Dhakan DB, Sharma VK (2015) Woods: A fast and accurate functional annotator and classifier of genomic and metagenomic sequences. Genomics 106:1–6. https://doi.org/10.1016/j.ygeno.2015.04.001

doi: 10.1016/j.ygeno.2015.04.001 pubmed: 25863333

Shafin K, Pesout T, Lorig-Roach R et al (2020) Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes. Nat Biotechnol 38:1044–1053. https://doi.org/10.1038/s41587-020-0503-6

doi: 10.1038/s41587-020-0503-6 pubmed: 32686750 pmcid: 7483855

Sigrist CJA, Cerutti L, de Castro E et al (2010) PROSITE, a protein domain database for functional characterization and annotation. Nucleic Acids Res 38:D161–D166. https://doi.org/10.1093/nar/gkp885

doi: 10.1093/nar/gkp885 pubmed: 19858104

Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM (2015) BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210–3212. https://doi.org/10.1093/bioinformatics/btv351

doi: 10.1093/bioinformatics/btv351 pubmed: 26059717

Simon M, Dittami EC (2017) Detection of bacterial contaminants and hybrid sequences in the genome of the kelp Saccharina japonica using Taxoblast. PeerJ 5:e4073. https://doi.org/10.7717/peerj.4073

doi: 10.7717/peerj.4073

Simpson JT, Durbin R (2012) Efficient de novo assembly of large genomes using compressed data structures. Genome Res 22:549–556. https://doi.org/10.1101/gr.126953.111

doi: 10.1101/gr.126953.111 pubmed: 22156294 pmcid: 3290790

Singer J, Ruscheweyh H-J, Hofmann AL, Thurnherr T et al (2018) NGS-pipe: a flexible, easily extendable and highly configurable framework for NGS analysis. Bioinformatics 34:107–108. https://doi.org/10.1093/bioinformatics/btx540

doi: 10.1093/bioinformatics/btx540 pubmed: 28968639

Singh B, Crippen TL, Zheng L, Fields AT, Yu Z et al (2015) A metagenomic assessment of the bacteria associated with Lucilia sericata and Lucilia cuprina (Diptera: Calliphoridae). Appl Microbiol Biotechnol 99:869–883. https://doi.org/10.1007/s00253-014-6115-7

doi: 10.1007/s00253-014-6115-7 pubmed: 25306907

Schloss PD, Westcott SL, Ryabin T, Hall JR et al (2020) Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities. Appl Environ Microbiol 75:7537–7541. https://doi.org/10.1128/AEM.01541-09

doi: 10.1128/AEM.01541-09

Smith BC, McAndrew T, Chen Z, Harari A, Barris DM et al (2012) The cervical microbiome over 7 years and a comparison of methodologies for its characterization. PLoS ONE 7:e40425. https://doi.org/10.1371/journal.pone.0040425

doi: 10.1371/journal.pone.0040425 pubmed: 22792313 pmcid: 3392218

Sohn MB, An L, Pookhao N, Li Q (2014) Accurate genome relative abundance estimation for closely related species in a metagenomic sample. BMC Bioinformatics 15:242. https://doi.org/10.1186/1471-2105-15-242

doi: 10.1186/1471-2105-15-242 pubmed: 25027647 pmcid: 4131027

Somervuo P, Koskela S, Pennanen J, Henrik Nilsson R, Ovaskainen O (2016) Unbiased probabilistic taxonomic classification for DNA barcoding. Bioinformatics 32:2920–2927. https://doi.org/10.1093/bioinformatics/btw346

doi: 10.1093/bioinformatics/btw346 pubmed: 27296980

Stark M, Berger SA, Stamatakis A, von Mering C (2010) MLTreeMap - accurate Maximum Likelihood placement of environmental DNA sequences into taxonomic and functional reference phylogenies. BMC Genomics 11:461. https://doi.org/10.1186/1471-2164-11-461

doi: 10.1186/1471-2164-11-461 pubmed: 20687950 pmcid: 3091657

Starostina E, Tamazian G, Dobrynin P, O’Brien S, Komissarov A (2015) Cookiecutter: a tool for kmer-based read filtering and extraction. bioRxiv 024679. https://doi.org/10.1101/024679

Stewart RD, Auffret MD, Snelling TJ, Roehe R, Watson M (2019) MAGpy: a reproducible pipeline for the downstream analysis of metagenome-assembled genomes (MAGs). Bioinformatics 35:2150–2152. https://doi.org/10.1093/bioinformatics/bty905

doi: 10.1093/bioinformatics/bty905 pubmed: 30418481

Strous M, Kraft B, Bisdorf R, Tegetmeyer H (2012) The Binning of Metagenomic Contigs for Microbial Physiology of Mixed Cultures. Front Microbiol 3:410. https://doi.org/10.3389/fmicb.2012.00410

doi: 10.3389/fmicb.2012.00410 pubmed: 23227024 pmcid: 3514610

Sunagawa S, Mende DR, Zeller G, Izquierdo-Carrasco F et al (2013) Metagenomic species profiling using universal phylogenetic marker genes. Nat Methods 10:1196–1199. https://doi.org/10.1038/nmeth.2693

doi: 10.1038/nmeth.2693 pubmed: 24141494

Tanaseichuk O, Borneman J, Jiang T (2012) Separating metagenomic short reads into genomes via clustering. Algorithms Mol Biol 7:27. https://doi.org/10.1186/1748-7188-7-27

doi: 10.1186/1748-7188-7-27 pubmed: 23009059 pmcid: 3537596

Tang S, Antonov I, Borodovsky M (2013) MetaGeneTack: ab initio detection of frameshifts in metagenomic sequences. Bioinformatics 29:114–116. https://doi.org/10.1093/bioinformatics/bts636

doi: 10.1093/bioinformatics/bts636 pubmed: 23129300

The UniProt C (2019) UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res 47:D506–D515. https://doi.org/10.1093/nar/gky1049

doi: 10.1093/nar/gky1049

Thompson JF, Oliver JS (2012) Mapping and sequencing DNA using nanopores and nanodetectors. Electrophoresis 33:3429–3436. https://doi.org/10.1002/elps.201200136

doi: 10.1002/elps.201200136 pubmed: 23208922

Thompson JF, Steinmann KE (2010) Single molecule sequencing with a HeliScope Genetic Analysis System. Curr Protoc Mol Biol 92:7.10.1-7.10.14. https://doi.org/10.1002/0471142727.mb0710s92

doi: 10.1002/0471142727.mb0710s92

Tiwari R, Nain L, Labrou NE, Shukla P (2018) Bioprospecting of functional cellulases from metagenome for second generation biofuel production: a review. Crit Rev Microbiol 44:244–257. https://doi.org/10.1080/1040841X.2017.1337713

doi: 10.1080/1040841X.2017.1337713 pubmed: 28609211

Torkamaneh D, Laroche J, Bastien M, Abed A, Belzile F (2017) Fast-GBS: a new pipeline for the efficient and highly accurate calling of SNPs from genotyping-by-sequencing data. BMC Bioinformatics 18:1–7. https://doi.org/10.1186/s12859-016-1431-9

doi: 10.1186/s12859-016-1431-9

Treangen TJ, Koren S, Sommer DD, Liu B et al (2013) MetAMOS: a modular and open source metagenomic assembly and analysis pipeline. Genome Biol 14:R2. https://doi.org/10.1186/gb-2013-14-1-r2

doi: 10.1186/gb-2013-14-1-r2 pubmed: 23320958 pmcid: 4053804

Uchiyama T, Irie M, Mori H, Kurokawa K, Yamada T (2015) FuncTree: Functional Analysis and Visualization for Large-Scale Omics Data. PLoS ONE 10:e0126967. https://doi.org/10.1371/journal.pone.0126967

doi: 10.1371/journal.pone.0126967 pubmed: 25974630 pmcid: 4431737

Uchiyama T, Mihara M, Nishide H, Chiba H (2013) MBGD update 2013: the microbial genome database for exploring the diversity of microbial world. Nucleic Acids Res 41:D631–D635. https://doi.org/10.1093/nar/gks1006

doi: 10.1093/nar/gks1006 pubmed: 23118485

Ulyantsev VI, Kazakov SV, Dubinkina VB, Tyakht AV, Alexeev DG (2016) MetaFast: fast reference-free graph-based comparison of shotgun metagenomic data. Bioinformatics 32:2760–2767. https://doi.org/10.1093/bioinformatics/btw312

doi: 10.1093/bioinformatics/btw312 pubmed: 27259541

Uritskiy GV, DiRuggiero J, Taylor J (2018) MetaWRAP - a flexible pipeline for genome-resolved metagenomic data analysis. bioRxiv 277442. https://doi.org/10.1101/277442

Valencia CA, Pervaiz MA, Husami A, Qian Y, Zhang K (2013) Sanger Sequencing Principles, History, and Landmarks. In: Next Generation Sequencing Technologies in Medical Genetics. SpringerBriefs in Genetics. New York: Springer. https://doi.org/10.1007/978-1-4614-9032-6_1

Vaziri ND, Wong J, Pahl M, Piceno YM et al (2013) Chronic kidney disease alters intestinal microbial flora. Kidney Int 83(2):308–315. https://doi.org/10.1038/ki.2012.345

doi: 10.1038/ki.2012.345 pubmed: 22992469

Wagner J, Chelaru F, Kancherla J, Paulson JN et al (2018) Metaviz: interactive statistical and visual analysis of metagenomic data. Nucleic Acids Res 46:2777–2787. https://doi.org/10.1093/nar/gky136

doi: 10.1093/nar/gky136 pubmed: 29529268 pmcid: 5887897

Wajid B, Serpedin E (2011) Minimum description length based selection of reference sequences for comparative assemblers2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS) 230–233

Wajid B, Serpedin E (2012) Review of general algorithmic features for genome assemblers for next generation sequencers. GPB 10:58–73. https://doi.org/10.1016/j.gpb.2012.05.006

doi: 10.1016/j.gpb.2012.05.006 pubmed: 22768980 pmcid: 5054208

Wajid B, Serpedin E (2016) Do it yourself guide to genome assembly. Brief Funct Genom 15:1–9. https://doi.org/10.1093/bfgp/elu042

doi: 10.1093/bfgp/elu042

Wajid B, Serpedin E, Nounou M, Nounou H (2012a) MiB: a comparative assembly processing pipelineProceedings 2012 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS). IEEE 86-89. https://doi.org/10.1109/GENSIPS.2012.6507733

Wajid B, Serpedin E, Nounou M, Nounou H (2012b) Optimal reference sequence selection for genome assembly using minimum description length principle. EURASIP J Bioinform Syst Biol 2012:1–11. https://doi.org/10.1186/1687-4153-2012-18

doi: 10.1186/1687-4153-2012-18

Wajid B, Ekti AR, Noor A, Serpedin E, Ayyaz MN, Nounou H, Nounou M (2013) Supersonic mib2013 IEEE International Workshop on Genomic Signal Processing and Statistics. IEEE, 86–87. https://doi.org/10.1109/GENSIPS.2013.6735941

Wajid B, Serpedin E, Nounou M, Nounou H (2015) MARAGAP: a modular approach to reference assisted genome assembly pipeline. IJCBDD 8:226–250. https://doi.org/10.1504/IJCBDD.2015.072073

doi: 10.1504/IJCBDD.2015.072073

Wajid B, Sohail MU, Ekti AR, Serpedin E (2016) The A, C, G, and T of genome assembly. Biomed Res Int 2016. https://doi.org/10.1155/2016/6329217

Waldherr S (2014) A guideline to model reduction by stoichiometric decomposition for biochemical network analysis. Proc. of the 21st International Symposium on Mathematical Theory of Networks and Systems 490–495

Wang Y, Mehta G, Mayani R, Lu J et al (2011) RseqFlow: workflows for RNA-Seq data analysis. Bioinformatics 27:2598–2600. https://doi.org/10.1093/bioinformatics/btr441

doi: 10.1093/bioinformatics/btr441 pubmed: 21795323 pmcid: 3167055

Wang K, Singh D, Zeng Z, Coleman SJ et al (2010) MapSplice: accurate mapping of RNA-seq reads for splice junction discovery. Nucleic Acids Res 38:e178–e178. https://doi.org/10.1093/nar/gkq622

doi: 10.1093/nar/gkq622 pubmed: 20802226 pmcid: 2952873

Wang Y, Leung HCM, Yiu SM, Chin FYL (2014) MetaCluster-TA: taxonomic annotation for metagenomic data based on assembly-assisted binning. BMC Genomics. BioMed Central 1–9

Wang Q, Fish JA, Gilman M, Sun Y et al (2015a) Xander: employing a novel method for efficient gene-targeted metagenomic assembly. Microbiome 3:32. https://doi.org/10.1186/s40168-015-0093-6

doi: 10.1186/s40168-015-0093-6 pubmed: 26246894 pmcid: 4526283

Wang Y, Hu H, Li X (2015b) MBBC: an efficient approach for metagenomic binning based on clustering. BMC Bioinformatics 16:36. https://doi.org/10.1186/s12859-015-0473-8

doi: 10.1186/s12859-015-0473-8 pubmed: 25652152 pmcid: 4339733

Wang C, Dong D, Wang H et al (2016) Metagenomic analysis of microbial consortia enriched from compost: new insights into the role of Actinobacteria in lignocellulose decomposition. Biotechnol Biofuels 9:22. https://doi.org/10.1186/s13068-016-0440-2

Wang Y, Wang K, Lu YY, Sun F (2017) Improving contig binning of metagenomic data using [Formula: see text] oligonucleotide frequency dissimilarity. BMC Bioinformatics 18:425. https://doi.org/10.1186/s12859-017-1835-1

doi: 10.1186/s12859-017-1835-1 pubmed: 28931373 pmcid: 5607646

Wanichthanarak K, Fan S, Grapov D, Barupal DK, Fiehn O (2017) Metabox: A Toolbox for Metabolomic Data Analysis, Interpretation and Integrative Exploration. PLoS ONE 12:e0171046. https://doi.org/10.1371/journal.pone.0171046

doi: 10.1371/journal.pone.0171046 pubmed: 28141874 pmcid: 5283729

Ward J, Cole C, Febrer M, Barton GJ (2016) AlmostSignificant: simplifying quality control of high-throughput sequencing data. Bioinformatics 32:3850–3851. https://doi.org/10.1093/bioinformatics/btw559

doi: 10.1093/bioinformatics/btw559 pubmed: 27559158 pmcid: 5167069

Watson SJ, Welkers MR, Depledge DP et al (2013) Viral population analysis and minority-variant detection using short read next-generation sequencing. PHILOS T R SOC B 368:20120205. https://doi.org/10.1098/rstb.2012.0205

doi: 10.1098/rstb.2012.0205

Weber M, Teeling H, Huang S, Waldmann J et al (2011) Practical application of self-organizing maps to interrelate biodiversity and functional data in NGS-based metagenomics. ISME J 5:918–928. https://doi.org/10.1038/ismej.2010.180

doi: 10.1038/ismej.2010.180 pubmed: 21160538

Wienkoop S, Staudinger C, Hoehenwarter W, Weckwerth W, Egelhofer V (2012) ProMEX – a mass spectral reference database for plant proteomics. Front Plant Sci 3:125. https://doi.org/10.3389/fpls.2012.00125

doi: 10.3389/fpls.2012.00125 pubmed: 22685450 pmcid: 3368217

Williams W, Trindade M (2017) Metagenomics for the discovery of novel biosurfactants. Functional metagenomics: tools and applications. Springer, pp. 95–117

Wittig U, Kania R, Golebiewski M, Rey M, Shi L et al (2012) SABIO-RK —database for biochemical reaction kinetics. Nucleic Acids Res 40:D790–D796. https://doi.org/10.1093/nar/gkr1046

doi: 10.1093/nar/gkr1046 pubmed: 22102587

Wolfien M, Rimmbach C, Schmitz U, Jung JJ, Krebs S, Steinhoff G, David R, Wolkenhauer O (2016) TRAPLINE: a standardized and automated pipeline for RNA sequencing data analysis, evaluation and annotation. BMC Bioinformatics 17:1–11. https://doi.org/10.1186/s12859-015-0873-9

doi: 10.1186/s12859-015-0873-9

Wood DE, Salzberg SL (2014) Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol 15:R46. https://doi.org/10.1186/gb-2014-15-3-r46

doi: 10.1186/gb-2014-15-3-r46 pubmed: 24580807 pmcid: 4053813

Wu S, Zhu Z, Fu L, Niu B, Li W (2011) WebMGA: a customizable web server for fast metagenomic sequence analysis. BMC Genomics 12:444. https://doi.org/10.1186/1471-2164-12-444

doi: 10.1186/1471-2164-12-444 pubmed: 21899761 pmcid: 3180703

Wu Y-W, Rho M, Doak TG, Ye Y (2012) Stitching gene fragments with a network matching algorithm improves gene assembly for metagenomics. Bioinformatics 28:i363–i369. https://doi.org/10.1093/bioinformatics/bts388

doi: 10.1093/bioinformatics/bts388 pubmed: 22962453 pmcid: 3436815

Wu Y-W, Simmons BA, Singer SW (2016) MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics 32:605–607. https://doi.org/10.1093/bioinformatics/btv638

doi: 10.1093/bioinformatics/btv638 pubmed: 26515820

Wu Y-W, Ye Y (2011) A Novel Abundance-Based Algorithm for Binning Metagenomic Sequences Using l-tuples. J Comput Biol 18:523–534. https://doi.org/10.1089/cmb.2010.0245

doi: 10.1089/cmb.2010.0245 pubmed: 21385052 pmcid: 3123841

Xie W, Wang F, Guo L, Chen Z, Sievert SM et al (2011) Comparative metagenomics of microbial communities inhabiting deep-sea hydrothermal vent chimneys with contrasting chemistries. ISME J 5:414–426. https://doi.org/10.1038/ismej.2010.144

doi: 10.1038/ismej.2010.144 pubmed: 20927138

Yang X, Liu D, Liu F, Wu J et al (2013) HTQC: a fast quality control toolkit for Illumina sequencing data. BMC Bioinformatics 14:1–4. https://doi.org/10.1186/1471-2105-14-33

doi: 10.1186/1471-2105-14-33

Yates AD, Achuthan P, Akanni W, Allen J et al (2020) Ensembl 2020. Nucleic Acids Res 48:D682–D688. https://doi.org/10.1093/nar/gkz966

doi: 10.1093/nar/gkz966 pubmed: 31691826

Yoon SH, Ha SM, Kwon S, Lim J, Kim Y, Seo H, Chun J (2017) Introducing EzBioCloud: A taxonomically united database of 16S rRNA and whole genome assemblies. Int J Syst Evol Microbiol 67:1613–1617. https://doi.org/10.1099/ijsem.0.001755

doi: 10.1099/ijsem.0.001755 pubmed: 28005526 pmcid: 5563544

Yourstone SM, Lundberg DS, Dangl JL, Jones CD (2014) MT-Toolbox: improved amplicon sequencing using molecule tags. BMC Bioinformatics 15:1–7. https://doi.org/10.1186/1471-2105-15-284

doi: 10.1186/1471-2105-15-284

Yuan C, Lei J, Cole J, Sun Y (2015) Reconstructing 16S rRNA genes in metagenomic data. Bioinformatics 31:i35–i43. https://doi.org/10.1093/bioinformatics/btv231

doi: 10.1093/bioinformatics/btv231 pubmed: 26072503 pmcid: 4765874

Zacharias HU, Rehberg T, Mehrl S, Richtmann D et al (2017) Scale-Invariant Biomarker Discovery in Urine and Plasma Metabolite Fingerprints. J Proteome Res 16:3596–3605. https://doi.org/10.1021/acs.jproteome.7b00325

doi: 10.1021/acs.jproteome.7b00325 pubmed: 28825821

Zakrzewski M, Bekel T, Ander C, Pühler A et al (2013) MetaSAMS—A novel software platform for taxonomic classification, functional annotation and comparative analysis of metagenome datasets. J Biotechnol 167:156–165. https://doi.org/10.1016/j.jbiotec.2012.09.013

doi: 10.1016/j.jbiotec.2012.09.013 pubmed: 23026555

Zeng F, Wang Z, Wang Y, Zhou J, Chen T (2017) Large-scale 16S gene assembly using metagenomics shotgun sequences. Bioinformatics 33:1447–1456. https://doi.org/10.1093/bioinformatics/btx018

doi: 10.1093/bioinformatics/btx018 pubmed: 28158392

Zhang T, Luo Y, Liu K, Pan L, Zhang B, Yu J, Hu S (2011) BIGpre: a quality assessment package for next-generation sequencing data. Genom Proteom Bioinform 9:238–244. https://doi.org/10.1016/S1672-0229(11)60027-2

doi: 10.1016/S1672-0229(11)60027-2

Zhang Y, Sun Y, Cole JR (2013) A Sensitive and Accurate protein domain cLassification Tool (SALT) for short reads. Bioinformatics 29:2103–2111. https://doi.org/10.1093/bioinformatics/btt357

doi: 10.1093/bioinformatics/btt357 pubmed: 23782615

Zhang Y, Sun Y, Cole JR (2014) A Scalable and Accurate Targeted Gene Assembly Tool (SAT-Assembler) for Next-Generation Sequencing Data. PLoS Comput Biol 10:e1003737. https://doi.org/10.1371/journal.pcbi.1003737

doi: 10.1371/journal.pcbi.1003737 pubmed: 25122209 pmcid: 4133164

Zhao W, Liu W, Tian D, Tang B et al (2011) wapRNA: a web-based application for the processing of RNA sequences. Bioinformatics 27:3076–3077. https://doi.org/10.1093/bioinformatics/btr504

doi: 10.1093/bioinformatics/btr504 pubmed: 21896507

Zhou Q, Su X, Jing G, Chen S, Ning K (2018) RNA-QC-chain: comprehensive and fast quality control for RNA-Seq data. BMC Genomics 19:1–10. https://doi.org/10.1186/s12864-018-4503-6

doi: 10.1186/s12864-018-4503-6

Zhu W, Lomsadze A, Borodovsky M (2010) Ab initio gene identification in metagenomic sequences. Nucleic Acids Res 38:e132–e132. https://doi.org/10.1093/nar/gkq275

doi: 10.1093/nar/gkq275 pubmed: 20403810 pmcid: 2896542

Zhu J, Liao M, Yao Z, Liang W et al (2018) Breast cancer in postmenopausal women is associated with an altered gut metagenome. Microbiome 6:136. https://doi.org/10.1186/s40168-018-0515-3

doi: 10.1186/s40168-018-0515-3 pubmed: 30081953 pmcid: 6080540

Zitvogel L, Ma Y, Raoult D, Kroemer G, Gajewski TF (2018) The microbiome in cancer immunotherapy: Diagnostic tools and therapeutic strategies. Science 359:1366–1370. https://doi.org/10.1126/science.aar6918

doi: 10.1126/science.aar6918 pubmed: 29567708

Zou B, Li J, Zhou Q, Quan Z-X (2017) MIPE: A metagenome-based community structure explorer and SSU primer evaluation tool. PLoS ONE 12:e0174609. https://doi.org/10.1371/journal.pone.0174609

doi: 10.1371/journal.pone.0174609 pubmed: 28350876 pmcid: 5370157

Zytnicki M, Quesneville H (2011) S-MART, a software toolbox to aid RNA-Seq data analysis. PLoS ONE 6:e25988. https://doi.org/10.1371/journal.pone.0025988

doi: 10.1371/journal.pone.0025988 pubmed: 21998740 pmcid: 3188586

Music of metagenomics-a review of its applications, analysis pipeline, and associated tools.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Commentaires et corrections

Informations de copyright

Références

Auteurs

Bilal Wajid (B)

Faria Anwar (F)

Imran Wajid (I)

Haseeb Nisar (H)

Sharoze Meraj (S)

Ali Zafar (A)

Mustafa Kamal Al-Shawaqfeh (MK)

Ali Riza Ekti (AR)

Asia Khatoon (A)

Jan S Suchodolski (JS)

Articles similaires

Comprehensive comparative analysis and development of molecular markers for Lasianthus species based on complete chloroplast genome sequences.

Selecting optimal software code descriptors-The case of Java.

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Fasciola hepatica and Fasciola hybrid form co-existence in yak from Tibet of China: application of rDNA internal transcribed spacer.

Classifications MeSH