Music of metagenomics-a review of its applications, analysis pipeline, and associated tools.
Functional annotation
Metagenomics
Networks
Next-generation sequencing
Pathways
Taxonomy
Visualization
Journal
Functional & integrative genomics
ISSN: 1438-7948
Titre abrégé: Funct Integr Genomics
Pays: Germany
ID NLM: 100939343
Informations de publication
Date de publication:
Feb 2022
Feb 2022
Historique:
received:
29
03
2021
accepted:
03
10
2021
revised:
25
09
2021
pubmed:
19
10
2021
medline:
15
3
2022
entrez:
18
10
2021
Statut:
ppublish
Résumé
This humble effort highlights the intricate details of metagenomics in a simple, poetic, and rhythmic way. The paper enforces the significance of the research area, provides details about major analytical methods, examines the taxonomy and assembly of genomes, emphasizes some tools, and concludes by celebrating the richness of the ecosystem populated by the "metagenome."
Identifiants
pubmed: 34657989
doi: 10.1007/s10142-021-00810-y
pii: 10.1007/s10142-021-00810-y
doi:
Types de publication
Journal Article
Review
Langues
eng
Sous-ensembles de citation
IM
Pagination
3-26Commentaires et corrections
Type : ErratumIn
Informations de copyright
© 2021. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
Références
Afiahayati, Sato K, Sakakibara Y (2015) MetaVelvet-SL: an extension of the Velvet assembler to a de novo metagenomic assembler utilizing supervised learning. DNA Res 22:69-77. https://doi.org/10.1093/dnares/dsu041
Aggio RBM, Ruggiero K, Villas-Bôas SG (2010) Pathway Activity Profiling (PAPi): from the metabolite profile to the metabolic pathway activity. Bioinformatics 26:2969–2976. https://doi.org/10.1093/bioinformatics/btq567
doi: 10.1093/bioinformatics/btq567
pubmed: 20929912
Ainsworth D, Sternberg MJE, Raczy C, Butcher SA (2017) k-SLAM: accurate and ultra-fast taxonomic classification and gene identification for large metagenomic data sets. Nucleic Acids Res 45:1649–1656. https://doi.org/10.1093/nar/gkw1248
doi: 10.1093/nar/gkw1248
pubmed: 27965413
Alic AS, Blanquer I (2016) MuffinInfo: HTML5-Based Statistics Extractor from Next-Generation Sequencing Data. J Comput Biol 23:750–755. https://doi.org/10.1089/cmb.2016.0031
doi: 10.1089/cmb.2016.0031
pubmed: 27606794
Alkhateeb A, Rueda L (2017) Zseq: an approach for preprocessing next-generation sequencing data. J Comput Biol 24:746–755. https://doi.org/10.1089/cmb.2017.0021
doi: 10.1089/cmb.2017.0021
pubmed: 28414515
pmcid: 5563921
Alneberg J, Bjarnason BS, de Bruijn I, Schirmer M, Quick J, Ijaz UZ, Lahti L, Loman NJ, Andersson AF, Quince C (2014) Binning metagenomic contigs by coverage and composition. Nat Methods 11:1144–1146. https://doi.org/10.1038/nmeth.3103
doi: 10.1038/nmeth.3103
pubmed: 25218180
Alonso A, Lasseigne BN, Williams K, Nielsen J, Ramaker RC, Hardigan AA, Johnston B, Roberts BS, Cooper SJ, Marsal S, Myers RM (2017) aRNApipe: a balanced, efficient and distributed pipeline for processing RNA-seq data in high-performance computing environments. Bioinformatics 33:1727–1729. https://doi.org/10.1093/bioinformatics/btx023
doi: 10.1093/bioinformatics/btx023
pubmed: 28108448
pmcid: 5447234
Alshawaqfeh M, Bashaireh A, Serpedin E, Suchodolski J (2017a) Reliable Biomarker discovery from Metagenomic data via RegLRSD algorithm. BMC Bioinformatics 18:328. https://doi.org/10.1186/s12859-017-1738-1
doi: 10.1186/s12859-017-1738-1
pubmed: 28693478
pmcid: 5504766
AlShawaqfeh M, Wajid B, Minamoto Y, Markel M, Lidbury J, Steiner J, Serpedin E, Suchodolski J (2017b) A dysbiosis index to assess microbial changes in fecal samples of dogs with chronic inflammatory enteropathy. J FEMS Microbiol Ecol 93:fix136. https://doi.org/10.1093/femsec/fix136
doi: 10.1093/femsec/fix136
Alshawaqfeh M, Gharaibeh A, Wajid B (2019) A Hybrid Feature Selection Method for Classifying Metagenomic Data in Relation to Inflammatory Bowel DiseaseICAAI 2019: Proceedings of the 2019 3rd International Conference on Advances in Artificial Intelligence 86–89. 10.1145/ 3369114.3371675
Ames SK, Hysom DA, Gardner SN, Lloyd GS, Gokhale MB, Allen JE (2013) Scalable metagenomic taxonomy classification using a reference genome database. Bioinformatics 29:2253–2260. https://doi.org/10.1093/bioinformatics/btt389
doi: 10.1093/bioinformatics/btt389
pubmed: 23828782
pmcid: 3753567
Anand G, Zarrinpar A, Loomba R (2016) Targeting dysbiosis for the treatment of liver disease. Semin Liver Dis 36:37–47. https://doi.org/10.1055/s-0035-1571276
doi: 10.1055/s-0035-1571276
pubmed: 26870931
pmcid: 6658186
Anders S, Pyl PT, Huber W (2015) HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31:166–169. https://doi.org/10.1093/bioinformatics/btu638
doi: 10.1093/bioinformatics/btu638
Andreas B, McHardy AC (2018) Critical assessment of metagenome interpretation enters the second round. mSystems 3:e00103-e118. https://doi.org/10.1128/mSystems.00103-18
doi: 10.1128/mSystems.00103-18
Andrés-León E, Núñez-Torres R, Rojas AM (2016) miARma-Seq: a comprehensive tool for miRNA, mRNA and circRNA analysis. Sci Rep 6:1–8. https://doi.org/10.1038/srep25749
doi: 10.1038/srep25749
Andrews S (2010) FastQC: a quality control tool for high throughput sequence data. https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ . Accessed 13 Aug 2021
Arango-Argoty G, Singh G, Heath LS, Pruden A, Xiao W, Zhang L (2016) MetaStorm: A Public Resource for Customizable Metagenomics Annotation. PLoS ONE 11:e0162442. https://doi.org/10.1371/journal.pone.0162442
doi: 10.1371/journal.pone.0162442
pubmed: 27632579
pmcid: 5025195
Asnicar F, Weingart G, Tickle TL, Huttenhower C, Segata N (2015) Compact graphical representation of phylogenetic data and metadata with GraPhlAn. PeerJ 3:e1029. https://doi.org/10.7717/peerj.1029
doi: 10.7717/peerj.1029
pubmed: 26157614
pmcid: 4476132
Attwood TK, Coletta A, Muirhead G, Pavlopoulou A, Philippou PB et al (2012) The PRINTS database: a fine- grained protein sequence annotation and analysis resource—its status in 2012. Database 2012: bas019. https://doi.org/10.1093/database/bas019
Ayyala DN, Lin S (2015) GrammR: graphical representation and modeling of count data with application in metagenomics. Bioinformatics 31:1648–1654. https://doi.org/10.1093/bioinformatics/btv032
doi: 10.1093/bioinformatics/btv032
pubmed: 25609792
Bacci G, Bazzicalupo M, Benedetti A, Mengoni A (2014) StreamingTrim 1.0: a Java software for dynamic trimming of 16S rRNA sequence data from metagenetic studies. Mol Ecol Resour 14:426–434. https://doi.org/10.1111/1755-0998.12187
doi: 10.1111/1755-0998.12187
pubmed: 24128146
Banerjee J, Mishra N, Dhas YJMg (2015) Metagenomics: A new horizon in cancer research. J Meta Gene 5:84–89. https://doi.org/10.1016/j.mgene.2015.05.005
doi: 10.1016/j.mgene.2015.05.005
Bankevich A, Nurk S, Antipov D et al (2012) SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. https://doi.org/10.1089/cmb.2012.0021
doi: 10.1089/cmb.2012.0021
pubmed: 22506599
pmcid: 3342519
Basler G, Nikoloski Z (2011) JMassBalance: mass-balanced randomization and analysis of metabolic networks. Bioinformatics 27:2761–2762. https://doi.org/10.1093/bioinformatics/btr448
doi: 10.1093/bioinformatics/btr448
pubmed: 21803803
pmcid: 3179655
Benoit G, Peterlongo P, Mariadassou M, Drezen E, Schbath S, Lavenier D, Lemaitre C (2016) Multiple comparative metagenomics using multiset k-mer counting. PeerJ Comput Sci 2:e94. https://doi.org/10.7717/peerj-cs.94
doi: 10.7717/peerj-cs.94
Berendzen J, Bruno WJ, Cohn JD, Hengartner NW, Kuske CR, McMahon BH, Wolinsky MA, Xie G (2012) Rapid phylogenetic and functional classification of short genomic fragments with signature peptides. BMC Res Notes 5:460. https://doi.org/10.1186/1756-0500-5-460
doi: 10.1186/1756-0500-5-460
pubmed: 22925230
pmcid: 3772700
Bergmann EA, Chen BJ, Arora K, Vacic V, Zody MC (2016) Conpair: concordance and contamination estimator for matched tumor-normal pairs. Bioinformatics 32:3196–3198. https://doi.org/10.1093/bioinformatics/btw389
doi: 10.1093/bioinformatics/btw389
pubmed: 27354699
pmcid: 5048070
Berini F, Casciello C, Marcone GL, Marinelli F (2017) Metagenomics: novel enzymes from non-culturable microbes. FEMS Microbiol Lett 364:fnx211. https://doi.org/10.1093/femsle/fnx211
doi: 10.1093/femsle/fnx211
Bertrand D, Shaw J, Kalathiyappan M et al (2019) Hybrid metagenomic assembly enables high-resolution analysis of resistance determinants and mobile elements in human microbiomes. Nat Biotechnol 37:937–944. https://doi.org/10.1038/s41587-019-0191-2
doi: 10.1038/s41587-019-0191-2
pubmed: 31359005
Blin K, Pascal Andreu V, de los Santos ELC, Del Carratore F, Lee SY, Medema MH, Weber T (2019a) The antiSMASH database version 2: a comprehensive resource on secondary metabolite biosynthetic gene clusters. Nucleic Acids Res 47:D625-D630. https://doi.org/10.1093/nar/gky1060
Blin K, Shaw S, Steinke K, Villebro R, Ziemert N, Lee SY, Medema MH, Weber T (2019b) antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline. Nucleic Acids Res 47:W81–W87. https://doi.org/10.1093/nar/gkz310
doi: 10.1093/nar/gkz310
pubmed: 31032519
pmcid: 6602434
Blom J, Kreis J, Spanig S, Juhre T et al (2016) EDGAR 2.0: an enhanced software platform for comparative gene content analyses. Nucleic Acids Res 44:W22–W28. https://doi.org/10.1093/nar/gkw255
doi: 10.1093/nar/gkw255
pubmed: 27098043
pmcid: 4987874
Boisvert S, Raymond F, Godzaridis É, Laviolette F, Corbeil J (2012) Ray Meta: scalable de novo metagenome assembly and profiling. Genome Biol 13:R122. https://doi.org/10.1186/gb-2012-13-12-r122
doi: 10.1186/gb-2012-13-12-r122
pubmed: 23259615
pmcid: 4056372
Booth SC, Weljie AM, Turner RJ (2013) Comput Struct Biotechnol J 4:e201301003. https://doi.org/10.5936/csbj.201301003
doi: 10.5936/csbj.201301003
pubmed: 24688685
pmcid: 3962093
Borozan I, Ferretti V (2016) CSSSCL: a python package that uses combined sequence similarity scores for accurate taxonomic classification of long and short sequence reads. Bioinformatics 32:453–455. https://doi.org/10.1093/bioinformatics/btv587
doi: 10.1093/bioinformatics/btv587
pubmed: 26454281
Brandt BW, Bonder MJ, Huse SM, Zaura E (2012) TaxMan: a server to trim rRNA reference databases and inspect taxonomic coverage. Nucleic Acids Res 40:W82–W87. https://doi.org/10.1093/nar/gks418
doi: 10.1093/nar/gks418
pubmed: 22618877
pmcid: 3394339
Brown J, Pirrung M, McCue LA (2017) FQC Dashboard: integrates FastQC results into a web-based, interactive, and extensible FASTQ quality control tool. Bioinformatics 33:3137–3139. https://doi.org/10.1093/bioinformatics/btx373
doi: 10.1093/bioinformatics/btx373
pubmed: 28605449
pmcid: 5870778
Buchfink B, Xie C, Huson DH (2015) Fast and sensitive protein alignment using DIAMOND. Nat Methods 12:59–60. https://doi.org/10.1038/nmeth.3176
doi: 10.1038/nmeth.3176
pubmed: 25402007
pmcid: 25402007
Bushnell B (2014) BBMap: A Fast, Accurate, Splice-Aware Aligner. Berkeley: Lawrence Berkeley National Lab. (LBNL)
Cabanski CR, Cavin K, Bizon C, Wilkerson MD, Parker JS, Wilhelmsen KC, Perou CM, Marron J, Hayes DN (2012) ReQON: a Bioconductor package for recalibrating quality scores from next-generation sequencing data. BMC Bioinformatics 13:1–10. https://doi.org/10.1186/1471-2105-13-221
doi: 10.1186/1471-2105-13-221
Caboche S, Even G, Loywick A, Audebert C, Hot D (2017) MICRA: an automatic pipeline for fast characterization of microbial genomes from high-throughput sequencing data. Genome Biol 18:233. https://doi.org/10.1186/s13059-017-1367-z
doi: 10.1186/s13059-017-1367-z
pubmed: 29258574
pmcid: 5738152
Cantor M, Nordberg H, Smirnova T, Hess M, Tringe S, Dubchak I (2015) Elviz – exploration of metagenome assemblies with an interactive visualization tool. BMC Bioinf 16:130. https://doi.org/10.1186/s12859-015-0566-4
doi: 10.1186/s12859-015-0566-4
Cao R, Freitas C, Chan L, Sun M, Jiang H, Chen Z (2017) ProLanGO: Protein Function Prediction Using Neural Machine Translation Based on a Recurrent Neural Network. Molecules 22. https://doi.org/10.3390/molecules22101732
Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD et al (2010) QIIME allows analysis of high-throughput community sequencing data. Nat Methods 7:335–336. https://doi.org/10.1038/nmeth.f.303
doi: 10.1038/nmeth.f.303
pubmed: 20383131
pmcid: 20383131
Caspi R, Altman T, Dale JM, Dreher K et al (2010) The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res 38:D473–D479. https://doi.org/10.1093/nar/gkp875
doi: 10.1093/nar/gkp875
pubmed: 19850718
Cédric Cabau FE, Djari A, Guiguen Y, Bobe J, Klopp C (2017) Compacting and correcting Trinity and Oases RNA-Seq de novo assemblies. PeerJ 5:e2988. https://doi.org/10.7717/peerj.2988
doi: 10.7717/peerj.2988
pubmed: 28224052
pmcid: 5316280
Cepeda V, Liu B, Almeida M, Hill CM, Koren S, Treangen TJ, Pop M (2017) MetaCompass: reference-guided assembly of metagenomes. BioRxiv 212506. https://doi.org/10.1101/212506
Chaves I, Costa BV, Rodrigues AS, Bohn A, Miguel CM (2017) mi RP ursuit—a pipeline for automated analyses of small RNA s in model and nonmodel plants. FEBS Lett 591:2261–2268. https://doi.org/10.1002/1873-3468.12746
doi: 10.1002/1873-3468.12746
pubmed: 28686301
Chen S, Huang T, Zhou Y, Han Y, Xu M, Gu J (2017) AfterQC: automatic filtering, trimming, error removing and quality control for fastq data. BMC Bioinformatics 18:91–100. https://doi.org/10.1186/s12859-017-1469-3
doi: 10.1186/s12859-017-1469-3
Chen Y, Chen Y, Shi C, Huang Z, Zhang Y, Li S, Li Y, Ye J, Yu C, Li Z (2018) SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience 7:gix120. https://doi.org/10.1093/gigascience/gix120
doi: 10.1093/gigascience/gix120
Chiara M, Gioiosa S, Chillemi G, D’Antonio M, Flati T et al (2018a) CoVaCS: a consensus variant calling system. BMC Genomics 19:1–9. https://doi.org/10.1186/s12864-018-4508-1
doi: 10.1186/s12864-018-4508-1
Chiara M, Placido A, Picardi E, Ceci LR, Horner DS, Pesole G (2018b) A-GAME: improving the assembly of pooled functional metagenomics sequence data. BMC Genomics 19:1–10. https://doi.org/10.1186/s12864-017-4369-z
doi: 10.1186/s12864-017-4369-z
Chikhi R, Rizk G (2013) Space-efficient and exact de Bruijn graph representation based on a Bloom filter. Algorithms Mol Biol 8:22. https://doi.org/10.1186/1748-7188-8-22
doi: 10.1186/1748-7188-8-22
pubmed: 24040893
pmcid: 3848682
Chiu CY, Miller SA (2019) Clinical metagenomics. Nat Rev Genet 20:341–355. https://doi.org/10.1038/s41576-019-0113-7
doi: 10.1038/s41576-019-0113-7
pubmed: 30918369
pmcid: 6858796
Choi K, Smith LP, Medley JK, Sauro HM (2016) phraSED-ML: A paraphrased, human-readable adaptation of SED-ML. J Bioinform Comput Biol 14:1650035. https://doi.org/10.1142/s0219720016500359
doi: 10.1142/s0219720016500359
pubmed: 27774871
pmcid: 5313123
Chu J, Sadeghi S, Raymond A, Jackman SD, Nip KM, Mar R, Mohamadi H, Butterfield YS, Robertson AG, Birol I (2014) BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters. Bioinformatics 30:3402–3404. https://doi.org/10.1093/bioinformatics/btu558
doi: 10.1093/bioinformatics/btu558
pubmed: 25143290
pmcid: 4816029
Cibulskis K, McKenna A, Fennell T, Banks E, DePristo M, Getz G (2011) ContEst: estimating cross-contamination of human samples in next-generation sequencing data. Bioinformatics 27:2601–2602. https://doi.org/10.1093/bioinformatics/btr446
doi: 10.1093/bioinformatics/btr446
pubmed: 21803805
pmcid: 3167057
Clark K, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW (2016) GenBank. Nucleic Acids Res 44:D67–D72. https://doi.org/10.1093/nar/gkv1276
doi: 10.1093/nar/gkv1276
pubmed: 26590407
Clark SC, Egan R, Frazier PI, Wang Z (2013) ALE: a generic assembly likelihood evaluation framework for assessing the accuracy of genome and metagenome assemblies. Bioinformatics 29:435–443. https://doi.org/10.1093/bioinformatics/bts723
doi: 10.1093/bioinformatics/bts723
pubmed: 23303509
Clos-Garcia M, Garcia K, Alonso C et al (2020) Integrative Analysis of Fecal Metagenomics and Metabolomics in Colorectal Cancer. Cancers (basel) 12(5):1142. https://doi.org/10.3390/cancers12051142
doi: 10.3390/cancers12051142
Correia D, Doppelt-Azeroual O, Denis J-B, Vandenbogaert M, Caro V (2015) MetaGenSense: A web-application for analysis and exploration of high throughput sequencing metagenomic data. F1000Research 4:86. https://doi.org/10.12688/f1000research.6139.3
doi: 10.12688/f1000research.6139.3
pubmed: 28451381
Cole JR, Wang Q, Fish JA, Chai B, McGarrell DM et al (2014) Ribosomal Database Project: data and tools for high throughput rRNA analysis. Nucleic Acids Res 42:D633–D642. https://doi.org/10.1093/nar/gkt1244
doi: 10.1093/nar/gkt1244
pubmed: 24288368
Compeau PEC, Pevzner PA, Tesler G (2011) How to apply de Bruijn graphs to genome assembly. Nat Biotechnol 29:987–991. https://doi.org/10.1038/nbt.2023
doi: 10.1038/nbt.2023
pubmed: 22068540
pmcid: 5531759
Cox MP, Peterson DA, Biggs PJ (2010) SolexaQA: At-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinformatics 11:1–6. https://doi.org/10.1186/1471-2105-11-485
doi: 10.1186/1471-2105-11-485
Crispatzu G, Kulkarni P, Toliat MR, Nürnberg P, Herling M, Herling CD, Frommolt P (2017) Semi-automated cancer genome analysis using high-performance computing. Hum Mutat 38:1325–1335. https://doi.org/10.1002/humu.23275
doi: 10.1002/humu.23275
pubmed: 28598576
Cuccuru G, Orsini M, Pinna A, Sbardellati A, Soranzo N, Travaglione A, Uva P, Zanetti G, Fotia G (2014) Orione, a web-based framework for NGS analysis in microbiology. Bioinformatics 30:1928–1929. https://doi.org/10.1093/bioinformatics/btu135
doi: 10.1093/bioinformatics/btu135
pubmed: 24618473
pmcid: 4071203
D’Antonio M, D’Onorio De Meo P, Pallocca M, Picardi E, D’Erchia AM, Calogero RA, Castrignanò T, Pesole G (2015) RAP: RNA-Seq Analysis Pipeline, a new cloud-based NGS web application. BMC Genomics 16:S3. https://doi.org/10.1186/1471-2164-16-S6-S3
doi: 10.1186/1471-2164-16-S6-S3
pubmed: 26046471
pmcid: 4461013
Darling AE, Jospin G, Lowe E, Matsen FA IV, Bik HM, Eisen JA (2014) PhyloSift: phylogenetic analysis of genomes and metagenomes. PeerJ 2:e243. https://doi.org/10.7717/peerj.243
doi: 10.7717/peerj.243
pubmed: 24482762
pmcid: 3897386
Davenport CF, Neugebauer J, Beckmann N, Friedrich B et al (2012) Genometa - A Fast and Accurate Classifier for Short Metagenomic Shotgun Reads. PLoS ONE 7:e41224. https://doi.org/10.1371/journal.pone.0041224
doi: 10.1371/journal.pone.0041224
pubmed: 22927906
pmcid: 3424124
Davis MP, van Dongen S, Abreu-Goodger C, Bartonicek N, Enright AJ (2013) Kraken: a set of tools for quality control and analysis of high-throughput sequence data. Methods 63:41–49. https://doi.org/10.1016/j.ymeth.2013.06.027
doi: 10.1016/j.ymeth.2013.06.027
pubmed: 23816787
pmcid: 3991327
Davis NM, Proctor DM, Holmes SP, Relman DA, Callahan BJ (2018) Simple statistical identification and removal of contaminant sequences in marker-gene and metagenomics data. bioRxiv 221499. https://doi.org/10.1101/221499
De Anda V, Zapata-Peñasco I, Poot-Hernandez AC, Eguiarte LE, Contreras-Moreira B, Souza V (2017) MEBS, a software platform to evaluate large (meta)genomic collections according to their metabolic machinery: unraveling the sulfur cycle. GigaScience 6. https://doi.org/10.1093/gigascience/gix096
de Oliveira GLV, Leite AZ, Higuchi BS, Gonzaga MI, Mariano VS (2017) Intestinal dysbiosis and probiotic applications in autoimmune diseases. J Immunology 152:1–12. https://doi.org/10.1111/imm.12765
doi: 10.1111/imm.12765
DeSantis TZ, Hugenholtz P, Larsen N, Rojas M et al (2020) Greengenes, a Chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol 72:5069–5072. https://doi.org/10.1128/AEM.03006-05
doi: 10.1128/AEM.03006-05
Deutsch EW (2010) The PeptideAtlas Project. In: Hubbard S, Jones A (eds) Proteome Bioinformatics. Methods in Molecular Biology™ (Methods and Protocols), vol 604. Humana Press. 10.1007/978-1-60761-444-9_19
Dhawan A, Barberis A, Cheng W-C, Domingo E et al (2017) sigQC: A procedural approach for standardising the evaluation of gene signatures. bioRxiv 203729. https://doi.org/10.1101/203729
Ding X, Cheng F, Cao C, Sun X (2015) DectICO: an alignment-free supervised metagenomic classification method based on feature extraction and dynamic selection. BMC Bioinformatics 16:323. https://doi.org/10.1186/s12859-015-0753-3
doi: 10.1186/s12859-015-0753-3
pubmed: 26446672
pmcid: 4596415
Dong X, Kleiner M, Sharp CE, Thorson E, Li C, Liu D, Strous M (2017) Fast and Simple Analysis of MiSeq Amplicon Sequencing Data with MetaAmp. Front Microbiol 8:1461. https://doi.org/10.3389/fmicb.2017.01461
doi: 10.3389/fmicb.2017.01461
pubmed: 28824589
pmcid: 5540949
Douglas-Klotz N (2005) The Sufi book of life: 99 pathways of the heart for the modern dervish. Penguin
Drost H-G, Paszkowski J (2017) Biomartr: genomic data retrieval with R. Bioinformatics 33:1216–1217. https://doi.org/10.1093/bioinformatics/btw821
doi: 10.1093/bioinformatics/btw821
pubmed: 28110292
pmcid: 5408848
Dutilh BE, Schmieder R, Nulton J, Felts B, Salamon P, Edwards RA, Mokili JL (2012) Reference-independent comparative metagenomics using cross-assembly: crAss. Bioinformatics 28:3225–3231. https://doi.org/10.1093/bioinformatics/bts613
doi: 10.1093/bioinformatics/bts613
pubmed: 23074261
pmcid: 3519457
Edwards RA, Olson R, Disz T, Pusch GD, Vonstein V, Stevens R, Overbeek R (2012) Real Time Metagenomics: Using k-mers to annotate metagenomes. Bioinformatics 28:3316–3317. https://doi.org/10.1093/bioinformatics/bts599
doi: 10.1093/bioinformatics/bts599
pubmed: 23047562
pmcid: 3519453
Edgar RC (2013) UPARSE: highly accurate OTU sequences from microbial amplicon reads. Nat Methods 10:996–998. https://doi.org/10.1038/nmeth.2604
doi: 10.1038/nmeth.2604
pubmed: 23955772
Escudié F, Auer L, Bernard M, Mariadassou M, Cauquil L, Vidal K, Maman S, Hernandez-Raquet G, Combes S, Pascal G (2018) FROGS: Find, Rapidly, OTUs with Galaxy Solution. Bioinformatics 34:1287–1294. https://doi.org/10.1093/bioinformatics/btx791
doi: 10.1093/bioinformatics/btx791
pubmed: 29228191
Esfandyarpour H, Parizi KB, Barmi MR, Rategh H, Wang L et al (2019) High accuracy DNA sequencing on a small, scalable platform via electrical detection of single base incorporations. bioRxiv 604553. https://doi.org/10.1101/604553
Esling P, Lejzerowicz F, Pawlowski J (2015) Accurate multiplexing and filtering for high-throughput amplicon-sequencing. Nucleic Acids Res 43:2513–2524. https://doi.org/10.1093/nar/gkv107
doi: 10.1093/nar/gkv107
pubmed: 25690897
pmcid: 4357712
Ewels P, Magnusson M, Lundin S, Käller M (2016) MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32:3047–3048. https://doi.org/10.1093/bioinformatics/btw354
doi: 10.1093/bioinformatics/btw354
pubmed: 5039924
pmcid: 5039924
Fabregat A, Sidiropoulos K, Viteri G, Forner O, Marin-Garcia P et al (2017) Reactome pathway analysis: a high-performance in-memory approach. BMC Bioinformatics 18:142. https://doi.org/10.1186/s12859-017-1559-2
doi: 10.1186/s12859-017-1559-2
pubmed: 28249561
pmcid: 5333408
Fadrosh DW, Ma B, Gajer P, Sengamalay N, Ott S, Brotman RM, Ravel J (2014) An improved dual-indexing approach for multiplexed 16S rRNA gene sequencing on the Illumina MiSeq platform. Microbiome 2:6. https://doi.org/10.1186/2049-2618-2-6
doi: 10.1186/2049-2618-2-6
pubmed: 3940169
pmcid: 3940169
Fazekas D, Koltai M, Türei D, Módos D et al (2013) SignaLink 2 – a signaling pathway resource with multi-layered regulatory networks. BMC Syst Biol 7:7. https://doi.org/10.1186/1752-0509-7-7
doi: 10.1186/1752-0509-7-7
pubmed: 23331499
pmcid: 3599410
Fierst JL, Murdock DA (2017) Decontaminating eukaryotic genome assemblies with machine learning. BMC Bioinformatics 18:533. https://doi.org/10.1186/s12859-017-1941-0
doi: 10.1186/s12859-017-1941-0
pubmed: 29191179
pmcid: 5709863
Finn RD, Mistry J, Tate J, Coggill P, Heger A (2014) Pfam: the protein families database. Nucleic Acids Res. https://doi.org/10.1093/nar/gkt1223
Firtina C, Bar-Joseph Z, Alkan C, Cicek AE (2018) Hercules: a profile HMM-based hybrid error correction algorithm for long reads. Nucleic Acids Res 46:e125–e125. https://doi.org/10.1093/nar/gky724
doi: 10.1093/nar/gky724
pubmed: 30124947
pmcid: 6265270
Flygare S, Simmon K, Miller C, Qiao Y, Kennedy B et al (2016) Taxonomer: an interactive metagenomics analysis portal for universal pathogen detection and host mRNA expression profiling. Genome Biol 17:111. https://doi.org/10.1186/s13059-016-0969-1
doi: 10.1186/s13059-016-0969-1
pubmed: 27224977
pmcid: 4880956
Fotouhi A, Majidi M, Külekci MO (2018) Quality Assessment of High-Throughput DNA Sequencing Data via Range Analysis. In: Rojas I., Ortuño F. (eds) Bioinformatics and Biomedical Engineering. IWBBIO 2018. Lecture Notes in Computer Science, vol 10813. Cham: Springer. https://doi.org/10.1007/978-3-319-78723-7_37
Foster ZSL, Sharpton TJ, Grünwald NJ (2017) Metacoder: An R package for visualization and manipulation of community taxonomic diversity data. PLoS Comput Biol 13:e1005404. https://doi.org/10.1371/journal.pcbi.1005404
doi: 10.1371/journal.pcbi.1005404
pubmed: 28222096
pmcid: 5340466
Freese NH, Norris DC, Loraine AE (2016) Integrated genome browser: visual analytics platform for genomics. Bioinformatics 32:2089–2095. https://doi.org/10.1093/bioinformatics/btw069
doi: 10.1093/bioinformatics/btw069
pubmed: 27153568
pmcid: 4937187
French KE (2017) Engineering Mycorrhizal Symbioses to Alter Plant Metabolism and Improve Crop Health. Front Microbiol 8:1403. https://doi.org/10.3389/fmicb.2017.01403
doi: 10.3389/fmicb.2017.01403
pubmed: 28785256
pmcid: 5519612
Galanti L, Shasha D, Gunsalus KC (2017) Pheniqs: Fast and flexible quality-aware sequence demultiplexing. bioRxiv 128512. https://doi.org/10.1101/128512
Genovo AD, Buena-Atienza E, Ossowski S, Sagot MF (2019) WENGAN: Efficient and high quality hybrid de novo assembly of human genomes bioRxiv 840447. https://doi.org/10.1101/840447
Gieg LM, Toth CR (2016) Anaerobic biodegradation of hydrocarbons: metagenomics and metabolomics. Springer
Gillespie JJ, Wattam AR, Cammer SA, Gabbard JL et al (2011) PATRIC: the comprehensive bacterial bioinformatics resource with a focus on human pathogenic species. Infect Immun 79:4286–4298. https://doi.org/10.1128/IAI.00207-11
doi: 10.1128/IAI.00207-11
pubmed: 21896772
pmcid: 3257917
Giraldo-Calderón GI, Emrich SJ, MacCallum RM, Maslen G, Dialynas E, Topalis P et al (2015) VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases. Nucleic Acids Res 43:D707–D713. https://doi.org/10.1093/nar/gku1117
doi: 10.1093/nar/gku1117
pubmed: 25510499
Girotto S, Pizzi C, Comin M (2016) MetaProb: accurate metagenomic reads binning based on probabilistic sequence signatures. Bioinformatics 32:i567–i575. https://doi.org/10.1093/bioinformatics/btw466
doi: 10.1093/bioinformatics/btw466
pubmed: 27587676
Gori F, Folino G, Jetten MSM, Marchiori E (2011) MTR: taxonomic annotation of short metagenomic reads using clustering at multiple taxonomic ranks. Bioinformatics 27:196–203. https://doi.org/10.1093/bioinformatics/btq649
doi: 10.1093/bioinformatics/btq649
pubmed: 21127032
Goswami M, Chakraborty P, Mukherjee K, Mitra G, Bhattacharyya P, Dey S, Tribedi P (2018) Bioaugmentation and biostimulation: a potential strategy for environmental remediation. J Microbiol Exp 6:223–231. https://doi.org/10.15406/jmen.2018.06.00219
doi: 10.15406/jmen.2018.06.00219
Graham EDHJ, Tully BJ (2017) BinSanity: unsupervised clustering of environmental microbial assemblies using coverage and affinity propagation. PeerJ 5:e3035. https://doi.org/10.7717/peerj.3035
doi: 10.7717/peerj.3035
pubmed: 28289564
pmcid: 5345454
Gregor I, Schönhuth A, McHardy AC (2016) Snowball: strain aware gene assembly of metagenomes. Bioinformatics 32:i649–i657. https://doi.org/10.1093/bioinformatics/btw426
doi: 10.1093/bioinformatics/btw426
pubmed: 27587685
Guan D, Liu B, Wang Y (2018) deSPI: efficient classification of metagenomics reads with lightweight de Bruijn graph-based reference indexing2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 265–269. https://doi.org/10.1101/080200
Guo X, Yu N, Ding X, Wang J, Pan Y (2015) DIME: A Novel Framework for De Novo Metagenomic Sequence Assembly. J Comput Biol 22:159–177. https://doi.org/10.1089/cmb.2014.0251
doi: 10.1089/cmb.2014.0251
pubmed: 25684202
pmcid: 4326031
Haft DH, Selengut JD, Richter RA, Harkins D, Basu MK, Beck E (2003) TIGRFAMs and genome properties in 2013. Nucleic Acids Res 41:D387–D395. https://doi.org/10.1093/nar/gks1234
doi: 10.1093/nar/gks1234
Haider B, Ahn T-H, Bushnell B, Chai J, Copeland A, Pan C (2014) Omega: an overlap-graph de novo assembler for metagenomics. Bioinformatics 30:2717–2722. https://doi.org/10.1093/bioinformatics/btu395
doi: 10.1093/bioinformatics/btu395
pubmed: 24947750
Hamilton JJ, Reed JL (2012) Identification of Functional Differences in Metabolic Networks Using Comparative Genomics and Constraint-Based Models. PLoS ONE 7:e34670. https://doi.org/10.1371/journal.pone.0034670
doi: 10.1371/journal.pone.0034670
pubmed: 22666308
pmcid: 3359066
Hanson NW, Konwar KM, Hallam SJ (2016) LCA*: an entropy-based measure for taxonomic assignment within assembled metagenomes. Bioinformatics 32:3535–3542. https://doi.org/10.1093/bioinformatics/btw400
doi: 10.1093/bioinformatics/btw400
pubmed: 27515739
pmcid: 5181528
Harismah K, Mirzaei M, Ghasemi N, Nejati M (2018) Non-Covalent Functionalisation of C30 Fullerene by Pyrrole-n-Carboxylic Acid (n=2, 3): Density Functional Theory Studies. Z Nat Forsch A J Phys Sci 73:51–56. https://doi.org/10.1515/zna-2017-0233
doi: 10.1515/zna-2017-0233
Hatzopoulos T, Watkins SC, Putonti C (2016) PhagePhisher: a pipeline for the discovery of covert viral sequences in complex genomic datasets. Microb Genom 2:e000053. https://doi.org/10.1099/mgen.0.000053
doi: 10.1099/mgen.0.000053
pubmed: 28348848
pmcid: 5320576
Hitch TCA, Creevey CJ (2018) Spherical: an iterative workflow for assembling metagenomic datasets. BMC Bioinformatics 19:20. https://doi.org/10.1186/s12859-018-2028-2
doi: 10.1186/s12859-018-2028-2
pubmed: 29361904
pmcid: 5781261
Hong C, Manimaran S, Shen Y, Perez-Rogers JF, Byrd AL, Castro-Nallar E, Crandall KA, Johnson WE (2014) PathoScope 2.0: a complete computational framework for strain identification in environmental or clinical sequencing samples. Microbiome 2:1–15. https://doi.org/10.1186/2049-2618-2-33
doi: 10.1186/2049-2618-2-33
Howe KL, Bolt BJ, Shafie M, Kersey P, Berriman M (2017) WormBase ParaSite − a comprehensive resource for helminth genomics. Mol Biochem Parasitol 215:2–10. https://doi.org/10.1016/j.molbiopara.2016.11.005
doi: 10.1016/j.molbiopara.2016.11.005
pubmed: 27899279
pmcid: 5486357
Huse SM, Mark Welch DB, Voorhis A, Shipunova A, Morrison HG, Eren AM, Sogin ML (2014) VAMPS: a website for visualization and analysis of microbial population structures. BMC Bioinformatics 15:41. https://doi.org/10.1186/1471-2105-15-41
doi: 10.1186/1471-2105-15-41
pubmed: 24499292
pmcid: 3922339
Huson DH, Weber N (2013) Chapter Twenty-One - Microbial Community Analysis Using MEGAN. In: DeLong EF (ed) Methods Enzymol. Academic Press 465–485.
Hyatt D, LoCascio PF, Hauser LJ, Uberbacher EC (2012) Gene and translation initiation site prediction in metagenomic sequences. Bioinformatics 28:2223–2230. https://doi.org/10.1093/bioinformatics/bts429
doi: 10.1093/bioinformatics/bts429
pubmed: 22796954
Icay K, Chen P, Cervera A, Rantanen V, Lehtonen R, Hautaniemi S (2016) SePIA: RNA and small RNA sequence processing, integration, and analysis. BioData Min 9:20. https://doi.org/10.1186/s13040-016-0099-z
doi: 10.1186/s13040-016-0099-z
pubmed: 27213017
pmcid: 4875694
Imelfort M, Parks D, Woodcroft BJ, Dennis P, Hugenholtz P, Tyson GW (2014) GroopM: an automated tool for the recovery of population genomes from related metagenomes. PeerJ 2:e603. https://doi.org/10.7717/peerj.603
doi: 10.7717/peerj.603
pubmed: 25289188
pmcid: 4183954
Ismail WM, Ye Y, Tang H (2014) Gene finding in metatranscriptomic sequences. BMC Bioinformatics 15:S8. https://doi.org/10.1186/1471-2105-15-S9-S8
doi: 10.1186/1471-2105-15-S9-S8
pubmed: 25253067
pmcid: 4168707
Iyer S, Bouzek H, Deng W, Larsen B, Casey E, Mullins JI (2013) Quality score based identification and correction of pyrosequencing errors. PLoS ONE 8:e73015. https://doi.org/10.1371/journal.pone.0073015
doi: 10.1371/journal.pone.0073015
pubmed: 24039850
pmcid: 3764156
Jain M, Olsen HE, Paten B, Akeson M (2016) The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community. Genome Biol 17:239. https://doi.org/10.1186/s13059-016-1103-0
doi: 10.1186/s13059-016-1103-0
pubmed: 27887629
pmcid: 5124260
Jadeja NB, Purohit HJ, Kapley A (2019) Decoding microbial community intelligence through metagenomics for efficient wastewater treatment. Funct Integr Genomics 19:839–851. https://doi.org/10.1007/s10142-019-00681-4
doi: 10.1007/s10142-019-00681-4
pubmed: 31111267
Ji P, Zhang Y, Wang J, Zhao F (2017) MetaSort untangles metagenome assembly by reducing microbial community complexity. Nat Commun 8:14306. https://doi.org/10.1038/ncomms14306
doi: 10.1038/ncomms14306
pubmed: 28112173
pmcid: 5264255
Jia P, Xuan L, Liu L, Wei C (2011) MetaBinG: Using GPUs to Accelerate Metagenomic Sequence Classification. PLoS ONE 6:e25353. https://doi.org/10.1371/journal.pone.0025353
doi: 10.1371/journal.pone.0025353
pubmed: 22132069
pmcid: 3223155
Jiang H, An L, Lin SM, Feng G, Qiu Y (2012) A Statistical Framework for Accurate Taxonomic Assignment of Metagenomic Sequencing Reads. PLoS ONE 7:e46450. https://doi.org/10.1371/journal.pone.0046450
doi: 10.1371/journal.pone.0046450
pubmed: 23049702
pmcid: 3462201
Jonathan B, Puritz CMH, Gold JR (2014) dDocent: a RADseq, variant-calling pipeline designed for population genomics of non-model organisms. PeerJ 2:e431. https://doi.org/10.7717/peerj.431
doi: 10.7717/peerj.431
Jost L, DeVries P, Walla T, Greeney H, Chao A, Ricotta C (2010) Partitioning diversity for conservation analyses. Divers Distrib 16:65–76. https://doi.org/10.1111/j.1472-4642.2009.00626.x
doi: 10.1111/j.1472-4642.2009.00626.x
Jourdren L, Bernard M, Dillies M-A, Le Crom S (2012) Eoulsan: a cloud computing-based framework facilitating high throughput sequencing analyses. Bioinformatics 28:1542–1543. https://doi.org/10.1093/bioinformatics/bts165
doi: 10.1093/bioinformatics/bts165
pubmed: 22492314
Kamath GM, Shomorony I, Xia F, Courtade TA, Tse DN (2017) HINGE: long-read assembly achieves optimal repeat resolution. Genome Res 27:747–756. https://doi.org/10.1101/gr.216465.116
doi: 10.1101/gr.216465.116
pubmed: 28320918
pmcid: 5411769
Kamneva OK (2017) Genome composition and phylogeny of microbes predict their co-occurrence in the environment. PLoS Comput Biol 13:e1005366. https://doi.org/10.1371/journal.pcbi.1005366
doi: 10.1371/journal.pcbi.1005366
pubmed: 28152007
pmcid: 5313232
Kanehisa M, Sato Y, Morishima K (2016) BlastKOALA and GhostKOALA: KEGG Tools for Functional Characterization of Genome and Metagenome Sequences. J Mol Biol 428:726–731. https://doi.org/10.1016/j.jmb.2015.11.006
doi: 10.1016/j.jmb.2015.11.006
pubmed: 26585406
pmcid: 26585406
Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K (2017) KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res 45:D353–D361. https://doi.org/10.1093/nar/gkw1092
doi: 10.1093/nar/gkw1092
pubmed: 27899662
Kang DD, Froula J, Egan R, Wang Z (2015) MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ 3:e1165. https://doi.org/10.7717/peerj.1165
doi: 10.7717/peerj.1165
pubmed: 26336640
pmcid: 4556158
Kawulok J, Deorowicz S (2015) CoMeta: Classification of Metagenomes Using k-mers. PLoS ONE 10:e0121453. https://doi.org/10.1371/journal.pone.0121453
doi: 10.1371/journal.pone.0121453
pubmed: 25884504
pmcid: 4401624
Kelley DR, Liu B, Delcher AL, Pop M, Salzberg SL (2012) Gene prediction with Glimmer for metagenomic sequences augmented by classification and clustering. Nucleic Acids Res 40:e9–e9. https://doi.org/10.1093/nar/gkr1067
doi: 10.1093/nar/gkr1067
pubmed: 22102569
Kelley DR, Salzberg SL (2010) Clustering metagenomic sequences with interpolated Markov models. BMC Bioinformatics 11:544. https://doi.org/10.1186/1471-2105-11-544
doi: 10.1186/1471-2105-11-544
pubmed: 21044341
pmcid: 3098094
Kerepesi C, Szalkai B, Grolmusz V (2015) Visual analysis of the quantitative composition of metagenomic communities: the AmphoraVizu webserver. Microb Ecol 69:695–697. https://doi.org/10.1007/s00248-014-0502-6
doi: 10.1007/s00248-014-0502-6
pubmed: 25296554
Kolmogorov M, Yuan J, Lin Y et al (2019) Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. https://doi.org/10.1038/s41587-019-0072-8
doi: 10.1038/s41587-019-0072-8
pubmed: 30936562
Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM (2017) Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res 27:722–736. https://doi.org/10.1101/gr.215087.116
doi: 10.1101/gr.215087.116
pubmed: 28298431
pmcid: 5411767
Koringa PG, Thakkar JR, Pandit RJ, Hinsu AT et al (2018) Metagenomic characterization of ruminal bacterial diversity in buffaloes from birth to adulthood using 16S rRNA gene amplicon sequencing. Funct Integr Genomics 19:237–247. https://doi.org/10.1007/s10142-018-0640-x
doi: 10.1007/s10142-018-0640-x
pubmed: 30357583
Kho ZY, Lal SK (2018) The Human gut microbiome- A potential controller of wellness and disease. Front Microbiol 9:1835. https://doi.org/10.3389/fmicb.2018.01835
doi: 10.3389/fmicb.2018.01835
pubmed: 6102370
pmcid: 6102370
Kim D, Song L, Breitwieser FP, Salzberg SL (2016) Centrifuge: rapid and sensitive classification of metagenomic sequences. Genome Res 26:1721–1729. https://doi.org/10.1101/gr.210641.116
doi: 10.1101/gr.210641.116
pubmed: 27852649
pmcid: 5131823
Kobus R, Hundt C, Müller A, Schmidt B (2017) Accelerating metagenomic read classification on CUDA-enabled GPUs. BMC Bioinformatics 18:11. https://doi.org/10.1186/s12859-016-1434-6
doi: 10.1186/s12859-016-1434-6
pubmed: 28049411
pmcid: 5209836
Kornobis E, Cabellos L, Aguilar F, Frías-López C, Rozas J, Marco J, Zardoya R (2015) TRUFA: a user-friendly web server for de novo RNA-seq analysis using cluster computing. Evol Bioinforma 11:97–104. https://doi.org/10.4137/EBO.S23873
doi: 10.4137/EBO.S23873
Koslicki D, Foucart S, Rosen G (2014) WGSQuikr: Fast Whole-Genome Shotgun Metagenomic Classification. PLoS ONE 9:e91784. https://doi.org/10.1371/journal.pone.0091784
doi: 10.1371/journal.pone.0091784
pubmed: 24626336
pmcid: 3953531
Kozlov AM, Zhang J, Yilmaz P, Glöckner FO, Stamatakis A (2016) Phylogeny-aware identification and correction of taxonomically mislabeled sequences. Nucleic Acids Res 44:5022–5033. https://doi.org/10.1093/nar/gkw396
doi: 10.1093/nar/gkw396
pubmed: 27166378
pmcid: 4914121
Kroll KW, Mokaram NE, Pelletier AR, Frankhouser DE, Westphal MS, Stump PA, Stump CL, Bundschuh R, Blachly JS, Yan P (2014) Quality Control for RNA-Seq (QuaCRS): an integrated quality control pipeline. Cancer Inform 13:17–17. https://doi.org/10.4137/CIN.S14022
doi: 10.4137/CIN.S14022
Kultima JR, Coelho LP, Forslund K, Huerta-Cepas J, Li SS, Driessen M, Voigt AY, Zeller G, Sunagawa S, Bork P (2016) MOCAT2: a metagenomic assembly, annotation and profiling framework. Bioinformatics 32:2520–2523. https://doi.org/10.1093/bioinformatics/btw183
doi: 10.1093/bioinformatics/btw183
pubmed: 27153620
pmcid: 4978931
Kumar S, Jones M, Koutsovoulos G, Clarke M, Blaxter M (2013) Blobology: exploring raw genome data for contaminants, symbionts and parasites using taxon-annotated GC-coverage plots. Front Genet 4. https://doi.org/10.3389/fgene.2013.00237
Laczny CC, Sternal T, Plugaru V, Gawron P, Atashpendar A, Margossian HH, Coronado S, der Maaten LV, Vlassis N, Wilmes P (2015) VizBin - an application for reference-independent visualization and human-augmented binning of metagenomic data. Microbiome 3:1. https://doi.org/10.1186/s40168-014-0066-1
doi: 10.1186/s40168-014-0066-1
pubmed: 25621171
pmcid: 4305225
Lafond-Lapalme J, Duceppe M-O, Wang S, Moffett P, Mimee B (2016) A new method for decontamination of de novo transcriptomes using a hierarchical clustering algorithm. Bioinformatics 33:1293–1300. https://doi.org/10.1093/bioinformatics/btw793%JBioinformatics
doi: 10.1093/bioinformatics/btw793%JBioinformatics
Lai B, Wang F, Wang X, Duan L, Zhu H (2015) InteMAP: Integrated metagenomic assembly pipeline for NGS short reads. BMC Bioinformatics 16:244. https://doi.org/10.1186/s12859-015-0686-x
doi: 10.1186/s12859-015-0686-x
pubmed: 26250558
pmcid: 4545859
Lam K-K, Hall R, Clum A, Rao S (2016) BIGMAC : breaking inaccurate genomes and merging assembled contigs for long read metagenomic assembly. BMC Bioinformatics 17:435. https://doi.org/10.1186/s12859-016-1288-y
doi: 10.1186/s12859-016-1288-y
pubmed: 27793084
pmcid: 5084376
Land M, Hauser L, Jun SR, Nookaew I et al (2015) Insights from 20 years of bacterial genome sequencing. Funct Integr Genomics 15:141–161. https://doi.org/10.1007/s10142-015-0433-4
doi: 10.1007/s10142-015-0433-4
pubmed: 25722247
pmcid: 4361730
Laserson J, Jojic V, Koller D (2011) Genovo: De Novo Assembly for Metagenomes. J Comput Biol 18:429–443. https://doi.org/10.1089/cmb.2010.0244
doi: 10.1089/cmb.2010.0244
pubmed: 21385045
Lassmann T, Hayashizaki Y, Daub CO (2011) SAMStat: monitoring biases in next generation sequencing data. Bioinformatics 27:130–131. https://doi.org/10.1093/bioinformatics/btq614
doi: 10.1093/bioinformatics/btq614
pubmed: 21088025
Le Boulch M, Déhais P, Combes S, Pascal G (2019) The MACADAM database: a MetAboliC pAthways DAtabase for Microbial taxonomic groups for mining potential metabolic capacities of archaeal and bacterial taxonomic groups. Database 2019. https://doi.org/10.1093/database/baz049
Le VV, Tran LV, Tran HV (2016) A novel semi-supervised algorithm for the taxonomic assignment of metagenomic reads. BMC Bioinformatics 17:22. https://doi.org/10.1186/s12859-015-0872-x
Lechat P, Souche E, Moszer I (2013) SynTView — an interactive multi-view genome browser for next-generation comparative microorganism genomics. BMC Bioinformatics 14:277. https://doi.org/10.1186/1471-2105-14-277
doi: 10.1186/1471-2105-14-277
pubmed: 24053737
pmcid: 3849071
Lees JG, Lee D, Studer RA, Dawson NL, Sillitoe I et al (2014) Gene3D: Multi-domain annotations for protein sequence and comparative genome analysis. Nucleic Acids Res 42:D240–D245. https://doi.org/10.1093/nar/gkt1205
doi: 10.1093/nar/gkt1205
pubmed: 24270792
Leggett RM, Clavijo BJ, Clissold L, Clark MD, Caccamo M (2014) NextClip: an analysis and read preparation tool for Nextera Long Mate Pair libraries. Bioinformatics 30:566–568. https://doi.org/10.1093/bioinformatics/btt702
doi: 10.1093/bioinformatics/btt702
pubmed: 24297520
Leinonen R, Akhtar R, Birney E, Bower L, Cerdeno-Tarraga A et al (2010) The European Nucleotide Archive. Nucleic Acids Res 39:D28–D31. https://doi.org/10.1093/nar/gkq967
doi: 10.1093/nar/gkq967
pubmed: 20972220
pmcid: 3013801
Leinonen R, Sugawara H, Shumway M, on behalf of the International Nucleotide Sequence Database C (2011) The Sequence Read Archive. Nucleic Acids Res 39:D19–D21. https://doi.org/10.1093/nar/gkq1019
doi: 10.1093/nar/gkq1019
pubmed: 21062823
Li R, Zhu H, Ruan J, Qian W et al (2010) (2010) De novo assembly of human genomes with massively parallel short read sequencing. Genome Res 20(2):265–272. https://doi.org/10.1101/gr.097261.109
doi: 10.1101/gr.097261.109
pubmed: 20019144
pmcid: 2813482
Li D, Huang Y, Leung C-M, Luo R, Ting H-F, Lam T-W (2017) MegaGTA: a sensitive and accurate metagenomic gene-targeted assembler using iterative de Bruijn graphs. BMC Bioinformatics 18:408. https://doi.org/10.1186/s12859-017-1825-3
doi: 10.1186/s12859-017-1825-3
pubmed: 29072142
pmcid: 5657035
Li D, Liu C-M, Luo R, Sadakane K, Lam T-W (2015) MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 31:1674–1676. https://doi.org/10.1093/bioinformatics/btv033
doi: 10.1093/bioinformatics/btv033
Li Z, Chen Y, Mu D, Yuan J, Shi Y (2016) Comparison of the two major classes of assembly algorithms: overlap-layout consensus and de-bruijn-graph. Brief Funct Genomics 11:25–37. https://doi.org/10.1093/bfgp/elr035
doi: 10.1093/bfgp/elr035
Lin H-H, Liao Y-C (2016) Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes. Sci Rep 6:24175. https://doi.org/10.1038/srep24175
doi: 10.1038/srep24175
pubmed: 27067514
pmcid: 4828714
Lin Y-Y, Hsieh C-H, Chen J-H, Lu X, Kao J-H, Chen P-J, Chen D-S, Wang H-Y (2017) De novo assembly of highly polymorphic metagenomic data using in situ generated reference sequences and a novel BLAST-based assembly pipeline. BMC Bioinformatics 18:223. https://doi.org/10.1186/s12859-017-1630-z
doi: 10.1186/s12859-017-1630-z
pubmed: 28446139
pmcid: 5406902
Lindner MS, Kollock M, Zickmann F, Renard BY (2013) Analyzing genome coverage profiles with applications to quality control in metagenomics. Bioinformatics 29:1260–1267. https://doi.org/10.1093/bioinformatics/btt147
doi: 10.1093/bioinformatics/btt147
pubmed: 23589648
Liu B, Gibbons T, Ghodsi M, Treangen T, Pop M (2011) Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences. Genome Biol 12:P11. https://doi.org/10.1186/1465-6906-12-S1-P11
doi: 10.1186/1465-6906-12-S1-P11
pmcid: 3439023
Liu J, Wang H, Yang H, Zhang Y, Wang J, Zhao F, Qi J (2013) Composition-based classification of short metagenomic sequences elucidates the landscapes of taxonomic and functional enrichment of microorganisms. Nucleic Acids Res 41:e3–e3. https://doi.org/10.1093/nar/gks828
doi: 10.1093/nar/gks828
pubmed: 22941634
Liu Y, Ripp F, Koeppel R, Schmidt H, Hellmann SL, Weber M, Krombholz CF, Schmidt B, Hankeln T (2017) AFS: identification and quantification of species composition by metagenomic sequencing. Bioinformatics 33:1396–1398. https://doi.org/10.1093/bioinformatics/btw822
doi: 10.1093/bioinformatics/btw822
pubmed: 28453677
Lo C-C, Chain PS (2014) Rapid evaluation and quality control of next generation sequencing data with FaQCs. BMC Bioinformatics 15:1–8. https://doi.org/10.1186/s12859-014-0366-2
doi: 10.1186/s12859-014-0366-2
Lohse M, Bolger AM, Nagel A, Fernie AR, Lunn JE, Stitt M, Usadel B (2012) R obi NA: A user-friendly, integrated software solution for RNA-Seq-based transcriptomics. Nucleic Acids Res 40:W622–W627. https://doi.org/10.1093/nar/gks540
doi: 10.1093/nar/gks540
pubmed: 22684630
pmcid: 3394330
Loman T (2017) A Novel Method for Predicting Ribosomal RNA Genes in Prokaryotic Genomes. http://lup.lub.lu.se/student-papers/record/8914064
Lu YY, Chen T, Fuhrman JA, Sun F (2017) COCACOLA: binning metagenomic contigs using sequence COmposition, read CoverAge, CO-alignment and paired-end read LinkAge. Bioinformatics 33:791–798. https://doi.org/10.1093/bioinformatics/btw290
doi: 10.1093/bioinformatics/btw290
pubmed: 27256312
Luo C, Rodriguez-R LM, Konstantinidis KT (2014) MyTaxa: an advanced taxonomic classifier for genomic and metagenomic sequences. Nucleic Acids Res 42:e73–e73. https://doi.org/10.1093/nar/gku169
doi: 10.1093/nar/gku169
pubmed: 24589583
pmcid: 4005636
Lux M, Krüger J, Rinke C, Maus I, Schlüter A, Woyke T, Sczyrba A, Hammer B (2016) acdc – Automated Contamination Detection and Confidence estimation for single-cell genome data. BMC Bioinformatics 17:543. https://doi.org/10.1186/s12859-016-1397-7
doi: 10.1186/s12859-016-1397-7
pubmed: 27998267
pmcid: 5168860
MacDonald NJ, Parks DH, Beiko RG (2012) Rapid identification of high-confidence taxonomic assignments for metagenomic data. Nucleic Acids Res 40:e111–e111. https://doi.org/10.1093/nar/gks335
doi: 10.1093/nar/gks335
pubmed: 22532608
pmcid: 3413139
Maglott D, Ostell J, Pruitt KD, Tatusova T (2011) Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res 39:D52–D57. https://doi.org/10.1093/nar/gkq1237
doi: 10.1093/nar/gkq1237
pubmed: 21115458
Mallet L, Bitard-Feildel T, Cerutti F, Chiapello H (2017) PhylOligo: a package to identify contaminant or untargeted organism sequences in genome assemblies. Bioinformatics 33:3283–3285. https://doi.org/10.1093/bioinformatics/btx396
doi: 10.1093/bioinformatics/btx396
pubmed: 28637232
pmcid: 5860033
Manconi A, Manca E, Moscatelli M, Gnocchi M, Orro A, Armano G, Milanesi L, Biotechnology (2015) G-CNV: a GPU-based tool for preparing data to detect CNVs with read-depth methods. Front Bioeng Biotechnol 3:28. https://doi.org/10.3389/fbioe.2015.00028
doi: 10.3389/fbioe.2015.00028
pubmed: 25806367
pmcid: 4354384
Mapleson D, Garcia Accinelli G, Kettleborough G, Wright J, Clavijo BJ (2017) KAT: a K-mer analysis toolkit to quality control NGS datasets and genome assemblies. Bioinformatics 33:574–576. https://doi.org/10.1093/bioinformatics/btw663
doi: 10.1093/bioinformatics/btw663
pubmed: 27797770
Mariette J, Noirot C, Klopp C (2011) Assessment of replicate bias in 454 pyrosequencing and a multi-purpose read-filtering tool. BMC Res Notes 4:1–4. https://doi.org/10.1186/1756-0500-4-149
doi: 10.1186/1756-0500-4-149
Martin J, Bruno VM, Fang Z, Meng X, Blow M, Zhang T et al (2010) Rnnotator: an automated de novo transcriptome assembly pipeline from stranded RNA-Seq reads. BMC Genomics 11:1–8. https://doi.org/10.1186/1471-2164-11-663
doi: 10.1186/1471-2164-11-663
Markowitz VM, Chen IMA, Palaniappan K, Chu K et al (2012) IMG: the integrated microbial genomes database and comparative analysis system. Nucleic Acids Res 40:D115–D122
doi: 10.1093/nar/gkr1044
Masella AP, Bartram AK, Truszkowski JM, Brown DG, Neufeld JD (2012) PANDAseq: paired-end assembler for illumina sequences. BMC Bioinformatics 13:1–7. https://doi.org/10.1186/1471-2105-13-31
doi: 10.1186/1471-2105-13-31
Matsen FA, Kodner RB, Armbrust EV (2010) pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree. BMC Bioinformatics 11:538. https://doi.org/10.1186/1471-2105-11-538
doi: 10.1186/1471-2105-11-538
pubmed: 21034504
pmcid: 3098090
May A, Abeln S, Buijs MJ, Heringa J, Crielaard W, Brandt BW (2015) NGS-eval: NGS Error analysis and novel sequence VAriant detection tooL. Nucleic Acids Res 43:W301–W305. https://doi.org/10.1093/nar/gkv346
doi: 10.1093/nar/gkv346
pubmed: 25878034
pmcid: 4489229
McNally CP, Eng A, Noecker C, Gagne-Maynard WC, Borenstein E (2018) BURRITO: An Interactive Multi-Omic Tool for Visualizing Taxa-Function Relationships in Microbiome Data. Front Microbiol 9:365. https://doi.org/10.3389/fmicb.2018.00365
doi: 10.3389/fmicb.2018.00365
pubmed: 29545787
pmcid: 5837987
Meinicke P (2015) UProC: tools for ultra-fast protein domain classification. Bioinformatics 31:1382–1388. https://doi.org/10.1093/bioinformatics/btu843
doi: 10.1093/bioinformatics/btu843
pubmed: 25540185
Meißner T, Fisch KM, Gioia L, Su AI (2015) OncoRep: an n-of-1 reporting tool to support genome-guided treatment for breast cancer patients using RNA-sequencing. BMC Med Genomics 8:1–8. https://doi.org/10.1186/s12920-015-0095-z
doi: 10.1186/s12920-015-0095-z
Mendoza-Parra MA, Saleem M-AM, Blum M, Cholley P-E, Gronemeyer H (2016) NGS-QC generator: a quality control system for ChIP-Seq and related deep sequencing-generated datasetsStatistical Genomics. Springer 243–265
Menzel P, Ng KL, Krogh A (2016) Fast and sensitive taxonomic classification for metagenomics with Kaiju. Nat Commun 7:11257. https://doi.org/10.1038/ncomms11257
doi: 10.1038/ncomms11257
pubmed: 27071849
pmcid: 4833860
Merriman B, Rothberg JM (2012) Progress in ion torrent semiconductor chip based sequencing. Electrophoresis 33:3397–3417. https://doi.org/10.1002/elps.201200424
doi: 10.1002/elps.201200424
pubmed: 23208921
Metwally AA, Dai Y, Finn PW, Perkins DL (2016) WEVOTE: Weighted Voting Taxonomic Identification Method of Microbial Sequences. PLoS ONE 11:e0163527. https://doi.org/10.1371/journal.pone.0163527
doi: 10.1371/journal.pone.0163527
pubmed: 27683082
pmcid: 5040256
Meyer F, Hofmann P, Belmann P, Garrido-Oter R, Fritz A, Sczyrba A, McHardy AC (2018) AMBER: Assessment of Metagenome BinnERs. GigaScience 7. https://doi.org/10.1093/gigascience/giy069
Mikheenko A, Saveliev V, Gurevich A (2016) MetaQUAST: evaluation of metagenome assemblies. Bioinformatics 32:1088–1090. https://doi.org/10.1093/bioinformatics/btv697
doi: 10.1093/bioinformatics/btv697
pubmed: 26614127
Miller CS, Baker BJ, Thomas BC, Singer SW, Banfield JF (2011) EMIRGE: reconstruction of full-length ribosomal genes from microbial community short read sequencing data. Genome Biol 12:R44. https://doi.org/10.1186/gb-2011-12-5-r44
doi: 10.1186/gb-2011-12-5-r44
pubmed: 21595876
pmcid: 3219967
Minot SS, Krumm N, Greenfield NB (2015) One codex: a sensitive and accurate data platform for genomic microbial identification. BioRxiv 027607. https://doi.org/10.1101/027607
Mitchell A, Chang H-Y, Daugherty L, Fraser M, Hunter S et al (2015) The InterPro protein families database: the classification resource after 15 years. Nucleic Acids Res 43:D213–D221. https://doi.org/10.1093/nar/gku1243
doi: 10.1093/nar/gku1243
pubmed: 25428371
Morgat A, Coissac E, Coudert E, Axelsen KB, Keller G, Bairoch A, Bridge A, Bougueleret L, Xenarios I, Viari A (2012) UniPathway: a resource for the exploration and annotation of metabolic pathways. Nucleic Acids Res 40:D761–D769. https://doi.org/10.1093/nar/gkr1023
doi: 10.1093/nar/gkr1023
pubmed: 22102589
Moss EL, Bishara A, Tkachenko E, Kang JB, Andermann TM, Wood C, Handy C, Ji H, Batzoglou S, Bhatt AS (2017) De novo assembly of microbial genomes from human gut metagenomes using barcoded short read sequences. bioRxiv 125211. https://doi.org/10.1101/125211
Nakano Y, Takeshita T, Yasui M, Yamashita Y (2010) Prediction of plausible bacterial composition based on terminal restriction fragment length polymorphisms using a Monte Carlo method. Microb Ecol 60:364–372. https://doi.org/10.1007/s00248-010-9703-9
doi: 10.1007/s00248-010-9703-9
pubmed: 20574825
Namiki T, Hachiya T, Tanaka H, Sakakibara Y (2012) MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads. Nucleic Acids Res 40:e155–e155. https://doi.org/10.1093/nar/gks678
doi: 10.1093/nar/gks678
pubmed: 22821567
pmcid: 3488206
Navarro JF, Sjöstrand J, Salmén F, Lundeberg J, Ståhl PL (2017) ST Pipeline: an automated pipeline for spatial mapping of unique transcripts. Bioinformatics 33:2591–2593. https://doi.org/10.1093/bioinformatics/btx211
doi: 10.1093/bioinformatics/btx211
pubmed: 28398467
Nayfach S, Rodriguez-Mueller B, Garud N, Pollard KS (2016) An integrated metagenomics pipeline for strain profiling reveals novel patterns of bacterial transmission and biogeography. Genome Res 26:1612–1625. https://doi.org/10.1101/gr.201863.115
doi: 10.1101/gr.201863.115
pubmed: 27803195
pmcid: 5088602
Nazir A (2016) Review on metagenomics and its applications. J Imp J Intersd Res 2:10
Ng C, Li H, Wu WKK, Wong SH, Yu J (2019) Genomics and metagenomics of colorectal cancer. J Gastrointest Oncol 10:1164–1170. https://doi.org/10.21037/jgo.2019.06.04
doi: 10.21037/jgo.2019.06.04
pubmed: 31949936
pmcid: 6955013
Nipperess DA, Matsen FA IV (2013) The mean and variance of phylogenetic diversity under rarefaction. Methods Ecol Evol 4:566–572. https://doi.org/10.1111/2041-210X.12042
doi: 10.1111/2041-210X.12042
pubmed: 23833701
pmcid: 3699894
O’Halloran DM (2017) fastQ_brew: module for analysis, preprocessing, and reformatting of FASTQ sequence data. BMC Res Notes 10:1–4. https://doi.org/10.1186/s13104-017-2616-7
doi: 10.1186/s13104-017-2616-7
Ogasawara O, Kodama Y, Mashima J, Kosuge T, Fujisawa T (2020) DDBJ Database updates and computational infrastructure enhancement. Nucleic Acids Res 48:D45–D50. https://doi.org/10.1093/nar/gkz982
doi: 10.1093/nar/gkz982
pubmed: 31724722
Oh J, Kim BK, Cho W-S, Hong SG, Kim KM (2012) PyroTrimmer: a software with GUI for pre-processing 454 amplicon sequences. J Microbiol 50:766–769. https://doi.org/10.1007/s12275-012-2494-6
doi: 10.1007/s12275-012-2494-6
pubmed: 23124743
Okuda S, Tsuchiya Y, Kiriyama C, Itoh M, Morisaki H (2012) Virtual metagenome reconstruction from 16S rRNA gene sequences. Nat Commun 3:1203. https://doi.org/10.1038/ncomms2203
doi: 10.1038/ncomms2203
pubmed: 23149747
Ondov BD, Bergman NH, Phillippy AM (2011) Interactive metagenomic visualization in a Web browser. BMC Bioinformatics 12:385. https://doi.org/10.1186/1471-2105-12-385
doi: 10.1186/1471-2105-12-385
pubmed: 21961884
pmcid: 3190407
Orakov AN, Sakenova NK, Sorokin A, Goryanin II (2018) ASAR: visual analysis of metagenomes in R. Bioinformatics 34:1404–1405. https://doi.org/10.1093/bioinformatics/btx775
doi: 10.1093/bioinformatics/btx775
pubmed: 29211828
Orellana LH, Rodriguez RL, Konstantinidis KT (2017) ROCker: accurate detection and quantification of target genes in short-read metagenomic data sets by modeling sliding-window bitscores. Nucleic Acids Res 45:e14. https://doi.org/10.1093/nar/gkw900
doi: 10.1093/nar/gkw900
pubmed: 28180325
Pandey RV, Pabinger S, Kriegner A, Weinhäusel A (2016) ClinQC: a tool for quality control and cleaning of Sanger and NGS data in clinical research. BMC Bioinformatics 17:1–9. https://doi.org/10.1186/s12859-016-0915-y
doi: 10.1186/s12859-016-0915-y
Parida S, Sharma D (2019) The power of small changes: Comprehensive analyses of microbial dysbiosis in breast cancer. J Biochim Biophys Acta Rev Cancer 1871:392–405. https://doi.org/10.1016/j.bbcan.2019.04.001
doi: 10.1016/j.bbcan.2019.04.001
Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW (2015) CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. https://doi.org/10.1101/gr.186072.114
doi: 10.1101/gr.186072.114
pubmed: 25977477
pmcid: 4484387
Patel RK, Jain M (2012) NGS QC Toolkit: a toolkit for quality control of next generation sequencing data. PLoS ONE 7:e30619. https://doi.org/10.1371/journal.pone.0030619
doi: 10.1371/journal.pone.0030619
pubmed: 22312429
pmcid: 3270013
Pati A, Heath LS, Kyrpides NC, Ivanova N (2011) ClaMS: A Classifier for Metagenomic Sequences. Stand Genom Sci 5:248–253. https://doi.org/10.4056/sigs.2075298
doi: 10.4056/sigs.2075298
Patil KR, Roune L, McHardy AC (2012) The PhyloPythiaS Web Server for Taxonomic Assignment of Metagenome Sequences. PLoS ONE 7:e38581. https://doi.org/10.1371/journal.pone.0038581
doi: 10.1371/journal.pone.0038581
pubmed: 22745671
pmcid: 3380018
Pehrsson EC, Tsukayama P, Patel S, Mejía-Bautista M, Sosa-Soto G, Navarrete KM, Calderon M, Cabrera L, Hoyos-Arango W, Bertoli MT, Berg DE, Gilman RH, Dantas G (2016) Interconnected microbiomes and resistomes in low-income human habitats. Nature 533:212–216. https://doi.org/10.1038/nature17672
doi: 10.1038/nature17672
pubmed: 27172044
pmcid: 4869995
Peng Y, Leung HCM, Yiu SM, Chin FYL (2011) Meta-IDBA: a de Novo assembler for metagenomic data. Bioinformatics 27:i94–i101. https://doi.org/10.1093/bioinformatics/btr216
doi: 10.1093/bioinformatics/btr216
pubmed: 21685107
pmcid: 3117360
Peng Y, Leung HCM, Yiu SM, Chin FYL (2012) IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 28:1420–1428. https://doi.org/10.1093/bioinformatics/bts174
doi: 10.1093/bioinformatics/bts174
Peng Y, Maxwell AS, Barker ND, Laird JG, Kennedy AJ, Wang N, Zhang C, Gong P (2014) SeqAssist: a novel toolkit for preliminary analysis of next-generation sequencing dataBMC Bioinformatics. Springer 1–11
Perez-Riverol Y, Csordas A, Bai J, Bernal-Llinares M, Hewapathirana S et al (2019) The PRIDE database and related tools and resources in 2019: improving support for quantification data. Nucleic Acids Res 47:D442–D450. https://doi.org/10.1093/nar/gky1106
doi: 10.1093/nar/gky1106
pubmed: 30395289
pmcid: 30395289
Pericard P, Dufresne Y, Couderc L, Blanquart S, Touzet H (2018) MATAM: reconstruction of phylogenetic marker genes from short sequencing reads in metagenomes. Bioinformatics 34:585–591. https://doi.org/10.1093/bioinformatics/btx644
doi: 10.1093/bioinformatics/btx644
pubmed: 29040406
Peterlongo P, Chikhi R (2012) Mapsembler, targeted and micro assembly of large NGS datasets on a desktop computer. BMC Bioinformatics 13:48. https://doi.org/10.1186/1471-2105-13-48
doi: 10.1186/1471-2105-13-48
pubmed: 22443449
pmcid: 3514201
Petersen TN, Lukjancenko O, Thomsen MCF, Maddalena Sperotto M, Lund O, Møller Aarestrup F, Sicheritz-Pontén T (2017) MGmapper: Reference based mapping and taxonomy annotation of metagenomics sequence reads. PLoS ONE 12:e0176469. https://doi.org/10.1371/journal.pone.0176469
doi: 10.1371/journal.pone.0176469
pubmed: 28467460
pmcid: 5415185
Piro VC, Lindner MS, Renard BY (2016) DUDes: a top-down taxonomic profiler for metagenomics. Bioinformatics 32:2272–2280. https://doi.org/10.1093/bioinformatics/btw150
doi: 10.1093/bioinformatics/btw150
pubmed: 27153591
Porter MS, Beiko RG (2013) SPANNER: taxonomic assignment of sequences using pyramid matching of similarity profiles. Bioinformatics 29:1858–1864. https://doi.org/10.1093/bioinformatics/btt313
doi: 10.1093/bioinformatics/btt313
pubmed: 23732273
pmcid: 3712219
Potter SC, Luciani A, Eddy SR, Park Y, Lopez R, Finn RD (2018) HMMER web server: 2018 update. Nucleic Acids Res 46:W200–W204. https://doi.org/10.1093/nar/gky448
doi: 10.1093/nar/gky448
pubmed: 29905871
pmcid: 6030962
Pujar S, O’Leary NA, Farrell CM, Loveland JE et al (2018) Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation. Nucleic Acids Res 46:D221–D228. https://doi.org/10.1093/nar/gkx1031
doi: 10.1093/nar/gkx1031
pubmed: 29126148
Qiu Y, Tian X, Zhang S (2015) Infer Metagenomic Abundance and Reveal Homologous Genomes Based on the Structure of Taxonomy Tree. IEEE/ACM Trans Comput Biol Bioinform 12:1112–1122. https://doi.org/10.1109/TCBB.2015.2415814
doi: 10.1109/TCBB.2015.2415814
pubmed: 26451823
Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T et al (2013) The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res 41:D590–D596. https://doi.org/10.1093/nar/gks1219
doi: 10.1093/nar/gks1219
pubmed: 23193283
Ramanan R, Kim B-H, Cho D-H, Oh H-M, Kim H-S (2016) Algae–bacteria interactions: Evolution, ecology and emerging applications. Biotechnol Adv 34:14–29. https://doi.org/10.1016/j.biotechadv.2015.12.003
doi: 10.1016/j.biotechadv.2015.12.003
pubmed: 26657897
Ramirez-Gonzales RH, Leggett RM, Waite D et al (2013) StatsDB: platform-agnostic storage and understanding of next generation sequencing run metrics. F1000Research 2:248. https://doi.org/10.12688/f1000research.2-248.v2
doi: 10.12688/f1000research.2-248.v2
Ramos RT, Carneiro AR, Baumbach J, Azevedo V, Schneider MP, Silva A (2011) Analysis of quality raw data of second generation sequencers with Quality Assessment Software. BMC Res Notes 4:1–6. https://doi.org/10.1186/1756-0500-4-130
doi: 10.1186/1756-0500-4-130
Rappoport N, Linial N, Linial M (2013) ProtoNet: charting the expanding universe of protein sequences. Nat Biotechnol 31:290–292. https://doi.org/10.1038/nbt.2553
doi: 10.1038/nbt.2553
pubmed: 23563419
Rasheed Z, Rangwala H (2012) Metagenomic taxonomic classification using extreme learning machines. J Bioinform Comput Biol 10:1250015. https://doi.org/10.1142/S0219720012500151
doi: 10.1142/S0219720012500151
pubmed: 22849369
Rho M, Tang H, Ye Y (2010) FragGeneScan: predicting genes in short and error-prone reads. Nucleic Acids Res 38:e191–e191. https://doi.org/10.1093/nar/gkq747
doi: 10.1093/nar/gkq747
pubmed: 20805240
pmcid: 2978382
Rhoads A, Au KF (2015) PacBio Sequencing and Its Application. Genom Proteom Bioinf 13:278–289. https://doi.org/10.1016/j.gpb.2015.08.002
doi: 10.1016/j.gpb.2015.08.002
Rineh A, Kelso MJ, Vatansever F, Tegos GP, Hamblin MR (2014) Clostridium difficile infection: molecular pathogenesis and novel therapeutics. Expert Rev Anti Infect Ther 12:131–150. https://doi.org/10.1586/14787210.2014.866515
doi: 10.1586/14787210.2014.866515
pubmed: 24410618
pmcid: 4306399
Robertson CE, Harris JK, Wagner BD, Granger D, Browne K, Tatem B, Feazel LM, Park K, Pace NR, Frank DN (2013) Explicet: graphical user interface software for metadata-driven management, analysis and visualization of microbiome data. Bioinformatics 29:3100–3101. https://doi.org/10.1093/bioinformatics/btt526
doi: 10.1093/bioinformatics/btt526
pubmed: 24021386
pmcid: 3834795
Rodrigue S, Materna AC, Timberlake SC, Blackburn MC, Malmstrom RR, Alm EJ, Chisholm SW (2010) Unlocking Short Read Sequencing for Metagenomics. PLoS ONE 5:e11840. https://doi.org/10.1371/journal.pone.0011840
doi: 10.1371/journal.pone.0011840
pubmed: 20676378
pmcid: 2911387
Rodriguez-Martinez A, Ayala R, Posma JM, Harvey N et al (2019) pJRES Binning Algorithm (JBA): a new method to facilitate the recovery of metabolic information from pJRES 1H NMR spectra. Bioinformatics 35:1916–1922. https://doi.org/10.1093/bioinformatics/bty837
doi: 10.1093/bioinformatics/bty837
pubmed: 30351417
Rodriguez-Martinez A, Ayala R, Posma JM et al (2017) MetaboSignal: a network-based approach for topological analysis of metabotype regulation via metabolic and signaling pathways. Bioinformatics 33:773–775. https://doi.org/10.1093/bioinformatics/btw697
doi: 10.1093/bioinformatics/btw697
pubmed: 28011775
Rodriguez-Martinez A, Posma JM, Ayala R et al (2018) MWASTools: an R/bioconductor package for metabolome-wide association studies. Bioinformatics 34:890–892. https://doi.org/10.1093/bioinformatics/btx477
doi: 10.1093/bioinformatics/btx477
pubmed: 28961702
Rodriguez-r LM, Konstantinidis KT (2014) Estimating coverage in metagenomic data sets and why it matters. ISME J 8:2349–2351. https://doi.org/10.1038/ismej.2014.76
doi: 10.1038/ismej.2014.76
pubmed: 24824669
pmcid: 4992084
Rosenbloom KR, Armstrong J, Barber GP, Casper J, Clawson H et al (2015) The UCSC Genome Browser database: 2015 update. Nucleic Acids Res 43:D670–D681. https://doi.org/10.1093/nar/gku1177
doi: 10.1093/nar/gku1177
pubmed: 25428374
Rosen GL, Reichenberger ER, Rosenfeld AM (2011) NBC: the Naïve Bayes Classification tool webserver for taxonomic classification of metagenomic reads. Bioinformatics 27:127–129. https://doi.org/10.1093/bioinformatics/btq619
doi: 10.1093/bioinformatics/btq619
pubmed: 21062764
Rozov R, Goldshlager G, Halperin E, Shamir R (2018) Faucet: streaming de novo assembly graph construction. Bioinformatics 34:147–154. https://doi.org/10.1093/bioinformatics/btx471
doi: 10.1093/bioinformatics/btx471
pubmed: 29036597
Ruan J, Li H (2020) Fast and accurate long-read assembly with wtdbg2. Nat Methods 17:155–158. https://doi.org/10.1038/s41592-019-0669-3
doi: 10.1038/s41592-019-0669-3
pubmed: 31819265
Ruby JG, Bellare P, DeRisi JL (2013) PRICE: Software for the Targeted Assembly of Components of (Meta) Genomic Sequence Data. G3 Genes Genomes Genet 3:865–880. https://doi.org/10.1534/g3.113.005967
doi: 10.1534/g3.113.005967
Samaras P, Schmidt T, Frejno M, Gessulat S et al (2020) ProteomicsDB: a multi-omics and multi-organism resource for life science research. Nucleic Acids Res 48:D1153–D1163. https://doi.org/10.1093/nar/gkz974
doi: 10.1093/nar/gkz974
pubmed: 31665479
Sato K, Sakakibara Y (2013) An extended genovo metagenomic assembler by incorporating paired-end information. PeerJ 1:e196. https://doi.org/10.7717/peerj.196
doi: 10.7717/peerj.196
pubmed: 24281688
pmcid: 3817583
Sato Y, Kojima K, Nariai N, Yamaguchi-Kabata Y et al (2014) SUGAR: graphical user interface-based data refiner for high-throughput DNA sequencing. BMC Genomics 15:1–5. https://doi.org/10.1186/1471-2164-15-664
doi: 10.1186/1471-2164-15-664
Schaab C, Geiger T, Stoehr G, Cox J, Mann M (2012) Analysis of High Accuracy, Quantitative Proteomics Data in the MaxQB Database*. Mol Cell Proteomics 11(M111):014068. https://doi.org/10.1074/mcp.M111.014068
doi: 10.1074/mcp.M111.014068
pubmed: 22301388
Scheuch M, Höper D, Beer M (2015) RIEMS: a software pipeline for sensitive and comprehensive taxonomic classification of reads from metagenomics datasets. BMC Bioinformatics 16:69. https://doi.org/10.1186/s12859-015-0503-6
doi: 10.1186/s12859-015-0503-6
pubmed: 25886935
pmcid: 4351923
Schmieder R, Edwards R (2011a) Fast identification and removal of sequence contamination from genomic and metagenomic datasets. PLoS ONE 6:e17288. https://doi.org/10.1371/journal.pone.0017288
doi: 10.1371/journal.pone.0017288
pubmed: 21408061
pmcid: 3052304
Schmieder R, Edwards R (2011b) Quality control and preprocessing of metagenomic datasets. Bioinformatics 27:863–864. https://doi.org/10.1093/bioinformatics/btr026
doi: 10.1093/bioinformatics/btr026
pubmed: 21278185
pmcid: 3051327
Schreiber F, Gumrich P, Daniel R, Meinicke P (2010) Treephyler: fast taxonomic profiling of metagenomes. Bioinformatics 26:960–961. https://doi.org/10.1093/bioinformatics/btq070
doi: 10.1093/bioinformatics/btq070
pubmed: 20172941
Schröder J, Corbin V, Papenfuss AT (2016) HYSYS: have you swapped your samples? Bioinformatics 33:596–598. https://doi.org/10.1093/bioinformatics/btw685
doi: 10.1093/bioinformatics/btw685
pmcid: 5408803
Schroeder CM, Hilke FJ, Löffler MW, Bitzer M, Lenz F, Sturm M (2017) A comprehensive quality control workflow for paired tumor-normal NGS experiments. Bioinformatics 33:1721–1722. https://doi.org/10.1093/bioinformatics/btx032
doi: 10.1093/bioinformatics/btx032
pubmed: 28130233
Segata N, Waldron L, Ballarini A, Narasimhan V, Jousson O, Huttenhower C (2012) Metagenomic microbial community profiling using unique clade-specific marker genes. Nat Methods 9:811–814. https://doi.org/10.1038/nmeth.2066
doi: 10.1038/nmeth.2066
pubmed: 22688413
pmcid: 22688413
Sharma AK, Gupta A, Kumar S, Dhakan DB, Sharma VK (2015) Woods: A fast and accurate functional annotator and classifier of genomic and metagenomic sequences. Genomics 106:1–6. https://doi.org/10.1016/j.ygeno.2015.04.001
doi: 10.1016/j.ygeno.2015.04.001
pubmed: 25863333
Shafin K, Pesout T, Lorig-Roach R et al (2020) Nanopore sequencing and the Shasta toolkit enable efficient de novo assembly of eleven human genomes. Nat Biotechnol 38:1044–1053. https://doi.org/10.1038/s41587-020-0503-6
doi: 10.1038/s41587-020-0503-6
pubmed: 32686750
pmcid: 7483855
Sigrist CJA, Cerutti L, de Castro E et al (2010) PROSITE, a protein domain database for functional characterization and annotation. Nucleic Acids Res 38:D161–D166. https://doi.org/10.1093/nar/gkp885
doi: 10.1093/nar/gkp885
pubmed: 19858104
Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM (2015) BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210–3212. https://doi.org/10.1093/bioinformatics/btv351
doi: 10.1093/bioinformatics/btv351
pubmed: 26059717
Simon M, Dittami EC (2017) Detection of bacterial contaminants and hybrid sequences in the genome of the kelp Saccharina japonica using Taxoblast. PeerJ 5:e4073. https://doi.org/10.7717/peerj.4073
doi: 10.7717/peerj.4073
Simpson JT, Durbin R (2012) Efficient de novo assembly of large genomes using compressed data structures. Genome Res 22:549–556. https://doi.org/10.1101/gr.126953.111
doi: 10.1101/gr.126953.111
pubmed: 22156294
pmcid: 3290790
Singer J, Ruscheweyh H-J, Hofmann AL, Thurnherr T et al (2018) NGS-pipe: a flexible, easily extendable and highly configurable framework for NGS analysis. Bioinformatics 34:107–108. https://doi.org/10.1093/bioinformatics/btx540
doi: 10.1093/bioinformatics/btx540
pubmed: 28968639
Singh B, Crippen TL, Zheng L, Fields AT, Yu Z et al (2015) A metagenomic assessment of the bacteria associated with Lucilia sericata and Lucilia cuprina (Diptera: Calliphoridae). Appl Microbiol Biotechnol 99:869–883. https://doi.org/10.1007/s00253-014-6115-7
doi: 10.1007/s00253-014-6115-7
pubmed: 25306907
Schloss PD, Westcott SL, Ryabin T, Hall JR et al (2020) Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities. Appl Environ Microbiol 75:7537–7541. https://doi.org/10.1128/AEM.01541-09
doi: 10.1128/AEM.01541-09
Smith BC, McAndrew T, Chen Z, Harari A, Barris DM et al (2012) The cervical microbiome over 7 years and a comparison of methodologies for its characterization. PLoS ONE 7:e40425. https://doi.org/10.1371/journal.pone.0040425
doi: 10.1371/journal.pone.0040425
pubmed: 22792313
pmcid: 3392218
Sohn MB, An L, Pookhao N, Li Q (2014) Accurate genome relative abundance estimation for closely related species in a metagenomic sample. BMC Bioinformatics 15:242. https://doi.org/10.1186/1471-2105-15-242
doi: 10.1186/1471-2105-15-242
pubmed: 25027647
pmcid: 4131027
Somervuo P, Koskela S, Pennanen J, Henrik Nilsson R, Ovaskainen O (2016) Unbiased probabilistic taxonomic classification for DNA barcoding. Bioinformatics 32:2920–2927. https://doi.org/10.1093/bioinformatics/btw346
doi: 10.1093/bioinformatics/btw346
pubmed: 27296980
Stark M, Berger SA, Stamatakis A, von Mering C (2010) MLTreeMap - accurate Maximum Likelihood placement of environmental DNA sequences into taxonomic and functional reference phylogenies. BMC Genomics 11:461. https://doi.org/10.1186/1471-2164-11-461
doi: 10.1186/1471-2164-11-461
pubmed: 20687950
pmcid: 3091657
Starostina E, Tamazian G, Dobrynin P, O’Brien S, Komissarov A (2015) Cookiecutter: a tool for kmer-based read filtering and extraction. bioRxiv 024679. https://doi.org/10.1101/024679
Stewart RD, Auffret MD, Snelling TJ, Roehe R, Watson M (2019) MAGpy: a reproducible pipeline for the downstream analysis of metagenome-assembled genomes (MAGs). Bioinformatics 35:2150–2152. https://doi.org/10.1093/bioinformatics/bty905
doi: 10.1093/bioinformatics/bty905
pubmed: 30418481
Strous M, Kraft B, Bisdorf R, Tegetmeyer H (2012) The Binning of Metagenomic Contigs for Microbial Physiology of Mixed Cultures. Front Microbiol 3:410. https://doi.org/10.3389/fmicb.2012.00410
doi: 10.3389/fmicb.2012.00410
pubmed: 23227024
pmcid: 3514610
Sunagawa S, Mende DR, Zeller G, Izquierdo-Carrasco F et al (2013) Metagenomic species profiling using universal phylogenetic marker genes. Nat Methods 10:1196–1199. https://doi.org/10.1038/nmeth.2693
doi: 10.1038/nmeth.2693
pubmed: 24141494
Tanaseichuk O, Borneman J, Jiang T (2012) Separating metagenomic short reads into genomes via clustering. Algorithms Mol Biol 7:27. https://doi.org/10.1186/1748-7188-7-27
doi: 10.1186/1748-7188-7-27
pubmed: 23009059
pmcid: 3537596
Tang S, Antonov I, Borodovsky M (2013) MetaGeneTack: ab initio detection of frameshifts in metagenomic sequences. Bioinformatics 29:114–116. https://doi.org/10.1093/bioinformatics/bts636
doi: 10.1093/bioinformatics/bts636
pubmed: 23129300
The UniProt C (2019) UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res 47:D506–D515. https://doi.org/10.1093/nar/gky1049
doi: 10.1093/nar/gky1049
Thompson JF, Oliver JS (2012) Mapping and sequencing DNA using nanopores and nanodetectors. Electrophoresis 33:3429–3436. https://doi.org/10.1002/elps.201200136
doi: 10.1002/elps.201200136
pubmed: 23208922
Thompson JF, Steinmann KE (2010) Single molecule sequencing with a HeliScope Genetic Analysis System. Curr Protoc Mol Biol 92:7.10.1-7.10.14. https://doi.org/10.1002/0471142727.mb0710s92
doi: 10.1002/0471142727.mb0710s92
Tiwari R, Nain L, Labrou NE, Shukla P (2018) Bioprospecting of functional cellulases from metagenome for second generation biofuel production: a review. Crit Rev Microbiol 44:244–257. https://doi.org/10.1080/1040841X.2017.1337713
doi: 10.1080/1040841X.2017.1337713
pubmed: 28609211
Torkamaneh D, Laroche J, Bastien M, Abed A, Belzile F (2017) Fast-GBS: a new pipeline for the efficient and highly accurate calling of SNPs from genotyping-by-sequencing data. BMC Bioinformatics 18:1–7. https://doi.org/10.1186/s12859-016-1431-9
doi: 10.1186/s12859-016-1431-9
Treangen TJ, Koren S, Sommer DD, Liu B et al (2013) MetAMOS: a modular and open source metagenomic assembly and analysis pipeline. Genome Biol 14:R2. https://doi.org/10.1186/gb-2013-14-1-r2
doi: 10.1186/gb-2013-14-1-r2
pubmed: 23320958
pmcid: 4053804
Uchiyama T, Irie M, Mori H, Kurokawa K, Yamada T (2015) FuncTree: Functional Analysis and Visualization for Large-Scale Omics Data. PLoS ONE 10:e0126967. https://doi.org/10.1371/journal.pone.0126967
doi: 10.1371/journal.pone.0126967
pubmed: 25974630
pmcid: 4431737
Uchiyama T, Mihara M, Nishide H, Chiba H (2013) MBGD update 2013: the microbial genome database for exploring the diversity of microbial world. Nucleic Acids Res 41:D631–D635. https://doi.org/10.1093/nar/gks1006
doi: 10.1093/nar/gks1006
pubmed: 23118485
Ulyantsev VI, Kazakov SV, Dubinkina VB, Tyakht AV, Alexeev DG (2016) MetaFast: fast reference-free graph-based comparison of shotgun metagenomic data. Bioinformatics 32:2760–2767. https://doi.org/10.1093/bioinformatics/btw312
doi: 10.1093/bioinformatics/btw312
pubmed: 27259541
Uritskiy GV, DiRuggiero J, Taylor J (2018) MetaWRAP - a flexible pipeline for genome-resolved metagenomic data analysis. bioRxiv 277442. https://doi.org/10.1101/277442
Valencia CA, Pervaiz MA, Husami A, Qian Y, Zhang K (2013) Sanger Sequencing Principles, History, and Landmarks. In: Next Generation Sequencing Technologies in Medical Genetics. SpringerBriefs in Genetics. New York: Springer. https://doi.org/10.1007/978-1-4614-9032-6_1
Vaziri ND, Wong J, Pahl M, Piceno YM et al (2013) Chronic kidney disease alters intestinal microbial flora. Kidney Int 83(2):308–315. https://doi.org/10.1038/ki.2012.345
doi: 10.1038/ki.2012.345
pubmed: 22992469
Wagner J, Chelaru F, Kancherla J, Paulson JN et al (2018) Metaviz: interactive statistical and visual analysis of metagenomic data. Nucleic Acids Res 46:2777–2787. https://doi.org/10.1093/nar/gky136
doi: 10.1093/nar/gky136
pubmed: 29529268
pmcid: 5887897
Wajid B, Serpedin E (2011) Minimum description length based selection of reference sequences for comparative assemblers2011 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS) 230–233
Wajid B, Serpedin E (2012) Review of general algorithmic features for genome assemblers for next generation sequencers. GPB 10:58–73. https://doi.org/10.1016/j.gpb.2012.05.006
doi: 10.1016/j.gpb.2012.05.006
pubmed: 22768980
pmcid: 5054208
Wajid B, Serpedin E (2016) Do it yourself guide to genome assembly. Brief Funct Genom 15:1–9. https://doi.org/10.1093/bfgp/elu042
doi: 10.1093/bfgp/elu042
Wajid B, Serpedin E, Nounou M, Nounou H (2012a) MiB: a comparative assembly processing pipelineProceedings 2012 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS). IEEE 86-89. https://doi.org/10.1109/GENSIPS.2012.6507733
Wajid B, Serpedin E, Nounou M, Nounou H (2012b) Optimal reference sequence selection for genome assembly using minimum description length principle. EURASIP J Bioinform Syst Biol 2012:1–11. https://doi.org/10.1186/1687-4153-2012-18
doi: 10.1186/1687-4153-2012-18
Wajid B, Ekti AR, Noor A, Serpedin E, Ayyaz MN, Nounou H, Nounou M (2013) Supersonic mib2013 IEEE International Workshop on Genomic Signal Processing and Statistics. IEEE, 86–87. https://doi.org/10.1109/GENSIPS.2013.6735941
Wajid B, Serpedin E, Nounou M, Nounou H (2015) MARAGAP: a modular approach to reference assisted genome assembly pipeline. IJCBDD 8:226–250. https://doi.org/10.1504/IJCBDD.2015.072073
doi: 10.1504/IJCBDD.2015.072073
Wajid B, Sohail MU, Ekti AR, Serpedin E (2016) The A, C, G, and T of genome assembly. Biomed Res Int 2016. https://doi.org/10.1155/2016/6329217
Waldherr S (2014) A guideline to model reduction by stoichiometric decomposition for biochemical network analysis. Proc. of the 21st International Symposium on Mathematical Theory of Networks and Systems 490–495
Wang Y, Mehta G, Mayani R, Lu J et al (2011) RseqFlow: workflows for RNA-Seq data analysis. Bioinformatics 27:2598–2600. https://doi.org/10.1093/bioinformatics/btr441
doi: 10.1093/bioinformatics/btr441
pubmed: 21795323
pmcid: 3167055
Wang K, Singh D, Zeng Z, Coleman SJ et al (2010) MapSplice: accurate mapping of RNA-seq reads for splice junction discovery. Nucleic Acids Res 38:e178–e178. https://doi.org/10.1093/nar/gkq622
doi: 10.1093/nar/gkq622
pubmed: 20802226
pmcid: 2952873
Wang Y, Leung HCM, Yiu SM, Chin FYL (2014) MetaCluster-TA: taxonomic annotation for metagenomic data based on assembly-assisted binning. BMC Genomics. BioMed Central 1–9
Wang Q, Fish JA, Gilman M, Sun Y et al (2015a) Xander: employing a novel method for efficient gene-targeted metagenomic assembly. Microbiome 3:32. https://doi.org/10.1186/s40168-015-0093-6
doi: 10.1186/s40168-015-0093-6
pubmed: 26246894
pmcid: 4526283
Wang Y, Hu H, Li X (2015b) MBBC: an efficient approach for metagenomic binning based on clustering. BMC Bioinformatics 16:36. https://doi.org/10.1186/s12859-015-0473-8
doi: 10.1186/s12859-015-0473-8
pubmed: 25652152
pmcid: 4339733
Wang C, Dong D, Wang H et al (2016) Metagenomic analysis of microbial consortia enriched from compost: new insights into the role of Actinobacteria in lignocellulose decomposition. Biotechnol Biofuels 9:22. https://doi.org/10.1186/s13068-016-0440-2
Wang Y, Wang K, Lu YY, Sun F (2017) Improving contig binning of metagenomic data using [Formula: see text] oligonucleotide frequency dissimilarity. BMC Bioinformatics 18:425. https://doi.org/10.1186/s12859-017-1835-1
doi: 10.1186/s12859-017-1835-1
pubmed: 28931373
pmcid: 5607646
Wanichthanarak K, Fan S, Grapov D, Barupal DK, Fiehn O (2017) Metabox: A Toolbox for Metabolomic Data Analysis, Interpretation and Integrative Exploration. PLoS ONE 12:e0171046. https://doi.org/10.1371/journal.pone.0171046
doi: 10.1371/journal.pone.0171046
pubmed: 28141874
pmcid: 5283729
Ward J, Cole C, Febrer M, Barton GJ (2016) AlmostSignificant: simplifying quality control of high-throughput sequencing data. Bioinformatics 32:3850–3851. https://doi.org/10.1093/bioinformatics/btw559
doi: 10.1093/bioinformatics/btw559
pubmed: 27559158
pmcid: 5167069
Watson SJ, Welkers MR, Depledge DP et al (2013) Viral population analysis and minority-variant detection using short read next-generation sequencing. PHILOS T R SOC B 368:20120205. https://doi.org/10.1098/rstb.2012.0205
doi: 10.1098/rstb.2012.0205
Weber M, Teeling H, Huang S, Waldmann J et al (2011) Practical application of self-organizing maps to interrelate biodiversity and functional data in NGS-based metagenomics. ISME J 5:918–928. https://doi.org/10.1038/ismej.2010.180
doi: 10.1038/ismej.2010.180
pubmed: 21160538
Wienkoop S, Staudinger C, Hoehenwarter W, Weckwerth W, Egelhofer V (2012) ProMEX – a mass spectral reference database for plant proteomics. Front Plant Sci 3:125. https://doi.org/10.3389/fpls.2012.00125
doi: 10.3389/fpls.2012.00125
pubmed: 22685450
pmcid: 3368217
Williams W, Trindade M (2017) Metagenomics for the discovery of novel biosurfactants. Functional metagenomics: tools and applications. Springer, pp. 95–117
Wittig U, Kania R, Golebiewski M, Rey M, Shi L et al (2012) SABIO-RK —database for biochemical reaction kinetics. Nucleic Acids Res 40:D790–D796. https://doi.org/10.1093/nar/gkr1046
doi: 10.1093/nar/gkr1046
pubmed: 22102587
Wolfien M, Rimmbach C, Schmitz U, Jung JJ, Krebs S, Steinhoff G, David R, Wolkenhauer O (2016) TRAPLINE: a standardized and automated pipeline for RNA sequencing data analysis, evaluation and annotation. BMC Bioinformatics 17:1–11. https://doi.org/10.1186/s12859-015-0873-9
doi: 10.1186/s12859-015-0873-9
Wood DE, Salzberg SL (2014) Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol 15:R46. https://doi.org/10.1186/gb-2014-15-3-r46
doi: 10.1186/gb-2014-15-3-r46
pubmed: 24580807
pmcid: 4053813
Wu S, Zhu Z, Fu L, Niu B, Li W (2011) WebMGA: a customizable web server for fast metagenomic sequence analysis. BMC Genomics 12:444. https://doi.org/10.1186/1471-2164-12-444
doi: 10.1186/1471-2164-12-444
pubmed: 21899761
pmcid: 3180703
Wu Y-W, Rho M, Doak TG, Ye Y (2012) Stitching gene fragments with a network matching algorithm improves gene assembly for metagenomics. Bioinformatics 28:i363–i369. https://doi.org/10.1093/bioinformatics/bts388
doi: 10.1093/bioinformatics/bts388
pubmed: 22962453
pmcid: 3436815
Wu Y-W, Simmons BA, Singer SW (2016) MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics 32:605–607. https://doi.org/10.1093/bioinformatics/btv638
doi: 10.1093/bioinformatics/btv638
pubmed: 26515820
Wu Y-W, Ye Y (2011) A Novel Abundance-Based Algorithm for Binning Metagenomic Sequences Using l-tuples. J Comput Biol 18:523–534. https://doi.org/10.1089/cmb.2010.0245
doi: 10.1089/cmb.2010.0245
pubmed: 21385052
pmcid: 3123841
Xie W, Wang F, Guo L, Chen Z, Sievert SM et al (2011) Comparative metagenomics of microbial communities inhabiting deep-sea hydrothermal vent chimneys with contrasting chemistries. ISME J 5:414–426. https://doi.org/10.1038/ismej.2010.144
doi: 10.1038/ismej.2010.144
pubmed: 20927138
Yang X, Liu D, Liu F, Wu J et al (2013) HTQC: a fast quality control toolkit for Illumina sequencing data. BMC Bioinformatics 14:1–4. https://doi.org/10.1186/1471-2105-14-33
doi: 10.1186/1471-2105-14-33
Yates AD, Achuthan P, Akanni W, Allen J et al (2020) Ensembl 2020. Nucleic Acids Res 48:D682–D688. https://doi.org/10.1093/nar/gkz966
doi: 10.1093/nar/gkz966
pubmed: 31691826
Yoon SH, Ha SM, Kwon S, Lim J, Kim Y, Seo H, Chun J (2017) Introducing EzBioCloud: A taxonomically united database of 16S rRNA and whole genome assemblies. Int J Syst Evol Microbiol 67:1613–1617. https://doi.org/10.1099/ijsem.0.001755
doi: 10.1099/ijsem.0.001755
pubmed: 28005526
pmcid: 5563544
Yourstone SM, Lundberg DS, Dangl JL, Jones CD (2014) MT-Toolbox: improved amplicon sequencing using molecule tags. BMC Bioinformatics 15:1–7. https://doi.org/10.1186/1471-2105-15-284
doi: 10.1186/1471-2105-15-284
Yuan C, Lei J, Cole J, Sun Y (2015) Reconstructing 16S rRNA genes in metagenomic data. Bioinformatics 31:i35–i43. https://doi.org/10.1093/bioinformatics/btv231
doi: 10.1093/bioinformatics/btv231
pubmed: 26072503
pmcid: 4765874
Zacharias HU, Rehberg T, Mehrl S, Richtmann D et al (2017) Scale-Invariant Biomarker Discovery in Urine and Plasma Metabolite Fingerprints. J Proteome Res 16:3596–3605. https://doi.org/10.1021/acs.jproteome.7b00325
doi: 10.1021/acs.jproteome.7b00325
pubmed: 28825821
Zakrzewski M, Bekel T, Ander C, Pühler A et al (2013) MetaSAMS—A novel software platform for taxonomic classification, functional annotation and comparative analysis of metagenome datasets. J Biotechnol 167:156–165. https://doi.org/10.1016/j.jbiotec.2012.09.013
doi: 10.1016/j.jbiotec.2012.09.013
pubmed: 23026555
Zeng F, Wang Z, Wang Y, Zhou J, Chen T (2017) Large-scale 16S gene assembly using metagenomics shotgun sequences. Bioinformatics 33:1447–1456. https://doi.org/10.1093/bioinformatics/btx018
doi: 10.1093/bioinformatics/btx018
pubmed: 28158392
Zhang T, Luo Y, Liu K, Pan L, Zhang B, Yu J, Hu S (2011) BIGpre: a quality assessment package for next-generation sequencing data. Genom Proteom Bioinform 9:238–244. https://doi.org/10.1016/S1672-0229(11)60027-2
doi: 10.1016/S1672-0229(11)60027-2
Zhang Y, Sun Y, Cole JR (2013) A Sensitive and Accurate protein domain cLassification Tool (SALT) for short reads. Bioinformatics 29:2103–2111. https://doi.org/10.1093/bioinformatics/btt357
doi: 10.1093/bioinformatics/btt357
pubmed: 23782615
Zhang Y, Sun Y, Cole JR (2014) A Scalable and Accurate Targeted Gene Assembly Tool (SAT-Assembler) for Next-Generation Sequencing Data. PLoS Comput Biol 10:e1003737. https://doi.org/10.1371/journal.pcbi.1003737
doi: 10.1371/journal.pcbi.1003737
pubmed: 25122209
pmcid: 4133164
Zhao W, Liu W, Tian D, Tang B et al (2011) wapRNA: a web-based application for the processing of RNA sequences. Bioinformatics 27:3076–3077. https://doi.org/10.1093/bioinformatics/btr504
doi: 10.1093/bioinformatics/btr504
pubmed: 21896507
Zhou Q, Su X, Jing G, Chen S, Ning K (2018) RNA-QC-chain: comprehensive and fast quality control for RNA-Seq data. BMC Genomics 19:1–10. https://doi.org/10.1186/s12864-018-4503-6
doi: 10.1186/s12864-018-4503-6
Zhu W, Lomsadze A, Borodovsky M (2010) Ab initio gene identification in metagenomic sequences. Nucleic Acids Res 38:e132–e132. https://doi.org/10.1093/nar/gkq275
doi: 10.1093/nar/gkq275
pubmed: 20403810
pmcid: 2896542
Zhu J, Liao M, Yao Z, Liang W et al (2018) Breast cancer in postmenopausal women is associated with an altered gut metagenome. Microbiome 6:136. https://doi.org/10.1186/s40168-018-0515-3
doi: 10.1186/s40168-018-0515-3
pubmed: 30081953
pmcid: 6080540
Zitvogel L, Ma Y, Raoult D, Kroemer G, Gajewski TF (2018) The microbiome in cancer immunotherapy: Diagnostic tools and therapeutic strategies. Science 359:1366–1370. https://doi.org/10.1126/science.aar6918
doi: 10.1126/science.aar6918
pubmed: 29567708
Zou B, Li J, Zhou Q, Quan Z-X (2017) MIPE: A metagenome-based community structure explorer and SSU primer evaluation tool. PLoS ONE 12:e0174609. https://doi.org/10.1371/journal.pone.0174609
doi: 10.1371/journal.pone.0174609
pubmed: 28350876
pmcid: 5370157
Zytnicki M, Quesneville H (2011) S-MART, a software toolbox to aid RNA-Seq data analysis. PLoS ONE 6:e25988. https://doi.org/10.1371/journal.pone.0025988
doi: 10.1371/journal.pone.0025988
pubmed: 21998740
pmcid: 3188586