Wochenende - modular and flexible alignment-based shotgun metagenome analysis.


Journal

BMC genomics
ISSN: 1471-2164
Titre abrégé: BMC Genomics
Pays: England
ID NLM: 100965258

Informations de publication

Date de publication:
11 Nov 2022
Historique:
received: 04 04 2022
accepted: 02 11 2022
entrez: 11 11 2022
pubmed: 12 11 2022
medline: 16 11 2022
Statut: epublish

Résumé

Shotgun metagenome analysis provides a robust and verifiable method for comprehensive microbiome analysis of fungal, viral, archaeal and bacterial taxonomy, particularly with regard to visualization of read mapping location, normalization options, growth dynamics and functional gene repertoires. Current read classification tools use non-standard output formats, or do not fully show information on mapping location. As reference datasets are not perfect, portrayal of mapping information is critical for judging results effectively. Our alignment-based pipeline, Wochenende, incorporates flexible quality control, trimming, mapping, various filters and normalization. Results are completely transparent and filters can be adjusted by the user. We observe stringent filtering of mismatches and use of mapping quality sharply reduces the number of false positives. Further modules allow genomic visualization and the calculation of growth rates, as well as integration and subsequent plotting of pipeline results as heatmaps or heat trees. Our novel normalization approach additionally allows calculation of absolute abundance profiles by comparison with reads assigned to the human host genome. Wochenende has the ability to find and filter alignments to all kingdoms of life using both short and long reads, and requires only good quality reference genomes. Wochenende automatically combines multiple available modules ranging from quality control and normalization to taxonomic visualization. Wochenende is available at https://github.com/MHH-RCUG/nf_wochenende .

Sections du résumé

BACKGROUND BACKGROUND
Shotgun metagenome analysis provides a robust and verifiable method for comprehensive microbiome analysis of fungal, viral, archaeal and bacterial taxonomy, particularly with regard to visualization of read mapping location, normalization options, growth dynamics and functional gene repertoires. Current read classification tools use non-standard output formats, or do not fully show information on mapping location. As reference datasets are not perfect, portrayal of mapping information is critical for judging results effectively.
RESULTS RESULTS
Our alignment-based pipeline, Wochenende, incorporates flexible quality control, trimming, mapping, various filters and normalization. Results are completely transparent and filters can be adjusted by the user. We observe stringent filtering of mismatches and use of mapping quality sharply reduces the number of false positives. Further modules allow genomic visualization and the calculation of growth rates, as well as integration and subsequent plotting of pipeline results as heatmaps or heat trees. Our novel normalization approach additionally allows calculation of absolute abundance profiles by comparison with reads assigned to the human host genome.
CONCLUSION CONCLUSIONS
Wochenende has the ability to find and filter alignments to all kingdoms of life using both short and long reads, and requires only good quality reference genomes. Wochenende automatically combines multiple available modules ranging from quality control and normalization to taxonomic visualization. Wochenende is available at https://github.com/MHH-RCUG/nf_wochenende .

Identifiants

pubmed: 36368923
doi: 10.1186/s12864-022-08985-9
pii: 10.1186/s12864-022-08985-9
pmc: PMC9650795
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

748

Informations de copyright

© 2022. The Author(s).

Références

Nat Commun. 2016 Apr 13;7:11257
pubmed: 27071849
Nat Commun. 2018 Jul 16;9(1):2747
pubmed: 30013106
Bioinformatics. 2019 Sep 1;35(17):2966-2973
pubmed: 30649250
Cell. 2016 Aug 25;166(5):1103-1116
pubmed: 27565341
J Clin Microbiol. 1998 Jun;36(6):1756-60
pubmed: 9620414
Science. 2015 Sep 4;349(6252):1101-1106
pubmed: 26229116
Bioinformatics. 2016 Oct 1;32(19):3047-8
pubmed: 27312411
Nat Methods. 2012 Mar 04;9(4):357-9
pubmed: 22388286
Bioinformatics. 2011 Jun 15;27(12):1691-2
pubmed: 21493652
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242
NPJ Biofilms Microbiomes. 2020 Dec 15;6(1):61
pubmed: 33319812
Bioinformatics. 2018 May 1;34(9):1600-1602
pubmed: 29069305
Elife. 2021 May 04;10:
pubmed: 33944776
J Clin Microbiol. 1986 Jun;23(6):1034-8
pubmed: 3711294
Nat Commun. 2018 Nov 30;9(1):5114
pubmed: 30504855
Nat Methods. 2018 Jun;15(6):461-468
pubmed: 29713083
Genomics Inform. 2019 Mar;17(1):e6
pubmed: 30929407
Bioinformatics. 2010 Mar 1;26(5):589-95
pubmed: 20080505
Forensic Sci Int Genet. 2020 May;46:102257
pubmed: 32058299
Bioinformatics. 2019 Feb 1;35(3):521-522
pubmed: 30016412
Bioinformatics. 2014 Aug 1;30(15):2114-20
pubmed: 24695404
Semin Respir Crit Care Med. 2019 Dec;40(6):727-736
pubmed: 31887768
Microbiome. 2018 Feb 27;6(1):42
pubmed: 29482639
Curr Opin Pediatr. 2018 Feb;30(1):125-130
pubmed: 29189352
Genome Res. 2016 Dec;26(12):1721-1729
pubmed: 27852649
Microbiome. 2018 Feb 26;6(1):41
pubmed: 29482646
Bioinformatics. 2018 Sep 1;34(17):i884-i890
pubmed: 30423086
Nat Rev Microbiol. 2017 May;15(5):259-270
pubmed: 28316330
Nucleic Acids Res. 2012 Jan;40(Database issue):D130-5
pubmed: 22121212
Genome Biol. 2018 Nov 16;19(1):198
pubmed: 30445993
Gigascience. 2021 Feb 16;10(2):
pubmed: 33590861
PLoS One. 2016 Oct 19;11(10):e0165015
pubmed: 27760173
PLoS Comput Biol. 2017 Feb 21;13(2):e1005404
pubmed: 28222096
Nat Biotechnol. 2017 Sep 12;35(9):833-844
pubmed: 28898207
PLoS One. 2018 Feb 14;13(2):e0192898
pubmed: 29444186
Nat Methods. 2018 Jul;15(7):475-476
pubmed: 29967506
Nat Rev Genet. 2019 Jun;20(6):341-355
pubmed: 30918369
Sci Transl Med. 2018 Dec 19;10(472):
pubmed: 30567928
J Clin Microbiol. 2005 Oct;43(10):5122-8
pubmed: 16207973
IUBMB Life. 2020 Apr;72(4):698-705
pubmed: 31869003
Diagn Microbiol Infect Dis. 2002 Oct;44(2):201-4
pubmed: 12458129
Front Microbiol. 2020 May 25;11:953
pubmed: 32528434
Front Microbiol. 2021 Jul 15;12:670336
pubmed: 34335499
Sci Rep. 2020 Aug 12;10(1):13588
pubmed: 32788623
Bioinformatics. 2010 Mar 15;26(6):841-2
pubmed: 20110278
Arch Pathol Lab Med. 2017 Jun;141(6):776-786
pubmed: 28169558
Bioinformatics. 2015 Jun 15;31(12):2032-4
pubmed: 25697820
BMC Bioinformatics. 2015 Nov 04;16:363
pubmed: 26537885
Eur Respir Rev. 2014 Sep;23(133):320-32
pubmed: 25176968
BMC Biol. 2014 Nov 12;12:87
pubmed: 25387460

Auteurs

Ilona Rosenboom (I)

Clinical Research Group Molecular Pathology of Cystic Fibrosis and Pseudomonas Genomics, Clinic for Pediatric Pneumology, Allergology and Neonatology, Hannover Medical School, Hannover, Germany. rosenboom.ilona@mh-hannover.de.

Tobias Scheithauer (T)

Research Core Unit Genomics, Hannover Medical School, Hannover, Germany.

Fabian C Friedrich (FC)

Research Core Unit Genomics, Hannover Medical School, Hannover, Germany.

Sophia Pörtner (S)

Research Core Unit Genomics, Hannover Medical School, Hannover, Germany.

Lisa Hollstein (L)

Research Core Unit Genomics, Hannover Medical School, Hannover, Germany.

Marie-Madlen Pust (MM)

Clinical Research Group Molecular Pathology of Cystic Fibrosis and Pseudomonas Genomics, Clinic for Pediatric Pneumology, Allergology and Neonatology, Hannover Medical School, Hannover, Germany.

Konstantinos Sifakis (K)

University of Crete, 700 13, Heraklion, Greece.

Tom Wehrbein (T)

Institut Fuer Informationsverarbeitung (TNT), Leibniz University Hannover, Hannover, Germany.

Bodo Rosenhahn (B)

Institut Fuer Informationsverarbeitung (TNT), Leibniz University Hannover, Hannover, Germany.

Lutz Wiehlmann (L)

Research Core Unit Genomics, Hannover Medical School, Hannover, Germany.

Patrick Chhatwal (P)

Department of Microbiology, Hannover Medical School, Hannover, Germany.

Burkhard Tümmler (B)

Clinical Research Group Molecular Pathology of Cystic Fibrosis and Pseudomonas Genomics, Clinic for Pediatric Pneumology, Allergology and Neonatology, Hannover Medical School, Hannover, Germany.

Colin F Davenport (CF)

Research Core Unit Genomics, Hannover Medical School, Hannover, Germany.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH