ReadBouncer: precise and scalable adaptive sampling for nanopore sequencing.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
24 06 2022
Historique:
entrez: 27 6 2022
pubmed: 28 6 2022
medline: 30 6 2022
Statut: ppublish

Résumé

Nanopore sequencers allow targeted sequencing of interesting nucleotide sequences by rejecting other sequences from individual pores. This feature facilitates the enrichment of low-abundant sequences by depleting overrepresented ones in-silico. Existing tools for adaptive sampling either apply signal alignment, which cannot handle human-sized reference sequences, or apply read mapping in sequence space relying on fast graphical processing units (GPU) base callers for real-time read rejection. Using nanopore long-read mapping tools is also not optimal when mapping shorter reads as usually analyzed in adaptive sampling applications. Here, we present a new approach for nanopore adaptive sampling that combines fast CPU and GPU base calling with read classification based on Interleaved Bloom Filters. ReadBouncer improves the potential enrichment of low abundance sequences by its high read classification sensitivity and specificity, outperforming existing tools in the field. It robustly removes even reads belonging to large reference sequences while running on commodity hardware without GPUs, making adaptive sampling accessible for in-field researchers. Readbouncer also provides a user-friendly interface and installer files for end-users without a bioinformatics background. The C++ source code is available at https://gitlab.com/dacs-hpi/readbouncer. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 35758774
pii: 6617484
doi: 10.1093/bioinformatics/btac223
pmc: PMC9235500
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

i153-i160

Informations de copyright

© The Author(s) 2022. Published by Oxford University Press.

Références

Bioinformatics. 2020 Jul 1;36(Suppl_1):i12-i20
pubmed: 32657362
Nat Biotechnol. 2021 Apr;39(4):431-441
pubmed: 33257863
Gigascience. 2019 May 1;8(5):
pubmed: 31089679
Front Genet. 2021 Mar 24;12:620009
pubmed: 33841495
Bioinformatics. 2020 Aug 15;36(14):4191-4192
pubmed: 32374816
Sci Rep. 2022 Mar 7;12(1):4000
pubmed: 35256725
Microbiome. 2017 Aug 14;5(1):101
pubmed: 28807044
Nat Methods. 2016 Sep;13(9):751-4
pubmed: 27454285
Genome Biol. 2022 Jan 24;23(1):11
pubmed: 35067223
Bioinformatics. 2018 Sep 1;34(17):i766-i772
pubmed: 30423080
Science. 2021 Dec 17;374(6574):1509-1513
pubmed: 34735217
Nature. 2016 Feb 11;530(7589):228-232
pubmed: 26840485
iScience. 2021 Jun 08;24(6):102696
pubmed: 34195571
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242
J Comput Biol. 2022 Feb;29(2):155-168
pubmed: 35108101
Genome Biol. 2018 Jul 13;19(1):90
pubmed: 30005597
Science. 2022 Apr;376(6588):44-53
pubmed: 35357919
J Exp Bot. 2017 Nov 28;68(20):5419-5429
pubmed: 28992056
Nat Biotechnol. 2021 Apr;39(4):442-450
pubmed: 33257864
Bioinformatics. 2021 May 5;37(5):589-595
pubmed: 32976553
Adv Exp Med Biol. 2019;1129:143-150
pubmed: 30968366
Mol Ecol Resour. 2014 Nov;14(6):1097-102
pubmed: 25187008
Genome Biol. 2019 Jun 24;20(1):129
pubmed: 31234903
Bioinformatics. 2019 Jul 1;35(13):2193-2198
pubmed: 30462145
J Hum Genet. 2020 Jan;65(1):35-40
pubmed: 31582773
Genome Biol. 2016 Jun 20;17(1):132
pubmed: 27323842

Auteurs

Jens-Uwe Ulrich (JU)

Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam, 14482 Potsdam, Germany.
Bioinformatics Unit (MF1), Robert Koch Institute, 13353 Berlin, Germany.
Department of Mathematics and Computer Science, Free University of Berlin, 14195 Berlin, Germany.

Ahmad Lutfi (A)

Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam, 14482 Potsdam, Germany.
Department of Mathematics and Computer Science, Free University of Berlin, 14195 Berlin, Germany.

Kilian Rutzen (K)

Genome Sequencing Unit (MF2), Robert Koch Institute, 13353 Berlin, Germany.

Bernhard Y Renard (BY)

Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam, 14482 Potsdam, Germany.
Bioinformatics Unit (MF1), Robert Koch Institute, 13353 Berlin, Germany.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH