ReadBouncer: precise and scalable adaptive sampling for nanopore sequencing.
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944
Informations de publication
Date de publication:
24 06 2022
24 06 2022
Historique:
entrez:
27
6
2022
pubmed:
28
6
2022
medline:
30
6
2022
Statut:
ppublish
Résumé
Nanopore sequencers allow targeted sequencing of interesting nucleotide sequences by rejecting other sequences from individual pores. This feature facilitates the enrichment of low-abundant sequences by depleting overrepresented ones in-silico. Existing tools for adaptive sampling either apply signal alignment, which cannot handle human-sized reference sequences, or apply read mapping in sequence space relying on fast graphical processing units (GPU) base callers for real-time read rejection. Using nanopore long-read mapping tools is also not optimal when mapping shorter reads as usually analyzed in adaptive sampling applications. Here, we present a new approach for nanopore adaptive sampling that combines fast CPU and GPU base calling with read classification based on Interleaved Bloom Filters. ReadBouncer improves the potential enrichment of low abundance sequences by its high read classification sensitivity and specificity, outperforming existing tools in the field. It robustly removes even reads belonging to large reference sequences while running on commodity hardware without GPUs, making adaptive sampling accessible for in-field researchers. Readbouncer also provides a user-friendly interface and installer files for end-users without a bioinformatics background. The C++ source code is available at https://gitlab.com/dacs-hpi/readbouncer. Supplementary data are available at Bioinformatics online.
Identifiants
pubmed: 35758774
pii: 6617484
doi: 10.1093/bioinformatics/btac223
pmc: PMC9235500
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
i153-i160Informations de copyright
© The Author(s) 2022. Published by Oxford University Press.
Références
Bioinformatics. 2020 Jul 1;36(Suppl_1):i12-i20
pubmed: 32657362
Nat Biotechnol. 2021 Apr;39(4):431-441
pubmed: 33257863
Gigascience. 2019 May 1;8(5):
pubmed: 31089679
Front Genet. 2021 Mar 24;12:620009
pubmed: 33841495
Bioinformatics. 2020 Aug 15;36(14):4191-4192
pubmed: 32374816
Sci Rep. 2022 Mar 7;12(1):4000
pubmed: 35256725
Microbiome. 2017 Aug 14;5(1):101
pubmed: 28807044
Nat Methods. 2016 Sep;13(9):751-4
pubmed: 27454285
Genome Biol. 2022 Jan 24;23(1):11
pubmed: 35067223
Bioinformatics. 2018 Sep 1;34(17):i766-i772
pubmed: 30423080
Science. 2021 Dec 17;374(6574):1509-1513
pubmed: 34735217
Nature. 2016 Feb 11;530(7589):228-232
pubmed: 26840485
iScience. 2021 Jun 08;24(6):102696
pubmed: 34195571
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242
J Comput Biol. 2022 Feb;29(2):155-168
pubmed: 35108101
Genome Biol. 2018 Jul 13;19(1):90
pubmed: 30005597
Science. 2022 Apr;376(6588):44-53
pubmed: 35357919
J Exp Bot. 2017 Nov 28;68(20):5419-5429
pubmed: 28992056
Nat Biotechnol. 2021 Apr;39(4):442-450
pubmed: 33257864
Bioinformatics. 2021 May 5;37(5):589-595
pubmed: 32976553
Adv Exp Med Biol. 2019;1129:143-150
pubmed: 30968366
Mol Ecol Resour. 2014 Nov;14(6):1097-102
pubmed: 25187008
Genome Biol. 2019 Jun 24;20(1):129
pubmed: 31234903
Bioinformatics. 2019 Jul 1;35(13):2193-2198
pubmed: 30462145
J Hum Genet. 2020 Jan;65(1):35-40
pubmed: 31582773
Genome Biol. 2016 Jun 20;17(1):132
pubmed: 27323842