High quality genome assemblies of Mycoplasma bovis using a taxon-specific Bonito basecaller for MinION and Flongle long-read nanopore sequencing.
Basecalling
Genome assembly
Long-read sequencing
Mycoplasma bovis
Nanopore sequencing
Journal
BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194
Informations de publication
Date de publication:
11 Nov 2020
11 Nov 2020
Historique:
received:
03
08
2020
accepted:
30
10
2020
entrez:
12
11
2020
pubmed:
13
11
2020
medline:
24
11
2020
Statut:
epublish
Résumé
Implementation of Third-Generation Sequencing approaches for Whole Genome Sequencing (WGS) all-in-one diagnostics in human and veterinary medicine, requires the rapid and accurate generation of consensus genomes. Over the last years, Oxford Nanopore Technologies (ONT) released various new devices (e.g. the Flongle R9.4.1 flow cell) and bioinformatics tools (e.g. the in 2019-released Bonito basecaller), allowing cheap and user-friendly cost-efficient introduction in various NGS workflows. While single read, overall consensus accuracies, and completeness of genome sequences has been improved dramatically, further improvements are required when working with non-frequently sequenced organisms like Mycoplasma bovis. As an important primary respiratory pathogen in cattle, rapid M. bovis diagnostics is crucial to allow timely and targeted disease control and prevention. Current complete diagnostics (including identification, strain typing, and antimicrobial resistance (AMR) detection) require combined culture-based and molecular approaches, of which the first can take 1-2 weeks. At present, cheap and quick long read all-in-one WGS approaches can only be implemented if increased accuracies and genome completeness can be obtained. Here, a taxon-specific custom-trained Bonito v.0.1.3 basecalling model (custom-pg45) was implemented in various WGS assembly bioinformatics pipelines. Using MinION sequencing data, we showed improved consensus accuracies up to Q45.2 and Q46.7 for reference-based and Canu de novo assembled M. bovis genomes, respectively. Furthermore, the custom-pg45 model resulted in mean consensus accuracies of Q45.0 and genome completeness of 94.6% for nine M. bovis field strains. Improvements were also observed for the single-use Flongle sequencer (mean Q36.0 accuracies and 80.3% genome completeness). These results implicate that taxon-specific basecalling of MinION and single-use Flongle Nanopore long reads are of great value to be implemented in rapid all-in-one WGS tools as evidenced for Mycoplasma bovis as an example.
Sections du résumé
BACKGROUND
BACKGROUND
Implementation of Third-Generation Sequencing approaches for Whole Genome Sequencing (WGS) all-in-one diagnostics in human and veterinary medicine, requires the rapid and accurate generation of consensus genomes. Over the last years, Oxford Nanopore Technologies (ONT) released various new devices (e.g. the Flongle R9.4.1 flow cell) and bioinformatics tools (e.g. the in 2019-released Bonito basecaller), allowing cheap and user-friendly cost-efficient introduction in various NGS workflows. While single read, overall consensus accuracies, and completeness of genome sequences has been improved dramatically, further improvements are required when working with non-frequently sequenced organisms like Mycoplasma bovis. As an important primary respiratory pathogen in cattle, rapid M. bovis diagnostics is crucial to allow timely and targeted disease control and prevention. Current complete diagnostics (including identification, strain typing, and antimicrobial resistance (AMR) detection) require combined culture-based and molecular approaches, of which the first can take 1-2 weeks. At present, cheap and quick long read all-in-one WGS approaches can only be implemented if increased accuracies and genome completeness can be obtained.
RESULTS
RESULTS
Here, a taxon-specific custom-trained Bonito v.0.1.3 basecalling model (custom-pg45) was implemented in various WGS assembly bioinformatics pipelines. Using MinION sequencing data, we showed improved consensus accuracies up to Q45.2 and Q46.7 for reference-based and Canu de novo assembled M. bovis genomes, respectively. Furthermore, the custom-pg45 model resulted in mean consensus accuracies of Q45.0 and genome completeness of 94.6% for nine M. bovis field strains. Improvements were also observed for the single-use Flongle sequencer (mean Q36.0 accuracies and 80.3% genome completeness).
CONCLUSIONS
CONCLUSIONS
These results implicate that taxon-specific basecalling of MinION and single-use Flongle Nanopore long reads are of great value to be implemented in rapid all-in-one WGS tools as evidenced for Mycoplasma bovis as an example.
Identifiants
pubmed: 33176691
doi: 10.1186/s12859-020-03856-0
pii: 10.1186/s12859-020-03856-0
pmc: PMC7661149
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
517Subventions
Organisme : Federaal Agentschap Voor de Veiligheid Van de Voedselketen
ID : RF 17/6313, MALDIRESP/MA
Organisme : Federaal Agentschap Voor de Veiligheid Van de Voedselketen
ID : RF 17/6312, PigMinION
Références
Bioinformatics. 2018 Jul 1;34(13):i142-i150
pubmed: 29949969
BMC Genomics. 2019 Jan 9;20(1):23
pubmed: 30626323
Brief Bioinform. 2019 Jul 19;20(4):1542-1559
pubmed: 29617724
Vet Microbiol. 2019 Nov;238:108432
pubmed: 31648729
Gigascience. 2019 May 1;8(5):
pubmed: 31089679
Bioinformatics. 2014 Aug 1;30(15):2114-20
pubmed: 24695404
Vet Microbiol. 2015 Aug 31;179(1-2):15-22
pubmed: 25824130
F1000Res. 2017 May 31;6:760
pubmed: 28794860
J Clin Microbiol. 2020 May 26;58(6):
pubmed: 32229599
Genome Res. 2017 May;27(5):722-736
pubmed: 28298431
Nat Microbiol. 2020 Mar;5(3):430-442
pubmed: 31844297
Nat Nanotechnol. 2009 Apr;4(4):265-70
pubmed: 19350039
Nat Methods. 2012 Mar 04;9(4):357-9
pubmed: 22388286
BMC Vet Res. 2017 Apr 8;13(1):97
pubmed: 28390431
Genome Biol. 2019 Jun 24;20(1):129
pubmed: 31234903
Infect Immun. 2011 Feb;79(2):982-3
pubmed: 21134966
PLoS Genet. 2013;9(1):e1003191
pubmed: 23300489
Sci Rep. 2018 Dec 4;8(1):17596
pubmed: 30514867
Microbiol Mol Biol Rev. 1998 Dec;62(4):1094-156
pubmed: 9841667
J Microbiol Methods. 2019 Apr;159:138-147
pubmed: 30849421
Genome Res. 2017 May;27(5):737-746
pubmed: 28100585
Appl Environ Microbiol. 2016 Oct 14;82(21):6386-6394
pubmed: 27542937
PLoS Comput Biol. 2017 Jun 8;13(6):e1005595
pubmed: 28594827
Gigascience. 2017 Apr 1;6(4):1-6
pubmed: 28327957
Nat Methods. 2016 Sep;13(9):751-4
pubmed: 27454285
Nat Rev Genet. 2016 May 17;17(6):333-51
pubmed: 27184599
Front Microbiol. 2019 Sep 13;10:2085
pubmed: 31572317
Nat Biotechnol. 2019 Jul;37(7):783-792
pubmed: 31235920
Bioinformatics. 2018 Aug 1;34(15):2666-2669
pubmed: 29547981
Genome Res. 2015 Jul;25(7):1043-55
pubmed: 25977477
Bioinformatics. 2020 Mar 1;36(5):1333-1343
pubmed: 31593235
Nat Methods. 2017 Apr;14(4):407-410
pubmed: 28218898
Nat Commun. 2016 Apr 15;7:11307
pubmed: 27079541
Gigascience. 2018 May 1;7(5):
pubmed: 29648610
J Vet Intern Med. 2018 May;32(3):1241-1252
pubmed: 29671903
Bioinformatics. 2020 Apr 15;36(8):2578-2580
pubmed: 31913436
PLoS Genet. 2016 Feb 12;12(2):e1005854
pubmed: 26870957
Proc Natl Acad Sci U S A. 2016 Dec 27;113(52):E8396-E8405
pubmed: 27956617
Nat Methods. 2021 Feb;18(2):165-169
pubmed: 33432244