Sequencing of individual barcoded cDNAs using Pacific Biosciences and Oxford Nanopore Technologies reveals platform-specific error patterns.


Journal

Genome research
ISSN: 1549-5469
Titre abrégé: Genome Res
Pays: United States
ID NLM: 9518021

Informations de publication

Date de publication:
04 2022
Historique:
received: 18 11 2021
accepted: 05 03 2022
pubmed: 19 3 2022
medline: 13 4 2022
entrez: 18 3 2022
Statut: ppublish

Résumé

Long-read transcriptomics require understanding error sources inherent to technologies. Current approaches cannot compare methods for an individual RNA molecule. Here, we present a novel platform-comparison method that combines barcoding strategies and long-read sequencing to sequence cDNA copies representing an individual RNA molecule on both Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT). We compare these long-read pairs in terms of sequence content and isoform patterns. Although individual read pairs show high similarity, we find differences in (1) aligned length, (2) transcription start site (TSS), (3) polyadenylation site (poly(A)-site) assignment, and (4) exon-intron structures. Overall, 25% of read pairs disagree on either TSS, poly(A)-site, or splice site. Intron-chain disagreement typically arises from alignment errors of microexons and complicated splice sites. Our single-molecule technology comparison reveals that inconsistencies are often caused by sequencing error-induced inaccurate ONT alignments, especially to downstream GUNNGU donor motifs. However, annotation-disagreeing upstream shifts in NAGNAG acceptors in ONT are often confirmed by PacBio and are thus likely real. In both barcoded and nonbarcoded ONT reads, we find that intron number and proximity of GU/AGs better predict inconsistencies with the annotation than read quality alone. We summarize these findings in an annotation-based algorithm for spliced alignment correction that improves subsequent transcript construction with ONT reads.

Identifiants

pubmed: 35301264
pii: gr.276405.121
doi: 10.1101/gr.276405.121
pmc: PMC8997348
doi:

Substances chimiques

DNA, Complementary 0
RNA 63231-63-0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

726-737

Subventions

Organisme : NIGMS NIH HHS
ID : R01 GM135247
Pays : United States
Organisme : NIMH NIH HHS
ID : RF1 MH121267
Pays : United States

Informations de copyright

© 2022 Mikheenko et al.; Published by Cold Spring Harbor Laboratory Press.

Références

Nat Methods. 2018 Mar;15(3):201-206
pubmed: 29334379
Science. 2015 Mar 6;347(6226):1138-42
pubmed: 25700174
Nucleic Acids Res. 2021 Jan 8;49(D1):D916-D923
pubmed: 33270111
Nucleic Acids Res. 2015 Oct 15;43(18):e116
pubmed: 26040699
Nat Biotechnol. 2018 Oct 15;:
pubmed: 30320766
Nat Biotechnol. 2015 Jul;33(7):736-42
pubmed: 25985263
Nat Methods. 2013 Dec;10(12):1185-91
pubmed: 24185836
Nat Commun. 2021 Mar 1;12(1):1361
pubmed: 33649327
Proc Natl Acad Sci U S A. 2018 Sep 25;115(39):9726-9731
pubmed: 30201725
Plant Methods. 2020 Jun 12;16:85
pubmed: 32536962
Nat Commun. 2020 Jan 27;11(1):527
pubmed: 31988292
Genome Res. 2018 Jul;28(7):1096
pubmed: 29967126
Genome Biol. 2019 Dec 16;20(1):274
pubmed: 31842925
G3 (Bethesda). 2013 Mar;3(3):387-97
pubmed: 23450794
Nat Biotechnol. 2014 Sep;32(9):888-95
pubmed: 25150837
Proc Natl Acad Sci U S A. 2013 Dec 10;110(50):E4821-30
pubmed: 24282307
PLoS One. 2012;7(10):e46679
pubmed: 23056399
Cell. 2015 May 21;161(5):1202-1214
pubmed: 26000488
Nat Methods. 2013 Dec;10(12):1177-84
pubmed: 24185837
Nat Biotechnol. 2014 Sep;32(9):915-925
pubmed: 25150835
RNA. 2010 Feb;16(2):290-8
pubmed: 20038629
Theory Biosci. 2012 Dec;131(4):281-5
pubmed: 22872506
Genome Res. 2018 Feb;28(2):231-242
pubmed: 29196558
Nucleic Acids Res. 2020 Jan 8;48(D1):D174-D179
pubmed: 31617559
Genome Biol. 2019 Dec 18;20(1):287
pubmed: 31849338
Genome Biol. 2015 Jan 05;16:22
pubmed: 25723102
Nat Commun. 2019 Jul 16;10(1):3120
pubmed: 31311926
Bioinformatics. 2021 Jul 24;:
pubmed: 34302453
Nat Commun. 2021 Jan 19;12(1):463
pubmed: 33469025
Nat Commun. 2019 Feb 14;10(1):754
pubmed: 30765700
Nat Commun. 2021 Feb 8;12(1):992
pubmed: 33558522
Nature. 2008 Nov 27;456(7221):470-6
pubmed: 18978772
Genome Res. 2019 Aug;29(8):1329-1342
pubmed: 31201211
Genome Biol. 2019 Dec 16;20(1):278
pubmed: 31842956
Bioinformatics. 2013 Jan 1;29(1):15-21
pubmed: 23104886
F1000Res. 2020 Apr 28;9:304
pubmed: 32489650
Nat Commun. 2020 Mar 18;11(1):1438
pubmed: 32188845
Nat Biotechnol. 2022 Jul;40(7):1082-1092
pubmed: 35256815
F1000Res. 2017 Feb 3;6:100
pubmed: 28868132
Genome Biol. 2021 Mar 1;22(1):72
pubmed: 33648554
Elife. 2016 Feb 01;5:e11752
pubmed: 26829591
Gigascience. 2020 Jun 1;9(6):
pubmed: 32520350
Nat Biotechnol. 2012 Jul 01;30(7):693-700
pubmed: 22750884
Nat Commun. 2017 Jul 19;8:16027
pubmed: 28722025
Nat Biotechnol. 2013 Nov;31(11):1009-14
pubmed: 24108091
Sci Rep. 2016 Aug 24;6:31602
pubmed: 27554526
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242
Genome Res. 2020 Jun;30(6):898-909
pubmed: 32540955
Proc Natl Acad Sci U S A. 2014 Jul 8;111(27):9869-74
pubmed: 24961374
PLoS One. 2013 Dec 04;8(12):e82138
pubmed: 24324759
Science. 2009 Jan 2;323(5910):133-8
pubmed: 19023044

Auteurs

Alla Mikheenko (A)

Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg 199034, Russia.

Andrey D Prjibelski (AD)

Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg 199034, Russia.

Anoushka Joglekar (A)

Brain and Mind Research Institute and Center for Neurogenetics, Weill Cornell Medicine, New York, New York 10065, USA.

Hagen U Tilgner (HU)

Brain and Mind Research Institute and Center for Neurogenetics, Weill Cornell Medicine, New York, New York 10065, USA.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Coal Metagenome Phylogeny Bacteria Genome, Bacterial
Humans RNA, Circular Exosomes Cell Proliferation Epithelial-Mesenchymal Transition

Classifications MeSH