RNA-Bloom enables reference-free and reference-guided sequence assembly for single-cell transcriptomes.
Journal
Genome research
ISSN: 1549-5469
Titre abrégé: Genome Res
Pays: United States
ID NLM: 9518021
Informations de publication
Date de publication:
08 2020
08 2020
Historique:
received:
11
12
2019
accepted:
23
07
2020
pubmed:
21
8
2020
medline:
11
11
2021
entrez:
21
8
2020
Statut:
ppublish
Résumé
Despite the rapid advance in single-cell RNA sequencing (scRNA-seq) technologies within the last decade, single-cell transcriptome analysis workflows have primarily used gene expression data while isoform sequence analysis at the single-cell level still remains fairly limited. Detection and discovery of isoforms in single cells is difficult because of the inherent technical shortcomings of scRNA-seq data, and existing transcriptome assembly methods are mainly designed for bulk RNA samples. To address this challenge, we developed RNA-Bloom, an assembly algorithm that leverages the rich information content aggregated from multiple single-cell transcriptomes to reconstruct cell-specific isoforms. Assembly with RNA-Bloom can be either reference-guided or reference-free, thus enabling unbiased discovery of novel isoforms or foreign transcripts. We compared both assembly strategies of RNA-Bloom against five state-of-the-art reference-free and reference-based transcriptome assembly methods. In our benchmarks on a simulated 384-cell data set, reference-free RNA-Bloom reconstructed 37.9%-38.3% more isoforms than the best reference-free assembler, whereas reference-guided RNA-Bloom reconstructed 4.1%-11.6% more isoforms than reference-based assemblers. When applied to a real 3840-cell data set consisting of more than 4 billion reads, RNA-Bloom reconstructed 9.7%-25.0% more isoforms than the best competing reference-based and reference-free approaches evaluated. We expect RNA-Bloom to boost the utility of scRNA-seq data beyond gene expression analysis, expanding what is informatically accessible now.
Identifiants
pubmed: 32817073
pii: gr.260174.119
doi: 10.1101/gr.260174.119
pmc: PMC7462077
doi:
Substances chimiques
Protein Isoforms
0
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
1191-1200Subventions
Organisme : NHGRI NIH HHS
ID : R01 HG007182
Pays : United States
Informations de copyright
© 2020 Nip et al.; Published by Cold Spring Harbor Laboratory Press.
Références
Nat Methods. 2018 Aug;15(8):563-565
pubmed: 30065371
Cell. 2015 May 21;161(5):1202-1214
pubmed: 26000488
Nat Biotechnol. 2011 May 15;29(7):644-52
pubmed: 21572440
Nat Biotechnol. 2016 May;34(5):525-7
pubmed: 27043002
Nat Biotechnol. 2017 Dec;35(12):1167-1169
pubmed: 29131147
Algorithms Mol Biol. 2013 Sep 16;8(1):22
pubmed: 24040893
BMC Bioinformatics. 2011 Aug 04;12:323
pubmed: 21816040
Science. 2019 Aug 23;365(6455):
pubmed: 31439762
Bioinformatics. 2018 Aug 15;34(16):2846-2847
pubmed: 29659703
Bioinformatics. 2017 May 1;33(9):1324-1330
pubmed: 28453674
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
Cell Stem Cell. 2015 Oct 1;17(4):471-85
pubmed: 26431182
Nat Methods. 2010 Nov;7(11):909-12
pubmed: 20935650
Genome Res. 2017 May;27(5):768-777
pubmed: 28232478
Gigascience. 2019 Sep 1;8(9):
pubmed: 31494669
Mol Cell. 2017 Jul 6;67(1):148-161.e5
pubmed: 28673540
Gigascience. 2019 May 1;8(5):
pubmed: 31077315
Genome Res. 2015 Oct;25(10):1499-507
pubmed: 26430160
Nat Biotechnol. 2019 Aug;37(8):907-915
pubmed: 31375807
Nat Methods. 2013 Nov;10(11):1096-8
pubmed: 24056875
Front Immunol. 2019 Jun 04;10:1170
pubmed: 31214167
Bioinformatics. 2018 Sep 1;34(17):i884-i890
pubmed: 30423086
Mol Neurodegener. 2017 Aug 2;12(1):56
pubmed: 28768545
Nat Biotechnol. 2015 Mar;33(3):290-5
pubmed: 25690850
Nucleic Acids Res. 2019 Sep 19;47(16):e93
pubmed: 31216024
Science. 2017 Aug 18;357(6352):661-667
pubmed: 28818938
Bioinformatics. 2016 Nov 15;32(22):3492-3494
pubmed: 27423894
Nucleic Acids Res. 2018 Jun 1;46(10):e62
pubmed: 29548006
F1000Res. 2020 Apr 28;9:304
pubmed: 32489650
Bioinformatics. 2017 Feb 1;33(3):425-427
pubmed: 28172415
Genome Biol. 2016 Jan 26;17:13
pubmed: 26813401
Nature. 2018 Oct;562(7727):367-372
pubmed: 30283141
Int J Genomics. 2015;2015:196591
pubmed: 26539459
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242
Nat Commun. 2017 Jan 16;8:14049
pubmed: 28091601
Bioinformatics. 2016 Jul 15;32(14):2210-2
pubmed: 27153654
Curr Opin Virol. 2018 Apr;29:39-50
pubmed: 29558678
Genome Biol. 2019 Apr 9;20(1):70
pubmed: 30961669
Gigascience. 2015 Oct 19;4:48
pubmed: 26500767
Genome Biol. 2018 Aug 10;19(1):110
pubmed: 30097058