MAAPER: model-based analysis of alternative polyadenylation using 3' end-linked reads.


Journal

Genome biology
ISSN: 1474-760X
Titre abrégé: Genome Biol
Pays: England
ID NLM: 100960660

Informations de publication

Date de publication:
10 08 2021
Historique:
received: 03 02 2021
accepted: 01 07 2021
entrez: 11 8 2021
pubmed: 12 8 2021
medline: 20 1 2022
Statut: epublish

Résumé

Most eukaryotic genes express alternative polyadenylation (APA) isoforms. A growing number of RNA sequencing methods, especially those used for single-cell transcriptome analysis, generate reads close to the polyadenylation site (PAS), termed nearSite reads, hence inherently containing information about APA isoform abundance. Here, we present a probabilistic model-based method named MAAPER to utilize nearSite reads for APA analysis. MAAPER predicts PASs with high accuracy and sensitivity and examines different types of APA events with robust statistics. We show MAAPER's performance with both bulk and single-cell data and its applicability in unpaired or paired experimental designs.

Identifiants

pubmed: 34376236
doi: 10.1186/s13059-021-02429-5
pii: 10.1186/s13059-021-02429-5
pmc: PMC8356463
doi:

Substances chimiques

3' Untranslated Regions 0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

222

Subventions

Organisme : NCI NIH HHS
ID : P30 CA010815
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM084089
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM129069
Pays : United States
Organisme : NCATS NIH HHS
ID : UL1 TR003017
Pays : United States

Informations de copyright

© 2021. The Author(s).

Références

Tian B, Manley JL. Alternative polyadenylation of mRNA precursors. Nat Rev Mol Cell Biol. 2017;18(1):18–30. https://doi.org/10.1038/nrm.2016.116 .
doi: 10.1038/nrm.2016.116 pubmed: 27677860
Gruber AJ, Zavolan M. Alternative cleavage and polyadenylation in health and disease. Nat Rev Genet. 2019;20(10):599–614. https://doi.org/10.1038/s41576-019-0145-z .
doi: 10.1038/s41576-019-0145-z
Nourse J, Spada S, Danckwardt S. Emerging roles of RNA 3'-end cleavage and polyadenylation in pathogenesis, diagnosis and therapy of human disorders. Biomolecules. 2020;10(6). https://doi.org/10.3390/biom10060915 .
Xia Z, Donehower LA, Cooper TA, Neilson JR, Wheeler DA, Wagner EJ, et al. Dynamic analyses of alternative polyadenylation from RNA-seq reveal a 3'-UTR landscape across seven tumour types. Nat Commun. 2014;5(1):5274. https://doi.org/10.1038/ncomms6274 .
doi: 10.1038/ncomms6274 pubmed: 25409906
Grassi E, Mariella E, Lembo A, Molineris I, Provero P. Roar: detecting alternative polyadenylation with standard mRNA sequencing libraries. BMC Bioinformatics. 2016;17(1):423. https://doi.org/10.1186/s12859-016-1254-8 .
doi: 10.1186/s12859-016-1254-8 pubmed: 27756200 pmcid: 5069797
Ha KCH, Blencowe BJ, Morris Q. QAPA: a new method for the systematic analysis of alternative polyadenylation from RNA-seq data. Genome Biol. 2018;19(1):45. https://doi.org/10.1186/s13059-018-1414-4 .
doi: 10.1186/s13059-018-1414-4 pubmed: 29592814 pmcid: 5874996
Wang R, Tian B. APAlyzer: a bioinformatics package for analysis of alternative polyadenylation isoforms. Bioinformatics. 2020;36(12):3907–9. https://doi.org/10.1093/bioinformatics/btaa266 .
doi: 10.1093/bioinformatics/btaa266 pubmed: 32321166 pmcid: 7320624
Zhang X, Li T, Liu F, Chen Y, Yao J, Li Z, et al. Comparative analysis of droplet-based ultra-high-throughput single-cell RNA-seq systems. Mol Cell. 2019;73(1):130–42 e135. https://doi.org/10.1016/j.molcel.2018.10.020 .
doi: 10.1016/j.molcel.2018.10.020 pubmed: 30472192
Shepard PJ, Choi EA, Lu J, Flanagan LA, Hertel KJ, Shi Y. Complex and dynamic landscape of RNA polyadenylation revealed by PAS-Seq. RNA. 2011;17(4):761–72. https://doi.org/10.1261/rna.2581711 .
doi: 10.1261/rna.2581711 pubmed: 21343387 pmcid: 3062186
Derti A, Garrett-Engele P, Macisaac KD, Stevens RC, Sriram S, Chen R, et al. A quantitative atlas of polyadenylation in five mammals. Genome Res. 2012;22(6):1173–83. https://doi.org/10.1101/gr.132563.111 .
doi: 10.1101/gr.132563.111 pubmed: 22454233 pmcid: 3371698
Lianoglou S, Garg V, Yang JL, Leslie CS, Mayr C. Ubiquitously transcribed genes use alternative polyadenylation to achieve tissue-specific expression. Genes Dev. 2013;27(21):2380–96. https://doi.org/10.1101/gad.229328.113 .
doi: 10.1101/gad.229328.113 pubmed: 24145798 pmcid: 3828523
Moll P, Ante M, Seitz A, Reda T. QuantSeq 3′ mRNA sequencing for RNA quantification. Nat Methods. 2014;11:i–iii.
doi: 10.1038/nmeth.f.376
Harrison PF, Powell DR, Clancy JL, Preiss T, Boag PR, Traven A, et al. PAT-seq: a method to study the integration of 3'-UTR dynamics with gene expression in the eukaryotic transcriptome. RNA. 2015;21(8):1502–10. https://doi.org/10.1261/rna.048355.114 .
doi: 10.1261/rna.048355.114 pubmed: 26092945 pmcid: 4509939
Zheng GX, Terry JM, Belgrader P, Ryvkin P, Bent ZW, Wilson R, et al. Massively parallel digital transcriptional profiling of single cells. Nat Commun. 2017;8(1):14049. https://doi.org/10.1038/ncomms14049 .
doi: 10.1038/ncomms14049 pubmed: 28091601 pmcid: 5241818
Macosko EZ, Basu A, Satija R, Nemesh J, Shekhar K, Goldman M, et al. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell. 2015;161(5):1202–14. https://doi.org/10.1016/j.cell.2015.05.002 .
doi: 10.1016/j.cell.2015.05.002 pubmed: 26000488 pmcid: 4481139
Patrick R, Humphreys DT, Janbandhu V, Oshlack A, Ho JWK, Harvey RP, et al. Sierra: discovery of differential transcript usage from polyA-captured single-cell RNA-seq data. Genome Biol. 2020;21(1):167. https://doi.org/10.1186/s13059-020-02071-7 .
doi: 10.1186/s13059-020-02071-7 pubmed: 32641141 pmcid: 7341584
Shulman ED, Elkon R. Cell-type-specific analysis of alternative polyadenylation using single-cell transcriptomics data. Nucleic acids research. 2019;47(19):10027–39. https://doi.org/10.1093/nar/gkz781 .
doi: 10.1093/nar/gkz781 pubmed: 31501864 pmcid: 6821429
Wang R, Nambiar R, Zheng D, Tian B. PolyA_DB 3 catalogs cleavage and polyadenylation sites identified by deep sequencing in multiple genomes. Nucleic Acids Res. 2018;46(D1):D315–d319. https://doi.org/10.1093/nar/gkx1000 .
doi: 10.1093/nar/gkx1000 pubmed: 29069441
Hoque M, Ji Z, Zheng D, Luo W, Li W, You B, et al. Analysis of alternative cleavage and polyadenylation by 3' region extraction and deep sequencing. Nat Methods. 2013;10(2):133–9. https://doi.org/10.1038/nmeth.2288 .
doi: 10.1038/nmeth.2288 pubmed: 23241633
Zheng D, Wang R, Ding Q, Wang T, Xie B, Wei L, et al. Cellular stress alters 3'UTR landscape through alternative polyadenylation and isoform-specific degradation. Nat Commun. 2018;9(1):2268. https://doi.org/10.1038/s41467-018-04730-7 .
doi: 10.1038/s41467-018-04730-7 pubmed: 29891946 pmcid: 5995920
Tian B, Graber JH. Signals for pre-mRNA cleavage and polyadenylation. Wiley Interdiscip Rev RNA. 2012;3(3):385–96. https://doi.org/10.1002/wrna.116 .
doi: 10.1002/wrna.116 pubmed: 22012871
Nam DK, Lee S, Zhou G, Cao X, Wang C, Clark T, et al. Oligo(dT) primer generates a high frequency of truncated cDNAs through internal poly(A) priming during reverse transcription. Proc Natl Acad Sci U S A. 2002;99(9):6152–6. https://doi.org/10.1073/pnas.092140899 .
doi: 10.1073/pnas.092140899 pubmed: 11972056 pmcid: 122918
Vento-Tormo R, Efremova M, Botting RA, Turco MY, Vento-Tormo M, Meyer KB, et al. Single-cell reconstruction of the early maternal-fetal interface in humans. Nature. 2018;563(7731):347–53. https://doi.org/10.1038/s41586-018-0698-6 .
doi: 10.1038/s41586-018-0698-6 pubmed: 30429548
Tsang JCH, Vong JSL, Ji L, Poon LCY, Jiang P, Lui KO, et al. Integrative single-cell and cell-free plasma RNA transcriptomics elucidates placental cellular dynamics. Proc Natl Acad Sci U S A. 2017;114(37):E7786–95. https://doi.org/10.1073/pnas.1710470114 .
doi: 10.1073/pnas.1710470114 pubmed: 28830992 pmcid: 5604038
Cheng LC, Zheng D, Baljinnyam E, Sun F, Ogami K, Yeung PL, et al. Widespread transcript shortening through alternative polyadenylation in secretory cell differentiation. Nat Commun. 2020;11(1):3182. https://doi.org/10.1038/s41467-020-16959-2 .
doi: 10.1038/s41467-020-16959-2 pubmed: 32576858 pmcid: 7311474
Tian B, Hu J, Zhang H, Lutz CS. A large-scale analysis of mRNA polyadenylation of human and mouse genes. Nucleic Acids Res. 2005;33(1):201–12. https://doi.org/10.1093/nar/gki158 .
doi: 10.1093/nar/gki158 pubmed: 15647503 pmcid: 546146
Herrmann CJ, Schmidt R, Kanitz A, Artimo P, Gruber AJ, Zavolan M. PolyASite 2.0: a consolidated atlas of polyadenylation sites from 3' end sequencing. Nucleic Acids Res. 2020;48(D1):D174–d179. https://doi.org/10.1093/nar/gkz918 .
doi: 10.1093/nar/gkz918 pubmed: 31617559
Li WV, Li JJ. An accurate and robust imputation method scImpute for single-cell RNA-seq data. Nat Commun. 2018;9(1):997. https://doi.org/10.1038/s41467-018-03405-7 .
doi: 10.1038/s41467-018-03405-7 pubmed: 29520097 pmcid: 5843666
Gao Y, Li L, Amos CI, Li W. Analysis of alternative polyadenylation from single-cell RNA-seq using scDaPars reveals cell subpopulations invisible to gene expression. Genome Res.  2021;pp.gr-271346. https://doi.org/10.1101/gr.271346.120 .
von Rango U, Krusche CA, Kertschanska S, Alfer J, Kaufmann P, Beier HM. Apoptosis of extravillous trophoblast cells limits the trophoblast invasion in uterine but not in tubal pregnancy during first trimester. Placenta. 2003;24(10):929–40. https://doi.org/10.1016/S0143-4004(03)00168-1 .
doi: 10.1016/S0143-4004(03)00168-1
Fisher RA. On the interpretation of χ 2 from contingency tables, and the calculation of P. J R Stat Soc. 1922;85(1):87–94. https://doi.org/10.2307/2340521 .
doi: 10.2307/2340521
Li WV, Li S, Tong X, Deng L, Shi H, Li JJ. AIDE: annotation-assisted isoform discovery with high precision. Genome Res. 2019;29(12):2056–72. https://doi.org/10.1101/gr.251108.119 .
doi: 10.1101/gr.251108.119 pubmed: 31694868 pmcid: 6886511
Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B (Methodological). 1977;39:1–22.
Benjamini Y, Hochberg Y. Controlling the False Discovery Rate - a Practical and Powerful Approach to Multiple Testing. J R Stat Soc Ser B Stat Methodol. 1995;57:289–300.
Shi P, Li H. A model for paired-multinomial data and its application to analysis of data on a taxonomic tree. Biometrics. 2017;73(4):1266–78. https://doi.org/10.1111/biom.12681 .
doi: 10.1111/biom.12681 pubmed: 28369713 pmcid: 5623182
Bushnell B. BBMap: a fast, accurate, splice-aware aligner. Berkeley: Lawrence Berkeley National Lab. (LBNL); 2014.
Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15–21. https://doi.org/10.1093/bioinformatics/bts635 .
doi: 10.1093/bioinformatics/bts635
Satija R, Farrell JA, Gennert D, Schier AF, Regev A. Spatial reconstruction of single-cell gene expression data. Nat Biotechnol. 2015;33(5):495–502. https://doi.org/10.1038/nbt.3192 .
doi: 10.1038/nbt.3192 pubmed: 25867923 pmcid: 4430369
Li WV: Github repository of MAAPER. Github. 2021. https://github.com/Vivianstats/MAAPER .
Li WV: Source code of MAAPER analyses. zenodo. 2021. https://doi.org/10.5281/zenodo.4760264 .
Li WV, Zheng D, Wang R, Tian B: QuantSeq FWD and QuantSeq REV using RNA samples from mouse NIH3T3 cells. Gene Expression Omnibus. 2021. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE164958 .

Auteurs

Wei Vivian Li (WV)

Department of Biostatistics and Epidemiology, Rutgers School of Public Health, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA. vivian.li@rutgers.edu.

Dinghai Zheng (D)

Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, 07103, USA.

Ruijia Wang (R)

Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, 07103, USA.

Bin Tian (B)

Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, 07103, USA. btian@wistar.org.
Program in Gene Expression and Regulation, and Center for Systems and Computational Biology, The Wistar Institute, Philadelphia, PA, 19104, USA. btian@wistar.org.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH