Deep profiling and custom databases improve detection of proteoforms generated by alternative splicing.


Journal

Genome research
ISSN: 1549-5469
Titre abrégé: Genome Res
Pays: United States
ID NLM: 9518021

Informations de publication

Date de publication:
12 2019
Historique:
received: 15 01 2019
accepted: 16 09 2019
pubmed: 16 11 2019
medline: 16 4 2020
entrez: 16 11 2019
Statut: ppublish

Résumé

Alternative pre-mRNA splicing has long been proposed to contribute greatly to proteome complexity. However, the extent to which mature mRNA isoforms are successfully translated into protein remains controversial. Here, we used high-throughput RNA sequencing and mass spectrometry (MS)-based proteomics to better evaluate the translation of alternatively spliced mRNAs. To increase proteome coverage and improve protein quantitation, we optimized cell fractionation and sample processing steps at both the protein and peptide level. Furthermore, we generated a custom peptide database trained on analysis of RNA-seq data with MAJIQ, an algorithm optimized to detect and quantify differential and unannotated splice junction usage. We matched tandem mass spectra acquired by data-dependent acquisition (DDA) against our custom RNA-seq based database, as well as SWISS-PROT and RefSeq databases to improve identification of splicing-derived proteoforms by 28% compared with use of the SWISS-PROT database alone. Altogether, we identified peptide evidence for 554 alternate proteoforms corresponding to 274 genes. Our increased depth and detection of proteins also allowed us to track changes in the transcriptome and proteome induced by T-cell stimulation, as well as fluctuations in protein subcellular localization. In sum, our data here confirm that use of generic databases in proteomic studies underestimates the number of spliced mRNA isoforms that are translated into protein and provides a workflow that improves isoform detection in large-scale proteomic experiments.

Identifiants

pubmed: 31727681
pii: gr.248435.119
doi: 10.1101/gr.248435.119
pmc: PMC6886501
doi:

Substances chimiques

Peptides 0
Protein Isoforms 0
RNA Isoforms 0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

2046-2055

Subventions

Organisme : NIAID NIH HHS
ID : R01 AI118891
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM110174
Pays : United States
Organisme : NIGMS NIH HHS
ID : R35 GM118048
Pays : United States
Organisme : NIGMS NIH HHS
ID : T32 GM008216
Pays : United States

Informations de copyright

© 2019 Agosto et al.; Published by Cold Spring Harbor Laboratory Press.

Références

Cancer Discov. 2015 Dec;5(12):1282-95
pubmed: 26516065
J Proteome Res. 2018 Dec 7;17(12):4235-4242
pubmed: 30289715
Nat Struct Mol Biol. 2016 Dec;23(12):1117-1123
pubmed: 27820807
Mol Cell. 2003 Nov;12(5):1317-24
pubmed: 14636588
Nat Genet. 2008 Dec;40(12):1413-5
pubmed: 18978789
J Proteome Res. 2018 Sep 7;17(9):2917-2924
pubmed: 30114372
Genome Res. 2017 Aug;27(8):1360-1370
pubmed: 28512194
Eur J Biochem. 1991 May 23;198(1):101-6
pubmed: 2040271
Wiley Interdiscip Rev RNA. 2015 May-Jun;6(3):311-26
pubmed: 25630614
Genome Res. 2013 Oct;23(10):1615-23
pubmed: 23783272
Mol Cell Biol. 2000 Jan;20(1):70-80
pubmed: 10594010
Proc Natl Acad Sci U S A. 2010 Feb 2;107(5):1894-9
pubmed: 20133837
PLoS One. 2018 Mar 27;13(3):e0194518
pubmed: 29584760
Nat Biotechnol. 2010 May;28(5):511-5
pubmed: 20436464
Mol Cell Biol. 2007 Oct;27(19):6972-84
pubmed: 17664280
RNA Biol. 2016 Jun 2;13(6):569-81
pubmed: 27096301
Mol Cell Proteomics. 2018 Mar;17(3):422-430
pubmed: 29222161
Nature. 2008 Mar 13;452(7184):230-3
pubmed: 18337823
Nat Struct Mol Biol. 2012 Feb 05;19(3):346-54
pubmed: 22307054
Proteomics. 2014 May;14(9):1006-0
pubmed: 24678027
Eur J Immunol. 2017 Jun;47(6):946-953
pubmed: 28475283
Sci Signal. 2011 Oct 4;4(193):ra66
pubmed: 21971040
Nat Methods. 2014 Nov;11(11):1114-25
pubmed: 25357241
Elife. 2016 Jan 06;5:null
pubmed: 26735365
Mol Cell Biol. 2011 Jun;31(11):2184-95
pubmed: 21444716
Brain Struct Funct. 2013 Nov;218(6):1531-49
pubmed: 23152144
Mol Cell Proteomics. 2015 May;14(5):1400-10
pubmed: 25724911
Genes Dev. 2015 Oct 1;29(19):2054-66
pubmed: 26443849
Nat Methods. 2007 Nov;4(11):923-5
pubmed: 17952086
Nature. 2008 Nov 27;456(7221):470-6
pubmed: 18978772
RNA. 2012 May;18(5):1029-40
pubmed: 22454538
Methods Mol Biol. 2014;1126:243-55
pubmed: 24549669
Nature. 2010 Jan 21;463(7279):364-8
pubmed: 20010808
Oncogene. 2006 Dec 4;25(57):7492-504
pubmed: 17143293
RNA. 2007 Apr;13(4):563-72
pubmed: 17307815
Mol Cell Proteomics. 2013 Aug;12(8):2341-53
pubmed: 23629695
Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50
pubmed: 16199517
Proc Natl Acad Sci U S A. 2018 Aug 28;115(35):E8181-E8190
pubmed: 30104386
Immunol Rev. 2010 May;235(1):93-104
pubmed: 20536557
J Proteome Res. 2009 Jun;8(6):3176-81
pubmed: 19338334
J Proteome Res. 2015 Apr 3;14(4):1880-7
pubmed: 25732134
Int Immunol. 2004 Dec;16(12):1711-21
pubmed: 15492022
Genes Dev. 1991 Apr;5(4):656-69
pubmed: 2010090
Acta Physiol (Oxf). 2012 Jan;204(1):74-109
pubmed: 21624092
Elife. 2016 Feb 01;5:e11752
pubmed: 26829591
Cell Rep. 2016 Mar 29;14(12):2797-808
pubmed: 26997278

Auteurs

Laura M Agosto (LM)

Biochemistry and Molecular Biophysics Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.
Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.
Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.

Matthew R Gazzara (MR)

Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.
Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.

Caleb M Radens (CM)

Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.
Genetics and Epigenetics, Cell & Molecular Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.

Simone Sidoli (S)

Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.
Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.

Josue Baeza (J)

Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.
Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.

Benjamin A Garcia (BA)

Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.
Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.

Kristen W Lynch (KW)

Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH