A Systematic Evaluation of Semispecific Peptide Search Parameter Enables Identification of Previously Undescribed N-Terminal Peptides and Conserved Proteolytic Processing in Cancer Cell Lines.

NCI-60 reanalysis endogenous proteolysis fragment mass tolerance mass spectrometry semispecific peptide search

Journal

Proteomes
ISSN: 2227-7382
Titre abrégé: Proteomes
Pays: Switzerland
ID NLM: 101621966

Informations de publication

Date de publication:
25 May 2021
Historique:
received: 04 05 2021
revised: 21 05 2021
accepted: 22 05 2021
entrez: 2 6 2021
pubmed: 3 6 2021
medline: 3 6 2021
Statut: epublish

Résumé

Liquid chromatography-tandem mass spectrometry (LC-MS/MS) has become the most commonly used technique in explorative proteomic research. A variety of open-source tools for peptide-spectrum matching have become available. Most analyses of explorative MS data are performed using conventional settings, such as fully specific enzymatic constraints. Here we evaluated the impact of the fragment mass tolerance in combination with the enzymatic constraints on the performance of three search engines. Three open-source search engines (Myrimatch, X! Tandem, and MSGF+) were evaluated concerning the suitability in semi- and unspecific searches as well as the importance of accurate fragment mass spectra in non-specific peptide searches. We then performed a semispecific reanalysis of the published NCI-60 deep proteome data applying the most suited parameters. Semi- and unspecific LC-MS/MS data analyses particularly benefit from accurate fragment mass spectra while this effect is less pronounced for conventional, fully specific peptide-spectrum matching. Search speed differed notably between the three search engines for semi- and non-specific peptide-spectrum matching. Semispecific reanalysis of NCI-60 proteome data revealed hundreds of previously undescribed N-terminal peptides, including cases of proteolytic processing or likely alternative translation start sites, some of which were ubiquitously present in all cell lines of the reanalyzed panel. Highly accurate MS2 fragment data in combination with modern open-source search algorithms enable the confident identification of semispecific peptides from large proteomic datasets. The identification of previously undescribed N-terminal peptides in published studies highlights the potential of future reanalysis and data mining in proteomic datasets.

Identifiants

pubmed: 34070654
pii: proteomes9020026
doi: 10.3390/proteomes9020026
pmc: PMC8162549
pii:
doi:

Types de publication

Journal Article

Langues

eng

Subventions

Organisme : Deutsche Forschungsgemeinschaft
ID : SCHI 871/11-1, SCHI 871/15-1, GR 4553/5-1, PA 2807/3-1, INST 39/1244-1 (P12), INST 39/766-3 (Z1), GRK 2606 "ProtPath"
Organisme : Bundesministerium für Bildung und Forschung
ID : 01KU1916, 01KU1915A
Organisme : German-Israeli Foundation for Scientific Research and Development
ID : grant no. 1444

Références

Trends Immunol. 2008 Feb;29(2):75-82
pubmed: 18182322
J Proteome Res. 2012 Jul 6;11(7):3914-20
pubmed: 22583024
IEEE/ACM Trans Comput Biol Bioinform. 2012 Sep-Oct;9(5):1273-80
pubmed: 22689082
Nat Commun. 2021 Mar 16;12(1):1693
pubmed: 33727531
Nat Protoc. 2007;2(8):1896-906
pubmed: 17703201
Transl Oncol. 2018 Dec;11(6):1307-1322
pubmed: 30172883
Mol Cell Proteomics. 2008 May;7(5):962-70
pubmed: 18216375
J Proteome Res. 2007 Feb;6(2):654-61
pubmed: 17269722
Eur J Cardiothorac Surg. 2017 Jun 1;51(6):1063-1071
pubmed: 28329269
Clin Proteomics. 2018 Mar 6;15:11
pubmed: 29527141
Biochim Biophys Acta. 2015 Jun;1854(6):559-80
pubmed: 25315853
Proc Natl Acad Sci U S A. 2000 Aug 15;97(17):9425-30
pubmed: 10920196
Gigascience. 2019 Dec 1;8(12):
pubmed: 31816088
Chem Rev. 2018 Feb 14;118(3):1137-1168
pubmed: 29265812
Matrix Biol. 2018 Mar;66:1-21
pubmed: 29158163
Nucleic Acids Res. 2018 Jul 2;46(W1):W537-W544
pubmed: 29790989
Nat Protoc. 2011 Jan;6(1):111-20
pubmed: 21212787
J Proteome Res. 2010 Feb 5;9(2):1138-43
pubmed: 19938873
Nat Biotechnol. 2008 Jun;26(6):685-94
pubmed: 18500335
Nat Methods. 2016 Aug 30;13(9):741-8
pubmed: 27575624
Genome Res. 2018 Jan;28(1):25-36
pubmed: 29162641
Nat Biotechnol. 2010 Mar;28(3):281-8
pubmed: 20208520
Bioinformatics. 2008 Jan 1;24(1):102-9
pubmed: 18033797
Cell Rep. 2013 Aug 15;4(3):609-20
pubmed: 23933261
Nature. 2016 Sep 14;537(7620):347-55
pubmed: 27629641
Neoplasia. 2018 Feb;20(2):140-151
pubmed: 29248718
Bioinformatics. 2008 Nov 1;24(21):2534-6
pubmed: 18606607
Expert Rev Proteomics. 2017 May;14(5):409-418
pubmed: 28395554
Mol Cell Proteomics. 2013 Mar;12(3):611-25
pubmed: 23233448

Auteurs

Matthias Fahrner (M)

Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany.
Faculty of Biology, Albert-Ludwigs-University Freiburg, 79104 Freiburg, Germany.
Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, 79104 Freiburg, Germany.

Lucas Kook (L)

Epidemiology, Biostatistics & Prevention Institute, University of Zurich, 8001 Zurich, Switzerland.
Institute for Data Analysis and Process Design, Zurich University of Applied Sciences, 8401 Winterthur, Switzerland.

Klemens Fröhlich (K)

Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany.
Faculty of Biology, Albert-Ludwigs-University Freiburg, 79104 Freiburg, Germany.
Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, 79104 Freiburg, Germany.

Martin L Biniossek (ML)

Institute for Molecular Medicine and Cell Research, University of Freiburg, 79104 Freiburg, Germany.

Oliver Schilling (O)

Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, 79106 Freiburg, Germany.
Faculty of Biology, Albert-Ludwigs-University Freiburg, 79104 Freiburg, Germany.
German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany.
BIOSS Centre for Biological Signaling Studies, University of Freiburg, 79104 Freiburg, Germany.

Classifications MeSH