The Choice of Search Engine Affects Sequencing Depth and HLA Class I Allele-Specific Peptide Repertoires.
HLA
MHC
MS search engine
database search
de novo sequencing
human leukocyte antigen
immunopeptidomics
major histocompatibility complex
peptide sequence annotation
peptide spectrum match
Journal
Molecular & cellular proteomics : MCP
ISSN: 1535-9484
Titre abrégé: Mol Cell Proteomics
Pays: United States
ID NLM: 101125647
Informations de publication
Date de publication:
2021
2021
Historique:
received:
18
12
2020
revised:
09
07
2021
accepted:
12
07
2021
pubmed:
26
7
2021
medline:
25
3
2022
entrez:
25
7
2021
Statut:
ppublish
Résumé
Standardization of immunopeptidomics experiments across laboratories is a pressing issue within the field, and currently a variety of different methods for sample preparation and data analysis tools are applied. Here, we compared different software packages to interrogate immunopeptidomics datasets and found that Peaks reproducibly reports substantially more peptide sequences (~30-70%) compared with Maxquant, Comet, and MS-GF+ at a global false discovery rate (FDR) of <1%. We noted that these differences are driven by search space and spectral ranking. Furthermore, we observed differences in the proportion of peptides binding the human leukocyte antigen (HLA) alleles present in the samples, indicating that sequence-related differences affected the performance of each tested engine. Utilizing data from single HLA allele expressing cell lines, we observed significant differences in amino acid frequency among the peptides reported, with a broadly higher representation of hydrophobic amino acids L, I, P, and V reported by Peaks. We validated these results using data generated with a synthetic library of 2000 HLA-associated peptides from four common HLA alleles with distinct anchor residues. Our investigation highlights that search engines create a bias in peptide sequence depth and peptide amino acid composition, and resulting data should be interpreted with caution.
Identifiants
pubmed: 34303857
pii: S1535-9476(21)00096-7
doi: 10.1016/j.mcpro.2021.100124
pmc: PMC8724928
pii:
doi:
Substances chimiques
Histocompatibility Antigens Class I
0
Peptide Library
0
Peptides
0
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
100124Subventions
Organisme : Cancer Research UK
ID : C328/A21998
Pays : United Kingdom
Organisme : Cancer Research UK
ID : C6078/A28736
Pays : United Kingdom
Informations de copyright
Copyright © 2021 The Authors. Published by Elsevier Inc. All rights reserved.
Déclaration de conflit d'intérêts
Conflict of interest N. T. is directing immunopeptidomics research at Enara Bio part-time and serves on the Scientific Advisory Boards of Enara Bio and T-Cypher Bio. N. T. is consultant to Hoffman-La Roche and Grey Wolf Therapeutics. All other authors declare no conflict of interest.
Références
Mol Cell Proteomics. 2012 Apr;11(4):M111.010587
pubmed: 22186715
J Am Soc Mass Spectrom. 2016 Nov;27(11):1719-1727
pubmed: 27572102
Nat Commun. 2014 Oct 31;5:5277
pubmed: 25358478
J Mass Spectrom. 2000 Dec;35(12):1399-406
pubmed: 11180630
Proteomics. 2012 Apr;12(8):1111-21
pubmed: 22577012
Nat Methods. 2007 Nov;4(11):923-5
pubmed: 17952086
Immunity. 2017 Feb 21;46(2):315-326
pubmed: 28228285
Mass Spectrom Rev. 2009 Jan-Feb;28(1):177-87
pubmed: 18680189
J Proteome Res. 2009 Jul;8(7):3737-45
pubmed: 19385687
Nature. 1991 May 23;351(6324):290-6
pubmed: 1709722
J Proteome Res. 2019 Nov 1;18(11):3876-3884
pubmed: 31589052
Nature. 2016 Sep 14;537(7620):347-55
pubmed: 27629641
Nat Commun. 2016 Jan 05;7:10238
pubmed: 26728094
Mass Spectrom Rev. 2005 Jul-Aug;24(4):508-48
pubmed: 15389847
Front Immunol. 2018 Jul 25;9:1716
pubmed: 30090105
Nat Rev Immunol. 2011 Nov 11;11(12):823-36
pubmed: 22076556
Anal Chem. 2009 Nov 15;81(22):9522-30
pubmed: 19848410
J Immunol. 2017 Oct 15;199(8):2639-2651
pubmed: 28904123
Nat Commun. 2021 Jun 7;12(1):3346
pubmed: 34099720
Proteomics. 2020 Nov;20(21-22):e1900334
pubmed: 32864883
Nat Methods. 2017 Mar;14(3):259-262
pubmed: 28135259
J Immunol. 2018 Dec 15;201(12):3705-3716
pubmed: 30429286
J Immunol. 2016 Sep 15;197(6):2492-9
pubmed: 27511729
Anal Chem. 2017 Dec 5;89(23):12690-12697
pubmed: 29125736
Proteomics. 2018 Jun;18(12):e1700464
pubmed: 29377634
Proteomics. 2016 Sep;16(18):2461-9
pubmed: 27503675
Proc Natl Acad Sci U S A. 2017 Nov 14;114(46):E9942-E9951
pubmed: 29093164
J Proteome Res. 2011 Apr 1;10(4):1794-805
pubmed: 21254760
Nat Protoc. 2016 Dec;11(12):2301-2319
pubmed: 27809316
Mol Cell Proteomics. 2005 Oct;4(10):1419-40
pubmed: 16009968
J Immunol. 2017 Nov 1;199(9):3360-3368
pubmed: 28978689
Proteomics. 2017 Oct;17(19):
pubmed: 28834231
Nat Methods. 2019 Jun;16(6):509-518
pubmed: 31133760
Proc Natl Acad Sci U S A. 2014 Mar 25;111(12):4507-12
pubmed: 24616531
Proteomics. 2013 Jan;13(1):22-4
pubmed: 23148064
BMC Genomics. 2016 Dec 22;17(Suppl 13):1031
pubmed: 28155652
Nat Commun. 2016 Nov 21;7:13404
pubmed: 27869121
Proteomics. 2019 Feb;19(4):e1800357
pubmed: 30578603
BMC Bioinformatics. 2007 Jun 19;8:211
pubmed: 17578581
Nat Commun. 2020 Mar 10;11(1):1293
pubmed: 32157095