Benchmarking predictions of MHC class I restricted T cell epitopes in a comprehensively studied model system.
Algorithms
Alleles
Allergy and Immunology
/ standards
Animals
Area Under Curve
Automation
Epitopes, T-Lymphocyte
/ chemistry
Histocompatibility Antigens Class I
/ chemistry
Immune System
Ligands
Machine Learning
Mice
Mice, Inbred C57BL
Neural Networks, Computer
Peptides
/ chemistry
Protein Binding
Proteome
ROC Curve
Vaccinia virus
Journal
PLoS computational biology
ISSN: 1553-7358
Titre abrégé: PLoS Comput Biol
Pays: United States
ID NLM: 101238922
Informations de publication
Date de publication:
05 2020
05 2020
Historique:
received:
03
05
2019
accepted:
02
03
2020
revised:
05
06
2020
pubmed:
27
5
2020
medline:
8
8
2020
entrez:
27
5
2020
Statut:
epublish
Résumé
T cell epitope candidates are commonly identified using computational prediction tools in order to enable applications such as vaccine design, cancer neoantigen identification, development of diagnostics and removal of unwanted immune responses against protein therapeutics. Most T cell epitope prediction tools are based on machine learning algorithms trained on MHC binding or naturally processed MHC ligand elution data. The ability of currently available tools to predict T cell epitopes has not been comprehensively evaluated. In this study, we used a recently published dataset that systematically defined T cell epitopes recognized in vaccinia virus (VACV) infected C57BL/6 mice (expressing H-2Db and H-2Kb), considering both peptides predicted to bind MHC or experimentally eluted from infected cells, making this the most comprehensive dataset of T cell epitopes mapped in a complex pathogen. We evaluated the performance of all currently publicly available computational T cell epitope prediction tools to identify these major epitopes from all peptides encoded in the VACV proteome. We found that all methods were able to improve epitope identification above random, with the best performance achieved by neural network-based predictions trained on both MHC binding and MHC ligand elution data (NetMHCPan-4.0 and MHCFlurry). Impressively, these methods were able to capture more than half of the major epitopes in the top N = 277 predictions within the N = 767,788 predictions made for distinct peptides of relevant lengths that can theoretically be encoded in the VACV proteome. These performance metrics provide guidance for immunologists as to which prediction methods to use, and what success rates are possible for epitope predictions when considering a highly controlled system of administered immunizations to inbred mice. In addition, this benchmark was implemented in an open and easy to reproduce format, providing developers with a framework for future comparisons against new tools.
Identifiants
pubmed: 32453790
doi: 10.1371/journal.pcbi.1007757
pii: PCOMPBIOL-D-19-00709
pmc: PMC7274474
doi:
Substances chimiques
Epitopes, T-Lymphocyte
0
Histocompatibility Antigens Class I
0
Ligands
0
Peptides
0
Proteome
0
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
e1007757Déclaration de conflit d'intérêts
The authors have declared that no competing interests exist.
Références
Hum Immunol. 2002 Sep;63(9):701-9
pubmed: 12175724
Proc Natl Acad Sci U S A. 1989 May;86(9):3296-300
pubmed: 2717617
J Immunol. 1997 Mar 15;158(6):2723-30
pubmed: 9058806
BMC Bioinformatics. 2005 May 31;6:132
pubmed: 15927070
J Exp Med. 2005 Jan 3;201(1):95-104
pubmed: 15623576
PLoS Comput Biol. 2018 Nov 8;14(11):e1006457
pubmed: 30408041
J Immunol. 2016 Feb 15;196(4):1480-7
pubmed: 26783342
Bioinformatics. 2016 Feb 15;32(4):511-7
pubmed: 26515819
PLoS Comput Biol. 2006 Jun 9;2(6):e65
pubmed: 16789818
J Immunol. 1994 Jan 1;152(1):163-75
pubmed: 8254189
J Mol Biol. 1995 Jun 2;249(2):244-50
pubmed: 7540211
Immunogenetics. 1999 Nov;50(3-4):213-9
pubmed: 10602881
Nucleic Acids Res. 2011 Jul;39(Web Server issue):W254-60
pubmed: 21666259
J Immunol. 2017 Nov 1;199(9):3360-3368
pubmed: 28978689
J Virol. 2012 Jul;86(13):7298-309
pubmed: 22532670
Nat Biotechnol. 2006 Jul;24(7):817-9
pubmed: 16767078
Immunogenetics. 2005 Jun;57(5):304-14
pubmed: 15868141
Nucleic Acids Res. 2015 Jan;43(Database issue):D405-12
pubmed: 25300482
Bioinformatics. 2003 May 22;19(8):1009-14
pubmed: 12761064
Proc Natl Acad Sci U S A. 2019 Feb 19;116(8):3112-3117
pubmed: 30718433
Cell Syst. 2018 Jul 25;7(1):129-132.e4
pubmed: 29960884
Genome Med. 2016 Mar 30;8(1):33
pubmed: 27029192
BMC Bioinformatics. 2009 Nov 30;10:394
pubmed: 19948066