epitope1D: accurate taxonomy-aware B-cell linear epitope prediction.

B-cell epitopes immunoinformatics linear epitopes machine learning

Journal

Briefings in bioinformatics
ISSN: 1477-4054
Titre abrégé: Brief Bioinform
Pays: England
ID NLM: 100912837

Informations de publication

Date de publication:
19 05 2023
Historique:
received: 23 11 2022
revised: 30 01 2023
accepted: 07 03 2023
medline: 22 5 2023
pubmed: 12 4 2023
entrez: 11 4 2023
Statut: ppublish

Résumé

The ability to identify B-cell epitopes is an essential step in vaccine design, immunodiagnostic tests and antibody production. Several computational approaches have been proposed to identify, from an antigen protein or peptide sequence, which residues are more likely to be part of an epitope, but have limited performance on relatively homogeneous data sets and lack interpretability, limiting biological insights that could otherwise be obtained. To address these limitations, we have developed epitope1D, an explainable machine learning method capable of accurately identifying linear B-cell epitopes, leveraging two new descriptors: a graph-based signature representation of protein sequences, based on our well-established Cutoff Scanning Matrix algorithm and Organism Ontology information. Our model achieved Areas Under the ROC curve of up to 0.935 on cross-validation and blind tests, demonstrating robust performance. A comprehensive comparison to alternative methods using distinct benchmark data sets was also employed, with our model outperforming state-of-the-art tools. epitope1D represents not only a significant advance in predictive performance, but also allows biologically meaningful features to be combined and used for model interpretation. epitope1D has been made available as a user-friendly web server interface and application programming interface at https://biosig.lab.uq.edu.au/epitope1d/.

Identifiants

pubmed: 37039696
pii: 7111720
doi: 10.1093/bib/bbad114
pmc: PMC10199762
pii:
doi:

Substances chimiques

Epitopes, B-Lymphocyte 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© The Author(s) 2023. Published by Oxford University Press.

Références

Bioinformatics. 2012 Dec 1;28(23):3150-2
pubmed: 23060610
Immunome Res. 2006 Apr 24;2:2
pubmed: 16635264
Bioinformatics. 2013 Apr 1;29(7):855-61
pubmed: 23396119
Brief Bioinform. 2022 Jan 17;23(1):
pubmed: 34676398
BMC Genomics. 2005 May 29;6:79
pubmed: 15921533
Bioinformatics. 2021 May 1;37(4):448-455
pubmed: 32915967
Comp Immunol Microbiol Infect Dis. 2003 Oct;26(5-6):309-28
pubmed: 12818619
Vaccine. 2017 Jul 24;35(33):4064-4071
pubmed: 28647170
Nucleic Acids Res. 2017 Jul 3;45(W1):W24-W29
pubmed: 28472356
Proteins. 2009 Feb 15;74(3):727-43
pubmed: 18704933
Bioinformatics. 2021 Dec 7;37(23):4517-4525
pubmed: 34180989
Nucleic Acids Res. 2014 Jul;42(Web Server issue):W314-9
pubmed: 24829462
Biochemistry. 1986 Sep 23;25(19):5425-32
pubmed: 2430611
FEBS Lett. 1985 Sep 2;188(2):215-8
pubmed: 2411595
Lancet Glob Health. 2018 Dec;6(12):e1386-e1396
pubmed: 30342925
Adv Enzymol Relat Areas Mol Biol. 1978;47:45-148
pubmed: 364941
Cell Microbiol. 2011 Jul;13(7):934-42
pubmed: 21631691
Proteins. 2006 Oct 1;65(1):40-8
pubmed: 16894596
J Mol Recognit. 2008 Jul-Aug;21(4):243-55
pubmed: 18496882
BMC Genomics. 2020 Jan 2;21(1):6
pubmed: 31898477
BMC Genomics. 2011 Dec 22;12 Suppl 4:S12
pubmed: 22369665
Amino Acids. 2007 Sep;33(3):423-8
pubmed: 17252308
J Virol. 1985 Sep;55(3):836-9
pubmed: 2991600
Front Immunol. 2018 Jul 27;9:1695
pubmed: 30100904
Bioinformatics. 2021 Dec 11;37(24):4826-4834
pubmed: 34289025
Protein Sci. 2022 Dec;31(12):e4497
pubmed: 36366745
PLoS One. 2013 May 07;8(5):e62216
pubmed: 23667458
Nucleic Acids Res. 2019 Jan 8;47(D1):D339-D343
pubmed: 30357391
FEBS Lett. 1990 Dec 10;276(1-2):172-4
pubmed: 1702393
Bioinformatics. 2014 Feb 01;30(3):335-42
pubmed: 24281696

Auteurs

Bruna Moreira da Silva (BM)

Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.
Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.
School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria, Australia.

David B Ascher (DB)

Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.
Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.
The School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia.

Douglas E V Pires (DEV)

Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.
Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.
School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria, Australia.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Animals Hemiptera Insect Proteins Phylogeny Insecticides
1.00
Humans Magnetic Resonance Imaging Brain Infant, Newborn Infant, Premature
Humans Algorithms Software Artificial Intelligence Computer Simulation

Classifications MeSH