BepFAMN: A Method for Linear B-Cell Epitope Predictions Based on Fuzzy-ARTMAP Artificial Neural Network.
diagnosis
epitope mapping
hybrid approach
in silico prediction
online training
Journal
Sensors (Basel, Switzerland)
ISSN: 1424-8220
Titre abrégé: Sensors (Basel)
Pays: Switzerland
ID NLM: 101204366
Informations de publication
Date de publication:
26 May 2022
26 May 2022
Historique:
received:
04
05
2022
revised:
22
05
2022
accepted:
24
05
2022
entrez:
10
6
2022
pubmed:
11
6
2022
medline:
14
6
2022
Statut:
epublish
Résumé
The public health system is extremely dependent on the use of vaccines to immunize the population from a series of infectious and dangerous diseases, preventing the system from collapsing and millions of people dying every year. However, to develop these vaccines and effectively monitor these diseases, it is necessary to use accurate diagnostic methods capable of identifying highly immunogenic regions within a given pathogenic protein. Existing experimental methods are expensive, time-consuming, and require arduous laboratory work, as they require the screening of a large number of potential candidate epitopes, making the methods extremely laborious, especially for application to larger microorganisms. In the last decades, researchers have developed in silico prediction methods, based on machine learning, to identify these markers, to drastically reduce the list of potential candidate epitopes for experimental tests, and, consequently, to reduce the laborious task associated with their mapping. Despite these efforts, the tools and methods still have low accuracy, slow diagnosis, and offline training. Thus, we develop a method to predict B-cell linear epitopes which are based on a Fuzzy-ARTMAP neural network architecture, called BepFAMN (B Epitope Prediction Fuzzy ARTMAP Artificial Neural Network). This was trained using a linear averaging scheme on 15 properties that include an amino acid ratio scale and a set of 14 physicochemical scales. The database used was obtained from the IEDB website, from which the amino acid sequences with the annotations of their positive and negative epitopes were taken. To train and validate the knowledge models, five-fold cross-validation and competition techniques were used. The BepiPred-2.0 database, an independent database, was used for the tests. In our experiment, the validation dataset reached sensitivity = 91.50%, specificity = 91.49%, accuracy = 91.49%, MCC = 0.83, and an area under the curve (AUC) ROC of approximately 0.9289. The result in the testing dataset achieves a significant improvement, with sensitivity = 81.87%, specificity = 74.75%, accuracy = 78.27%, MCC = 0.56, and AOC = 0.7831. These achieved values demonstrate that BepFAMN outperforms all other linear B-cell epitope prediction tools currently used. In addition, the architecture provides mechanisms for online training, which allow the user to find a new B-cell linear epitope, and to improve the model without need to re-train itself with the whole dataset. This fact contributes to a considerable reduction in the number of potential linear epitopes to be experimentally validated, reducing laboratory time and accelerating the development of diagnostic tests, vaccines, and immunotherapeutic approaches.
Identifiants
pubmed: 35684648
pii: s22114027
doi: 10.3390/s22114027
pmc: PMC9185646
pii:
doi:
Substances chimiques
Epitopes, B-Lymphocyte
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Références
Am J Phys Anthropol. 2003;Suppl 37:14-46
pubmed: 14666532
Curr Top Med Chem. 2019;19(2):105-115
pubmed: 30499399
PLoS One. 2009;4(3):e4920
pubmed: 19290060
J Biomed Inform. 2005 Oct;38(5):404-15
pubmed: 16198999
Mol Immunol. 2013 Jan;53(1-2):24-34
pubmed: 22784991
Nucleic Acids Res. 2019 Jan 8;47(D1):D339-D343
pubmed: 30357391
Proc Natl Acad Sci U S A. 2014 Aug 26;111(34):12288-93
pubmed: 25136130
Cell Mol Life Sci. 2016 Dec;73(23):4433-4448
pubmed: 27392606
Immunome Res. 2005 Oct 06;1(1):4
pubmed: 16305757
Bioinformatics. 2006 Jul 1;22(13):1658-9
pubmed: 16731699
Proteins. 2006 Oct 1;65(1):40-8
pubmed: 16894596
N Engl J Med. 2013 Nov 28;369(22):2152-8
pubmed: 24283231
J Mol Recognit. 2008 Jul-Aug;21(4):243-55
pubmed: 18496882
PLoS One. 2012;7(9):e45152
pubmed: 22984622
BMC Genomics. 2005 May 29;6:79
pubmed: 15921533
Bioinformatics. 2021 May 1;37(4):448-455
pubmed: 32915967
Nucleic Acids Res. 2010 Jan;38(Database issue):D854-62
pubmed: 19906713
Mol Ther. 2000 Jan;1(1):18-30
pubmed: 10933908
J Am Chem Soc. 2001 Jun 27;123(25):6108-17
pubmed: 11414845
Neural Comput. 1997 Nov 15;9(8):1735-80
pubmed: 9377276
Immunome Res. 2008 Jan 07;4:1
pubmed: 18179690
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D154-9
pubmed: 15608167
Immunome Res. 2010 Nov 03;6 Suppl 2:S2
pubmed: 21067544
Nucleic Acids Res. 2017 Jul 3;45(W1):W24-W29
pubmed: 28472356
Nucleic Acids Res. 2000 Jan 1;28(1):45-8
pubmed: 10592178
Neural Netw. 2013 Jan;37:1-47
pubmed: 23149242
BMC Bioinformatics. 2011 Jun 21;12:251
pubmed: 21693021
Amino Acids. 2007 Sep;33(3):423-8
pubmed: 17252308
Methods Mol Biol. 2009;524:3-20
pubmed: 19377933
BMC Genomics. 2010 Dec 02;11 Suppl 4:S21
pubmed: 21143805
Immunol Lett. 1993 Apr;36(1):83-99
pubmed: 7688347
BMC Struct Biol. 2007 Oct 02;7:64
pubmed: 17910770
BMC Bioinformatics. 2013;14 Suppl 2:S10
pubmed: 23484214