Encodings and models for antimicrobial peptide classification for multi-resistant pathogens.

Antimicrobial peptides Encodings Machine learning

Journal

BioData mining
ISSN: 1756-0381
Titre abrégé: BioData Min
Pays: England
ID NLM: 101319161

Informations de publication

Date de publication:
2019
Historique:
received: 21 12 2018
accepted: 24 02 2019
entrez: 15 3 2019
pubmed: 15 3 2019
medline: 15 3 2019
Statut: epublish

Résumé

Antimicrobial peptides (AMPs) are part of the inherent immune system. In fact, they occur in almost all organisms including, e.g., plants, animals, and humans. Remarkably, they show effectivity also against multi-resistant pathogens with a high selectivity. This is especially crucial in times, where society is faced with the major threat of an ever-increasing amount of antibiotic resistant microbes. In addition, AMPs can also exhibit antitumor and antiviral effects, thus a variety of scientific studies dealt with the prediction of active peptides in recent years. Due to their potential, even the pharmaceutical industry is keen on discovering and developing novel AMPs. However, AMPs are difficult to verify in vitro, hence researchers conduct sequence similarity experiments against known, active peptides. Unfortunately, this approach is very time-consuming and limits potential candidates to sequences with a high similarity to known AMPs. Machine learning methods offer the opportunity to explore the huge space of sequence variations in a timely manner. These algorithms have, in principal, paved the way for an automated discovery of AMPs. However, machine learning models require a numerical input, thus an informative encoding is very important. Unfortunately, developing an appropriate encoding is a major challenge, which has not been entirely solved so far. For this reason, the development of novel amino acid encodings is established as a stand-alone research branch. The present review introduces state-of-the-art encodings of amino acids as well as their properties in sequence and structure based aggregation. Moreover, albeit a well-chosen encoding is essential, performant classifiers are required, which is reflected by a tendency towards specifically designed models in the literature. Furthermore, we introduce these models with a particular focus on encodings derived from support vector machines and deep learning approaches. Albeit a strong focus has been set on AMP predictions, not all of the mentioned encodings have been elaborated as part of antimicrobial research studies, but rather as general protein or peptide representations.

Identifiants

pubmed: 30867681
doi: 10.1186/s13040-019-0196-x
pii: 196
pmc: PMC6399931
doi:

Types de publication

Journal Article Review

Langues

eng

Pagination

7

Déclaration de conflit d'intérêts

Not applicableNot applicableThe authors declare that they have no competing interests.Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Références

PLoS Comput Biol. 2013;9(3):e1002977
pubmed: 23555214
ACS Chem Biol. 2009 Jan 16;4(1):65-74
pubmed: 19055425
Bioinformatics. 2015 Nov 1;31(21):3429-36
pubmed: 26130574
Amino Acids. 2010 Jun;39(1):101-10
pubmed: 19908123
Proc Natl Acad Sci U S A. 1992 Nov 15;89(22):10915-9
pubmed: 1438297
Bioinformatics. 2018 Aug 1;34(15):2575-2580
pubmed: 29554213
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D590-2
pubmed: 14681488
Hum Mol Genet. 2018 May 1;27(R1):R63-R71
pubmed: 29648622
PLoS Comput Biol. 2015 Apr 07;11(4):e1004074
pubmed: 25849257
BMC Bioinformatics. 2013 Mar 05;14:82
pubmed: 23497081
J Cheminform. 2018 Mar 20;10(1):16
pubmed: 29556758
Anal Biochem. 2013 May 15;436(2):168-77
pubmed: 23395824
J Med Chem. 1998 Jul 2;41(14):2481-91
pubmed: 9651153
PLoS One. 2011 Feb 09;6(2):e16968
pubmed: 21347392
Biochemistry. 1992 Aug 18;31(32):7211-8
pubmed: 1510913
J Microbiol Methods. 2009 Jul;78(1):94-6
pubmed: 19348863
Bioconjug Chem. 2018 Dec 19;29(12):4020-4029
pubmed: 30380293
Proc Natl Acad Sci U S A. 2005 May 3;102(18):6395-400
pubmed: 15851683
Nucleic Acids Res. 1990 Apr 25;18(8):2163-70
pubmed: 2336393
Appl Environ Microbiol. 2013 May;79(10):3156-9
pubmed: 23455341
BioData Min. 2011 Nov 14;4:26
pubmed: 22082002
Bioinformatics. 2004 Mar 1;20(4):467-76
pubmed: 14990442
BioData Min. 2014 Aug 01;7:14
pubmed: 25120583
Pac Symp Biocomput. 2002;:564-75
pubmed: 11928508
IEEE/ACM Trans Comput Biol Bioinform. 2012;9(2):467-75
pubmed: 21860064
Curr Med Chem. 2014;21(20):2299-321
pubmed: 24533812
Sci Rep. 2018 Jan 26;8(1):1697
pubmed: 29374199
BMC Syst Biol. 2015;9 Suppl 1:S10
pubmed: 25708928
PLoS One. 2017 Apr 21;12(4):e0174862
pubmed: 28430779
Nucleic Acids Res. 2016 Jul 27;44(13):6503
pubmed: 27060142
BMC Bioinformatics. 2010 Oct 26;11 Suppl 8:S7
pubmed: 21034432
BioData Min. 2011 Jun 17;4:16
pubmed: 21682849
Nature. 2006 Oct 19;443(7113):867-9
pubmed: 17051220
Nature. 2015 May 28;521(7553):436-44
pubmed: 26017442
ACS Comb Sci. 2016 Aug 8;18(8):490-8
pubmed: 27280735
Bioinformatics. 2006 Nov 15;22(22):2753-60
pubmed: 16966363
Bioinformatics. 2017 Jul 01;33(13):1921-1929
pubmed: 28203715
J Mol Recognit. 2008 Jul-Aug;21(4):243-55
pubmed: 18496882
Amino Acids. 2008 Aug;35(2):345-53
pubmed: 18163182
Protein Sci. 2003 May;12(5):1007-17
pubmed: 12717023
Bioinformatics. 2017 Sep 01;33(17):2753-2755
pubmed: 28472272
Adv Protein Chem Struct Biol. 2018;112:359-384
pubmed: 29680241
Biopolymers. 1988 Mar;27(3):451-77
pubmed: 3359010
Mol Inform. 2017 Jan;36(1-2):
pubmed: 28124834
BMC Bioinformatics. 2010 Jan 20;11:37
pubmed: 20089140
Protein Sci. 2004 May;13(5):1402-6
pubmed: 15096640
Drug Discov Today. 2018 Aug;23(8):1538-1546
pubmed: 29750902
J Transl Med. 2018 Jul 3;16(1):181
pubmed: 29970096
Front Pharmacol. 2018 Aug 28;9:954
pubmed: 30210341
Bioinformatics. 2008 Oct 15;24(20):2370-5
pubmed: 18697770
J Mol Graph Model. 1997 Oct;15(5):279-89
pubmed: 9640559
Bioinformatics. 2012 Jan 1;28(1):130-1
pubmed: 22053077
Bioinformatics. 2018 Jul 15;34(14):2499-2502
pubmed: 29528364
Chem Biol Drug Des. 2006 Jul;68(1):48-57
pubmed: 16923026
Biotechnol Adv. 2017 May - Jun;35(3):337-349
pubmed: 28216008
Proteins. 2006 Aug 15;64(3):575-86
pubmed: 16763996
Proteins. 2007 Jun 1;67(4):834-52
pubmed: 17377990
PLoS Comput Biol. 2013;9(9):e1003212
pubmed: 24039565
J Mol Biol. 1982 May 5;157(1):105-32
pubmed: 7108955
Nucleic Acids Res. 2016 Jan 4;44(D1):D1087-93
pubmed: 26602694
Trends Biochem Sci. 1998 Nov;23(11):444-7
pubmed: 9852764
BMC Bioinformatics. 2018 Dec 6;19(1):469
pubmed: 30522443
Protein Pept Lett. 2010 Jan;17(1):137-45
pubmed: 20214637
Protein Sci. 2001 Jan;10(1):212-9
pubmed: 11266608
PLoS One. 2015 Mar 24;10(3):e0119490
pubmed: 25803302
Protein Sci. 2002 Nov;11(11):2606-21
pubmed: 12381844
Retrovirology. 2008 Dec 04;5:110
pubmed: 19055831
Biopolymers. 2005;80(6):775-86
pubmed: 15895431
PeerJ. 2018 May 4;6:e4750
pubmed: 29740518
Physiol Meas. 2019 Jan 15;40(1):015001
pubmed: 30523982
IEEE/ACM Trans Comput Biol Bioinform. 2012 Sep-Oct;9(5):1535-8
pubmed: 22732690
PLoS One. 2015 Nov 10;10(11):e0141287
pubmed: 26555596
IEEE/ACM Trans Comput Biol Bioinform. 2017 Mar-Apr;14(2):300-313
pubmed: 28368808
BMC Bioinformatics. 2006 Sep 26;7 Suppl 2:S2
pubmed: 17118141
Sci Rep. 2017 Feb 13;7:42362
pubmed: 28205576
Proc Natl Acad Sci U S A. 1995 Sep 12;92(19):8700-4
pubmed: 7568000
BMC Res Notes. 2011 Mar 31;4:94
pubmed: 21453485
Proc SIAM Int Conf Data Min. 2013;2013:342-349
pubmed: 24910813
Bioinformatics. 2015 Jun 1;31(11):1857-9
pubmed: 25619996
Oncotarget. 2016 Mar 29;7(13):16895-909
pubmed: 26942877
New Microbes New Infect. 2015 Apr 16;6:22-9
pubmed: 26029375
J Theor Biol. 1968 Nov;21(2):170-201
pubmed: 5700434
Adv Drug Deliv Rev. 2008 Mar 1;60(4-5):572-9
pubmed: 18045726
Mol Biosyst. 2017 Jun 27;13(7):1336-1344
pubmed: 28604872
Interface Focus. 2017 Dec 6;7(6):20160153
pubmed: 29147555
Nat Rev Drug Discov. 2011 Dec 16;11(1):37-51
pubmed: 22173434
Med Chem. 2013 Feb;9(1):32-44
pubmed: 22741801
Science. 2001 Oct 5;294(5540):93-6
pubmed: 11588250
J Med Chem. 2014 Jun 26;57(12):4977-5010
pubmed: 24351051
Nat Rev Immunol. 2016 May;16(5):321-34
pubmed: 27087664
Proteins. 2008 Apr;71(1):189-94
pubmed: 17932917
Bioinformatics. 2019 Jun 1;35(11):1820-1828
pubmed: 30346493
Peptides. 2012 Oct;37(2):207-15
pubmed: 22800692
Macromolecules. 1976 Nov-Dec;9(6):945-50
pubmed: 1004017
Future Med Chem. 2017 Sep;9(13):1529-1538
pubmed: 28791894
Nucleic Acids Res. 1997 Sep 1;25(17):3389-402
pubmed: 9254694
Mol Biosyst. 2009 May;5(5):483-96
pubmed: 19381363
Bioinformatics. 2018 Sep 15;34(18):3111-3117
pubmed: 29668844
PLoS Comput Biol. 2007 Mar 30;3(3):e58
pubmed: 17397254
Front Microbiol. 2013 Dec 09;4:353
pubmed: 24367355
Lancet Infect Dis. 2018 Jul;18(7):796-811
pubmed: 29779917
Front Cell Infect Microbiol. 2016 Dec 27;6:194
pubmed: 28083516
J Theor Biol. 2019 Jan 7;460:195-203
pubmed: 30312687
Sci Rep. 2016 Apr 14;6:24482
pubmed: 27075512
Protein Sci. 2005 Nov;14(11):2804-13
pubmed: 16251364
J Pept Sci. 2008 Jan;14(1):110-4
pubmed: 17847019
J Med Chem. 2007 Dec 27;50(26):6545-53
pubmed: 18062663
J Theor Biol. 2017 Apr 21;419:254-265
pubmed: 28216428
Nature. 2002 Jan 24;415(6870):389-95
pubmed: 11807545
Nucleic Acids Res. 2016 Jan 4;44(D1):D1094-7
pubmed: 26467475
J Chem Inf Model. 2009 Dec;49(12):2873-82
pubmed: 19947578
Mol Biosyst. 2014 Aug;10(8):2229-35
pubmed: 24931825
Proteins. 2009 Apr;75(1):75-88
pubmed: 18798568
Nucleic Acids Res. 2014 Jan;42(Database issue):D1154-8
pubmed: 24265220
Bioinformatics. 2005 Jun;21 Suppl 1:i359-68
pubmed: 15961479
J Theor Biol. 2018 Apr 14;443:125-137
pubmed: 29408627
Nucleic Acids Res. 2008 Jan;36(Database issue):D202-5
pubmed: 17998252
Proteins. 2001 May 15;43(3):246-55
pubmed: 11288174
Bioinformatics. 2017 Oct 01;33(19):3036-3042
pubmed: 28575181
Bioinformatics. 2018 Jul 1;34(13):i254-i262
pubmed: 29949966
Acta Biochim Pol. 2009;56(1):167-76
pubmed: 19294233
Bioinformatics. 2017 Sep 01;33(17):2756-2758
pubmed: 28903538
Nucleic Acids Res. 2012 Jul;40(Web Server issue):W199-204
pubmed: 22638580
Proteins. 2000 Feb 1;38(2):149-64
pubmed: 10656262
Biochem Biophys Res Commun. 2014 Oct 17;453(2):254-67
pubmed: 24878531
PLoS One. 2012;7(10):e45012
pubmed: 23056189
Bioinformatics. 2013 Apr 1;29(7):960-2
pubmed: 23426256
Cell. 2018 Apr 5;173(2):338-354.e15
pubmed: 29625051
Int J Antimicrob Agents. 2012 Apr;39(4):346-51
pubmed: 22325123
Chem Biol Drug Des. 2007 Aug;70(2):134-42
pubmed: 17683374
PLoS Comput Biol. 2010 Apr 15;6(4):e1000743
pubmed: 20419152

Auteurs

Sebastian Spänig (S)

Department of Bioinformatics, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, Marburg, Germany.

Dominik Heider (D)

Department of Bioinformatics, Faculty of Mathematics and Computer Science, Philipps-University of Marburg, Marburg, Germany.

Classifications MeSH