Systematic Characterization of Lysine Post-translational Modification Sites Using MUscADEL.

Bioinformatics Deep learning Long short-term memory Lysine Machine learning Post-translational modification

Journal

Methods in molecular biology (Clifton, N.J.)
ISSN: 1940-6029
Titre abrégé: Methods Mol Biol
Pays: United States
ID NLM: 9214969

Informations de publication

Date de publication:
2022
Historique:
entrez: 13 6 2022
pubmed: 14 6 2022
medline: 16 6 2022
Statut: ppublish

Résumé

Among various types of protein post-translational modifications (PTMs), lysine PTMs play an important role in regulating a wide range of functions and biological processes. Due to the generation and accumulation of enormous amount of protein sequence data by ongoing whole-genome sequencing projects, systematic identification of different types of lysine PTM substrates and their specific PTM sites in the entire proteome is increasingly important and has therefore received much attention. Accordingly, a variety of computational methods for lysine PTM identification have been developed based on the combination of various handcrafted sequence features and machine-learning techniques. In this chapter, we first briefly review existing computational methods for lysine PTM identification and then introduce a recently developed deep learning-based method, termed MUscADEL (Multiple Scalable Accurate Deep Learner for lysine PTMs). Specifically, MUscADEL employs bidirectional long short-term memory (BiLSTM) recurrent neural networks and is capable of predicting eight major types of lysine PTMs in both the human and mouse proteomes. The web server of MUscADEL is publicly available at http://muscadel.erc.monash.edu/ for the research community to use.

Identifiants

pubmed: 35696083
doi: 10.1007/978-1-0716-2317-6_11
doi:

Substances chimiques

Proteome 0
Lysine K3Z4F929H6

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

205-219

Informations de copyright

© 2022. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.

Références

Hornbeck PV, Zhang B, Murray B et al (2015) PhosphoSitePlus, 2014: mutations, PTMs and recalibrations. Nucleic Acids Res 43:D512–D520
pubmed: 25514926 doi: 10.1093/nar/gku1267
Xu H, Zhou J, Lin S et al (2017) PLMD: an updated data resource of protein lysine modifications. J Genet Genomics 44:243–250
pubmed: 28529077 doi: 10.1016/j.jgg.2017.03.007
Das C, Kundu TK (2005) Transcriptional regulation by the acetylation of nonhistone proteins in humans—a new target for therapeutics. IUBMB Life 57:137–149
pubmed: 16036576 doi: 10.1080/15216540500090629
Glozak MA, Sengupta N, Zhang X et al (2005) Acetylation and deacetylation of non-histone proteins. Gene 363:15–23
pubmed: 16289629 doi: 10.1016/j.gene.2005.09.010
Kim SC, Sprung R, Chen Y et al (2006) Substrate and functional diversity of lysine acetylation revealed by a proteomics survey. Mol Cell 23:607–618
pubmed: 16916647 doi: 10.1016/j.molcel.2006.06.026
Polevoda B, Sherman F (2002) The diversity of acetylated proteins. Genome Biol 3:reviews0006
pubmed: 12049668 pmcid: 139359 doi: 10.1186/gb-2002-3-5-reviews0006
Spange S, Wagner T, Heinzel T et al (2009) Acetylation of non-histone proteins modulates cellular signalling at multiple levels. Int J Biochem Cell Biol 41:185–198
pubmed: 18804549 doi: 10.1016/j.biocel.2008.08.027
Zhao S, Xu W, Jiang W et al (2010) Regulation of cellular metabolism by protein lysine acetylation. Science 327:1000–1004
pubmed: 20167786 pmcid: 3232675 doi: 10.1126/science.1179689
Munch G, Gerlach M, Sian J et al (1998) Advanced glycation end products in neurodegeneration: more than early markers of oxidative stress? Ann Neurol 44:S85–S88
pubmed: 9749578 doi: 10.1002/ana.410440713
Nicolls MR (2004) The clinical and biological relationship between Type II diabetes mellitus and Alzheimer’s disease. Curr Alzheimer Res 1:47–54
pubmed: 15975085 doi: 10.2174/1567205043480555
Chau V, Tobias JW, Bachmair A et al (1989) A multiubiquitin chain is confined to specific lysine in a targeted short-lived protein. Science 243:1576–1583
pubmed: 2538923 doi: 10.1126/science.2538923
Pickart CM (2001) Ubiquitin enters the new millennium. Mol Cell 8:499–504
pubmed: 11583613 doi: 10.1016/S1097-2765(01)00347-1
Welchman RL, Gordon C, Mayer RJ (2005) Ubiquitin and ubiquitin-like proteins as multifunctional signals. Nat Rev Mol Cell Biol 6:599–609
pubmed: 16064136 doi: 10.1038/nrm1700
Chen ZJ, Sun LJ (2009) Nonproteolytic functions of ubiquitin in cell signaling. Mol Cell 33:275–286
pubmed: 19217402 doi: 10.1016/j.molcel.2009.01.014
Hicke L (2001) Protein regulation by monoubiquitin. Nat Rev Mol Cell Biol 2:195–201
pubmed: 11265249 doi: 10.1038/35056583
Longo VD, Kennedy BK (2006) Sirtuins in aging and age-related disease. Cell 126:257–268
pubmed: 16873059 doi: 10.1016/j.cell.2006.07.002
Mastronardi FG, Wood DD, Mei J et al (2006) Increased citrullination of histone H3 in multiple sclerosis brain and animal models of demyelination: a role for tumor necrosis factor-induced peptidylarginine deiminase 4 translocation. J Neurosci 26:11387–11396
pubmed: 17079667 pmcid: 6674531 doi: 10.1523/JNEUROSCI.3349-06.2006
Suzuki A, Yamada R, Yamamoto K (2007) Citrullination by peptidylarginine deiminase in rheumatoid arthritis. Ann N Y Acad Sci 1108:323–339
pubmed: 17893996 doi: 10.1196/annals.1422.034
Wang Z, Tang WH, Cho L et al (2009) Targeted metabolomic evaluation of arginine methylation and cardiovascular risks: potential mechanisms beyond nitric oxide synthase inhibition. Arterioscler Thromb Vasc Biol 29:1383–1391
pubmed: 19542023 pmcid: 2755199 doi: 10.1161/ATVBAHA.109.185645
Peng C, Lu Z, Xie Z et al (2011) The first identification of lysine malonylation substrates and its regulatory enzyme. Mol Cell Proteomics 10:M111:012658
pubmed: 21908771 pmcid: 3237090 doi: 10.1074/mcp.M111.012658
Xie Z, Dai J, Dai L et al (2012) Lysine succinylation and lysine malonylation in histones. Mol Cell Proteomics 11:100–107
pubmed: 22389435 pmcid: 3418837 doi: 10.1074/mcp.M111.015875
Zhang Z, Tan M, Xie Z et al (2011) Identification of lysine succinylation as a new post-translational modification. Nat Chem Biol 7:58–63
pubmed: 21151122 doi: 10.1038/nchembio.495
Caron C, Boyault C, Khochbin S (2005) Regulatory cross-talk between lysine acetylation and ubiquitination: role in the control of protein stability. BioEssays 27:408–415
pubmed: 15770681 doi: 10.1002/bies.20210
Hunter T, Sun H (2008) Crosstalk between the SUMO and ubiquitin pathways. Ernst Schering Found Symp Proc:1–16
Nie Q, Gong XD, Liu M et al (2017) Effects of crosstalks between sumoylation and phosphorylation in normal cellular physiology and human diseases. Curr Mol Med 16:906–913
pubmed: 28017137 doi: 10.2174/1566524016666161223105555
Chen Z, Liu X, Li F et al (2019) Large-scale comparative assessment of computational predictors for lysine post-translational modification sites. Brief Bioinform 20:2267–2290
pubmed: 30285084 doi: 10.1093/bib/bby089
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9:1735–1780
pubmed: 9377276 doi: 10.1162/neco.1997.9.8.1735
Schuster M, Paliwal KK (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process 45:2673–2681
doi: 10.1109/78.650093
Li Y, Wang M, Wang H et al (2014) Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features. Sci Rep 4:5765
pubmed: 25042424 pmcid: 4104576 doi: 10.1038/srep05765
Johansen MB, Kiemer L, Brunak S (2006) Analysis and prediction of mammalian protein glycation. Glycobiology 16:844–853
pubmed: 16762979 doi: 10.1093/glycob/cwl009
Wang LN, Shi SP, Xu HD et al (2017) Computational prediction of species-specific malonylation sites via enhanced characteristic strategy. Bioinformatics 33:1457–1463
pubmed: 28025199
Lu L, Shi XH, Li SJ et al (2010) Protein sumoylation sites prediction based on two-stage feature selection. Mol Divers 14:81–86
pubmed: 19472067 doi: 10.1007/s11030-009-9149-5
Cai Y, Huang T, Hu L et al (2012) Prediction of lysine ubiquitination with mRMR feature selection and analysis. Amino Acids 42:1387–1395
pubmed: 21267749 doi: 10.1007/s00726-011-0835-0
Walsh I, Di Domenico T, Tosatto SC (2014) RUBI: rapid proteomic-scale prediction of lysine ubiquitination and factors influencing predictor performance. Amino Acids 46:853–862
pubmed: 24363213 doi: 10.1007/s00726-013-1645-3
Chen Z, Zhou Y, Song J et al (2013) hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties. Biochim Biophys Acta 1834:1461–1467
pubmed: 23603789 doi: 10.1016/j.bbapap.2013.04.006
Wuyun Q, Zheng W, Zhang Y et al (2016) Improved species-specific lysine acetylation site prediction based on a large variety of features set. PLoS One 11:e0155370
pubmed: 27183223 pmcid: 4868276 doi: 10.1371/journal.pone.0155370
Suo SB, Qiu JD, Shi SP et al (2012) Position-specific analysis and prediction for protein lysine acetylation based on multiple features. PLoS One 7:e49108
pubmed: 23173045 pmcid: 3500252 doi: 10.1371/journal.pone.0049108
Deng W, Wang Y, Ma L et al (2017) Computational prediction of methylation types of covalently modified lysine and arginine residues in proteins. Brief Bioinform 18:647–658
pubmed: 27241573
Xue Y, Zhou F, Fu C et al (2006) SUMOsp: a web server for sumoylation site prediction. Nucleic Acids Res 34:W254–W257
pubmed: 16845005 pmcid: 1538802 doi: 10.1093/nar/gkl207
Wang L, Du Y, Lu M et al (2012) ASEB: a web server for KAT-specific acetylation site prediction. Nucleic Acids Res 40:W376–W379
pubmed: 22600735 pmcid: 3394258 doi: 10.1093/nar/gks437
Li A, Xue Y, Jin C et al (2006) Prediction of nepsilon-acetylation on internal lysines implemented in Bayesian discriminant method. Biochem Biophys Res Commun 350:818–824
pubmed: 17045240 pmcid: 2093955 doi: 10.1016/j.bbrc.2006.08.199
Hou T, Zheng G, Zhang P et al (2014) LAceP: lysine acetylation site prediction using logistic regression classifiers. PLoS One 9:e89575
pubmed: 24586884 pmcid: 3930742 doi: 10.1371/journal.pone.0089575
Li S, Li H, Li M et al (2009) Improved prediction of lysine acetylation by support vector machines. Protein Pept Lett 16:977–983
pubmed: 19689425 doi: 10.2174/092986609788923338
Liu Y, Gu W, Zhang W et al (2015) Predict and analyze protein glycation sites with the mRMR and IFS methods. Biomed Res Int 2015:561547
pubmed: 25961025 pmcid: 4413511
Wen PP, Shi SP, Xu HD et al (2016) Accurate in silico prediction of species-specific methylation sites based on information gain feature optimization. Bioinformatics 32:3107–3115
pubmed: 27354692 doi: 10.1093/bioinformatics/btw377
Yavuz AS, Sezerman OU (2014) Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder. BMC Genomics 15(Suppl 9):S18
pubmed: 25521314 pmcid: 4290605 doi: 10.1186/1471-2164-15-S9-S18
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20:273–297
Breiman L (2001) Random forests. Mach Learn 45:5–32
doi: 10.1023/A:1010933404324
McCulloch WS, Pitts W (1943) A logical calculus of the ideas immanent in nervous activity. Bull Math Biol 5:115–133
Altman NS (1992) An introduction to kernel and nearest-neighbor nonparametric regression. Am Stat 46:175–185
Freedman AD (2006) Statistical models: theory and practice. Technometrics 48:315–315
UniProt Consortium T (2018) UniProt: the universal protein knowledgebase. Nucleic Acids Res 46:2699
pubmed: 29425356 pmcid: 5861450 doi: 10.1093/nar/gky092
Li W, Godzik A (2006) Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22:1658–1659
pubmed: 16731699 doi: 10.1093/bioinformatics/btl158
Chen Z, He N, Huang Y et al (2018) Integration of a deep learning classifier with a random forest approach for predicting malonylation sites. Genomics Proteomics Bioinformatics 16:451–459
pubmed: 30639696 doi: 10.1016/j.gpb.2018.08.004
Maas AL, Daly RE, Pham PT et al (2011) Learning word vectors for sentiment analysis. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies - volume 1. Association for Computational Linguistics, Portland, OR, pp 142–150
Greff K, Srivastava RK, Koutník J et al (2016) LSTM: a search space odyssey. IEEE Trans Neural Netw Learn Syst 28:2222–2232
pubmed: 27411231 doi: 10.1109/TNNLS.2016.2582924
Kingma D, Ba J (2014) Adam: a method for stochastic optimization. International conference on learning representations. arXiv:1412.6980

Auteurs

Zhen Chen (Z)

Key Laboratory of Rice Biology in Henan Province, Henan Agricultural University, Zhengzhou, China.

Xuhan Liu (X)

Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden, The Netherlands.

Fuyi Li (F)

Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, VIC, Australia.

Chen Li (C)

Biomedicine Discovery Institute, Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC, Australia.

Tatiana Marquez-Lago (T)

Department of Genetics, School of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA.
Department of Cell, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA.

André Leier (A)

Department of Genetics, School of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA.
Department of Cell, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA.

Geoffrey I Webb (GI)

Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC, Australia.

Dakang Xu (D)

Faculty of Medical Laboratory Science, Ruijin Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China.
Department of Molecular and Translational Science, Faculty of Medicine, Hudson Institute of Medical Research, Monash University, Melbourne, VIC, Australia.

Tatsuya Akutsu (T)

Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan. takutsu@kuicr.kyoto-u.ac.jp.

Jiangning Song (J)

Biomedicine Discovery Institute, Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC, Australia. jiangning.song@monash.edu.
Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC, Australia. jiangning.song@monash.edu.

Articles similaires

Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell
Animals TOR Serine-Threonine Kinases Colorectal Neoplasms Colitis Mice
Animals Tail Swine Behavior, Animal Animal Husbandry

Classifications MeSH