DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
15 02 2020
Historique:
received: 07 06 2019
revised: 13 08 2019
accepted: 25 09 2019
pubmed: 1 10 2019
medline: 17 9 2020
entrez: 1 10 2019
Statut: ppublish

Résumé

Proteases are enzymes that cleave target substrate proteins by catalyzing the hydrolysis of peptide bonds between specific amino acids. While the functional proteolysis regulated by proteases plays a central role in the 'life and death' cellular processes, many of the corresponding substrates and their cleavage sites were not found yet. Availability of accurate predictors of the substrates and cleavage sites would facilitate understanding of proteases' functions and physiological roles. Deep learning is a promising approach for the development of accurate predictors of substrate cleavage events. We propose DeepCleave, the first deep learning-based predictor of protease-specific substrates and cleavage sites. DeepCleave uses protein substrate sequence data as input and employs convolutional neural networks with transfer learning to train accurate predictive models. High predictive performance of our models stems from the use of high-quality cleavage site features extracted from the substrate sequences through the deep learning process, and the application of transfer learning, multiple kernels and attention layer in the design of the deep network. Empirical tests against several related state-of-the-art methods demonstrate that DeepCleave outperforms these methods in predicting caspase and matrix metalloprotease substrate-cleavage sites. The DeepCleave webserver and source code are freely available at http://deepcleave.erc.monash.edu/. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 31566664
pii: 5578482
doi: 10.1093/bioinformatics/btz721
pmc: PMC8215920
doi:

Substances chimiques

Metalloproteases EC 3.4.-
Caspases EC 3.4.22.-

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1057-1065

Subventions

Organisme : NIAID NIH HHS
ID : R01 AI111965
Pays : United States

Informations de copyright

© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Références

Bioinformatics. 2012 Dec 1;28(23):3150-2
pubmed: 23060610
Cell. 1994 Jun 3;77(5):761-71
pubmed: 7515772
Sci Rep. 2016 Oct 06;6:34595
pubmed: 27708373
Brief Bioinform. 2019 Mar 25;20(2):638-658
pubmed: 29897410
BMC Res Notes. 2015 Jul 29;8:322
pubmed: 26219353
J Biol Chem. 2005 Oct 21;280(42):35337-45
pubmed: 16123041
Proc Natl Acad Sci U S A. 2016 Apr 5;113(14):E2001-10
pubmed: 27006500
Nat Rev Mol Cell Biol. 2002 Jul;3(7):509-19
pubmed: 12094217
Bioinformatics. 2018 Feb 15;34(4):684-687
pubmed: 29069280
Genes Dev. 2016 Oct 1;30(19):2213-2225
pubmed: 27798844
J Biol Chem. 2003 Aug 8;278(32):30057-62
pubmed: 12766152
J Mol Biol. 1999 Sep 17;292(2):195-202
pubmed: 10493868
Nucleic Acids Res. 2018 Mar 16;46(5):2699
pubmed: 29425356
Cell. 2001 Dec 14;107(6):789-800
pubmed: 11747814
Bioinformatics. 2019 Aug 15;35(16):2766-2773
pubmed: 30601936
Proteomics. 2019 Jun;19(12):e1900019
pubmed: 30941889
BMC Bioinformatics. 2010 Jun 15;11:320
pubmed: 20546630
Bioinformatics. 2018 Jul 15;34(14):2499-2502
pubmed: 29528364
Oncogene. 2004 Jul 15;23(32):5487-95
pubmed: 15122322
Proteins. 2013 Apr;81(4):622-34
pubmed: 23180633
Brief Bioinform. 2019 Jul 19;20(4):1250-1268
pubmed: 29253082
Bioinformatics. 2010 Mar 15;26(6):752-60
pubmed: 20130033
Bioinformatics. 2018 Dec 15;34(24):4223-4231
pubmed: 29947803
Bioinformatics. 2019 Aug 15;35(16):2730-2737
pubmed: 30601980
Bioinformatics. 2017 Dec 15;33(24):3909-3916
pubmed: 29036382
Proteomics. 2014 Sep;14(17-18):2042-6
pubmed: 24995852
Naturwissenschaften. 1995 Jun;82(6):257-68
pubmed: 7643904
Bioinformatics. 2015 May 1;31(9):1411-9
pubmed: 25568279
J Biol Chem. 1999 Aug 6;274(32):22635-45
pubmed: 10428844
Cell Death Differ. 2007 Aug;14(8):1433-42
pubmed: 17431426
Mol Cell Proteomics. 2010 May;9(5):912-27
pubmed: 20305283
Sci Rep. 2015 Mar 20;5:9363
pubmed: 25791537
PLoS One. 2015 May 21;10(5):e0127877
pubmed: 25996941
PLoS One. 2012;7(11):e50300
pubmed: 23209700
Curr Protein Pept Sci. 2010 Nov;11(7):609-28
pubmed: 20887256
Bioinformatics. 2014 Jan 1;30(1):71-80
pubmed: 24149049
J Biol Chem. 1998 Jul 10;273(28):17559-64
pubmed: 9651349
Oncogene. 2017 Jun 1;36(22):3137-3148
pubmed: 27941877
Brief Bioinform. 2019 Nov 27;20(6):2150-2166
pubmed: 30184176
Proteins. 2014 Nov;82(11):3170-6
pubmed: 25204636
Bioinformatics. 2019 Jul 1;35(13):2216-2225
pubmed: 30462171
J Cell Sci. 2013 Jul 1;126(Pt 13):2820-33
pubmed: 23606746
J Cell Biochem. 2008 Dec 1;105(5):1211-8
pubmed: 18802920
Mol Cell Proteomics. 2010 May;9(5):894-911
pubmed: 20305284
Bioinformatics. 2019 Jul 15;35(14):2386-2394
pubmed: 30520972
Brief Bioinform. 2020 May 21;21(3):1047-1057
pubmed: 31067315
Nucleic Acids Res. 2018 Jan 4;46(D1):D624-D632
pubmed: 29145643
J Biol Chem. 2011 Sep 16;286(37):32333-43
pubmed: 21795698

Auteurs

Fuyi Li (F)

Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia.
Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia.

Jinxiang Chen (J)

Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia.
College of Information Engineering, Northwest A&F University, Yangling 712100, China.

André Leier (A)

Department of Genetics, USA.
Department of Cell, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA.

Tatiana Marquez-Lago (T)

Department of Genetics, USA.
Department of Cell, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA.

Quanzhong Liu (Q)

College of Information Engineering, Northwest A&F University, Yangling 712100, China.

Yanze Wang (Y)

College of Information Engineering, Northwest A&F University, Yangling 712100, China.

Jerico Revote (J)

Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia.

A Ian Smith (AI)

Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia.

Tatsuya Akutsu (T)

Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto 611-0011, Japan.

Geoffrey I Webb (GI)

Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia.

Lukasz Kurgan (L)

Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA.

Jiangning Song (J)

Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia.
Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC 3800, Australia.
ARC Centre of Excellence in Advanced Molecular Imaging, Monash University, Melbourne, VIC 3800, Australia.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Cephalometry Humans Anatomic Landmarks Software Internet

Classifications MeSH