Prediction of secondary structure population and intrinsic disorder of proteins using multitask deep learning.
Journal
AMIA ... Annual Symposium proceedings. AMIA Symposium
ISSN: 1942-597X
Titre abrégé: AMIA Annu Symp Proc
Pays: United States
ID NLM: 101209213
Informations de publication
Date de publication:
2020
2020
Historique:
entrez:
3
5
2021
pubmed:
4
5
2021
medline:
14
7
2021
Statut:
epublish
Résumé
Recent research in predicting protein secondary structure populations (SSP) based on Nuclear Magnetic Resonance (NMR) chemical shifts has helped quantitatively characterise the structural conformational properties of intrinsically disordered proteins and regions (IDP/IDR). Different from protein secondary structure (SS) prediction, the SSP prediction assumes a dynamic assignment of secondary structures that seem correlate with disordered states. In this study, we designed a single-task deep learning framework to predict IDP/IDR and SSP respectively; and multitask deep learning frameworks to allow quantitative predictions of IDP/IDR evidenced by the simultaneously predicted SSP. According to independent test results, single-task deep learning models improve the prediction performance of shallow models for SSP and IDP/IDR. Also, the prediction performance was further improved for IDP/IDR prediction when SSP prediction was simultaneously predicted in multitask models. With p53 as a use case, we demonstrate how predicted SSP is used to explain the IDP/IDR predictions for each functional region.
Substances chimiques
Intrinsically Disordered Proteins
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
1325-1334Informations de copyright
©2020 AMIA - All rights reserved.
Références
Proteins. 2014 Feb;82 Suppl 2:127-37
pubmed: 23946100
Proteins. 2011;79 Suppl 10:107-18
pubmed: 21928402
J Biomol Struct Dyn. 2012;29(4):799-813
pubmed: 22208280
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W349-57
pubmed: 23748958
J Mol Biol. 1999 Oct 22;293(2):321-31
pubmed: 10550212
J Biol Chem. 2004 Jan 9;279(2):1291-6
pubmed: 14534297
Bioinformatics. 2017 Mar 1;33(5):685-692
pubmed: 28011771
Structure. 2012 Dec 5;20(12):2014-24
pubmed: 23063560
Sci Rep. 2016 Jan 11;6:18962
pubmed: 26752681
J Mol Biol. 1991 Nov 20;222(2):311-33
pubmed: 1960729
Nucleic Acids Res. 2007 Jan;35(Database issue):D786-93
pubmed: 17145717
Int J Mol Sci. 2015 Jul 29;16(8):17315-30
pubmed: 26230689
J Biomol Struct Dyn. 2012;29(6):643-9
pubmed: 22545995
J Mol Biol. 2015 Feb 27;427(4):982-996
pubmed: 25534081
Intrinsically Disord Proteins. 2013 Apr 1;1(1):e24428
pubmed: 28516009
Bioinformatics. 2015 Mar 15;31(6):857-63
pubmed: 25391399
Nucleic Acids Res. 2018 Jan 4;46(D1):D471-D476
pubmed: 29136219
Biochemistry. 2012 Mar 20;51(11):2224-31
pubmed: 22360139
Proc Natl Acad Sci U S A. 2008 Apr 15;105(15):5762-7
pubmed: 18391200
Nucleic Acids Res. 2008 Jan;36(Database issue):D402-8
pubmed: 17984079
Biochemistry. 1992 Feb 18;31(6):1647-51
pubmed: 1737021
Science. 1996 Nov 8;274(5289):948-53
pubmed: 8875929
Sci Rep. 2016 Mar 31;6:23750
pubmed: 27030593
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W460-4
pubmed: 17567614
Nat Rev Mol Cell Biol. 2005 Mar;6(3):197-208
pubmed: 15738986
Eur J Biochem. 1977 Nov 1;80(2):319-24
pubmed: 923582
J Chem Inf Model. 2018 Nov 26;58(11):2369-2376
pubmed: 30395465
Nucleic Acids Res. 1997 Sep 1;25(17):3389-402
pubmed: 9254694
Bioinformatics. 2016 Sep 1;32(17):i672-i679
pubmed: 27587688