Prediction of secondary structure population and intrinsic disorder of proteins using multitask deep learning.


Journal

AMIA ... Annual Symposium proceedings. AMIA Symposium
ISSN: 1942-597X
Titre abrégé: AMIA Annu Symp Proc
Pays: United States
ID NLM: 101209213

Informations de publication

Date de publication:
2020
Historique:
entrez: 3 5 2021
pubmed: 4 5 2021
medline: 14 7 2021
Statut: epublish

Résumé

Recent research in predicting protein secondary structure populations (SSP) based on Nuclear Magnetic Resonance (NMR) chemical shifts has helped quantitatively characterise the structural conformational properties of intrinsically disordered proteins and regions (IDP/IDR). Different from protein secondary structure (SS) prediction, the SSP prediction assumes a dynamic assignment of secondary structures that seem correlate with disordered states. In this study, we designed a single-task deep learning framework to predict IDP/IDR and SSP respectively; and multitask deep learning frameworks to allow quantitative predictions of IDP/IDR evidenced by the simultaneously predicted SSP. According to independent test results, single-task deep learning models improve the prediction performance of shallow models for SSP and IDP/IDR. Also, the prediction performance was further improved for IDP/IDR prediction when SSP prediction was simultaneously predicted in multitask models. With p53 as a use case, we demonstrate how predicted SSP is used to explain the IDP/IDR predictions for each functional region.

Identifiants

pubmed: 33936509
pii: 169_3411384
pmc: PMC8075420

Substances chimiques

Intrinsically Disordered Proteins 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

1325-1334

Informations de copyright

©2020 AMIA - All rights reserved.

Références

Proteins. 2014 Feb;82 Suppl 2:127-37
pubmed: 23946100
Proteins. 2011;79 Suppl 10:107-18
pubmed: 21928402
J Biomol Struct Dyn. 2012;29(4):799-813
pubmed: 22208280
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W349-57
pubmed: 23748958
J Mol Biol. 1999 Oct 22;293(2):321-31
pubmed: 10550212
J Biol Chem. 2004 Jan 9;279(2):1291-6
pubmed: 14534297
Bioinformatics. 2017 Mar 1;33(5):685-692
pubmed: 28011771
Structure. 2012 Dec 5;20(12):2014-24
pubmed: 23063560
Sci Rep. 2016 Jan 11;6:18962
pubmed: 26752681
J Mol Biol. 1991 Nov 20;222(2):311-33
pubmed: 1960729
Nucleic Acids Res. 2007 Jan;35(Database issue):D786-93
pubmed: 17145717
Int J Mol Sci. 2015 Jul 29;16(8):17315-30
pubmed: 26230689
J Biomol Struct Dyn. 2012;29(6):643-9
pubmed: 22545995
J Mol Biol. 2015 Feb 27;427(4):982-996
pubmed: 25534081
Intrinsically Disord Proteins. 2013 Apr 1;1(1):e24428
pubmed: 28516009
Bioinformatics. 2015 Mar 15;31(6):857-63
pubmed: 25391399
Nucleic Acids Res. 2018 Jan 4;46(D1):D471-D476
pubmed: 29136219
Biochemistry. 2012 Mar 20;51(11):2224-31
pubmed: 22360139
Proc Natl Acad Sci U S A. 2008 Apr 15;105(15):5762-7
pubmed: 18391200
Nucleic Acids Res. 2008 Jan;36(Database issue):D402-8
pubmed: 17984079
Biochemistry. 1992 Feb 18;31(6):1647-51
pubmed: 1737021
Science. 1996 Nov 8;274(5289):948-53
pubmed: 8875929
Sci Rep. 2016 Mar 31;6:23750
pubmed: 27030593
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W460-4
pubmed: 17567614
Nat Rev Mol Cell Biol. 2005 Mar;6(3):197-208
pubmed: 15738986
Eur J Biochem. 1977 Nov 1;80(2):319-24
pubmed: 923582
J Chem Inf Model. 2018 Nov 26;58(11):2369-2376
pubmed: 30395465
Nucleic Acids Res. 1997 Sep 1;25(17):3389-402
pubmed: 9254694
Bioinformatics. 2016 Sep 1;32(17):i672-i679
pubmed: 27587688

Auteurs

Xu Ying (X)

IBM Research Australia, Melbourne, Victoria, Australia.

Andre Leier (A)

University of Alabama at Birmingham, Birmingham, AL, USA.

Tatiana T Marquez-Lago (TT)

University of Alabama at Birmingham, Birmingham, AL, USA.

Jue Xie (J)

Monash University, Melbourne, Victoria, Australia.

Antonio Jose Jimeno Yepes (AJ)

IBM Research Australia, Melbourne, Victoria, Australia.

James C Whisstock (JC)

Monash University, Melbourne, Victoria, Australia.

Campbell Wilson (C)

Monash University, Melbourne, Victoria, Australia.

Jiangning Song (J)

Monash University, Melbourne, Victoria, Australia.

Articles similaires

Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Humans Breast Neoplasms Female Deep Learning Ultrasonography, Mammary
Humans Deep Learning Mouth Neoplasms Drug Resistance, Neoplasm Cell Line, Tumor
1.00
Saccharomyces cerevisiae Lysine Cell Nucleolus RNA, Ribosomal Saccharomyces cerevisiae Proteins

Classifications MeSH