DeepCDpred: Inter-residue distance and contact prediction for improved prediction of protein structure.
Journal
PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081
Informations de publication
Date de publication:
2019
2019
Historique:
received:
19
09
2018
accepted:
13
12
2018
entrez:
9
1
2019
pubmed:
9
1
2019
medline:
29
9
2019
Statut:
epublish
Résumé
Rapid, accurate prediction of protein structure from amino acid sequence would accelerate fields as diverse as drug discovery, synthetic biology and disease diagnosis. Massively improved prediction of protein structures has been driven by improving the prediction of the amino acid residues that contact in their 3D structure. For an average globular protein, around 92% of all residue pairs are non-contacting, therefore accurate prediction of only a small percentage of inter-amino acid distances could increase the number of constraints to guide structure determination. We have trained deep neural networks to predict inter-residue contacts and distances. Distances are predicted with an accuracy better than most contact prediction techniques. Addition of distance constraints improved de novo structure predictions for test sets of 158 protein structures, as compared to using the best contact prediction methods alone. Importantly, usage of distance predictions allows the selection of better models from the structure pool without a need for an external model assessment tool. The results also indicate how the accuracy of distance prediction methods might be improved further.
Identifiants
pubmed: 30620738
doi: 10.1371/journal.pone.0205214
pii: PONE-D-18-27372
pmc: PMC6324825
doi:
Substances chimiques
Proteins
0
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
e0205214Déclaration de conflit d'intérêts
The authors have declared that no competing interests exist.
Références
Bioinformatics. 2010 Apr 1;26(7):889-95
pubmed: 20164152
Bioinformatics. 2013 Jul 15;29(14):1815-6
pubmed: 23658418
Proc Natl Acad Sci U S A. 2011 Dec 6;108(49):E1293-301
pubmed: 22106262
Science. 2017 Jan 20;355(6322):294-298
pubmed: 28104891
Proteins. 1999;Suppl 3:171-6
pubmed: 10526365
Theor Biol Med Model. 2015 Sep 04;12:15
pubmed: 26338054
Nat Biotechnol. 2012 Nov;30(11):1072-80
pubmed: 23138306
In Silico Biol. 2003;3(3):241-64
pubmed: 12954088
Bioinformatics. 2015 Apr 1;31(7):999-1006
pubmed: 25431331
Bioinformatics. 2012 Jan 15;28(2):184-90
pubmed: 22101153
Bioinformatics. 2017 Aug 1;33(15):2296-2306
pubmed: 28369334
BMC Struct Biol. 2009 Jan 30;9:5
pubmed: 19183478
PLoS Comput Biol. 2007 Nov;3(11):e211
pubmed: 17983264
Bioinformatics. 2018 May 1;34(9):1466-1472
pubmed: 29228185
Elife. 2015 Sep 03;4:e09248
pubmed: 26335199
BMC Bioinformatics. 2014 Jan 10;15:6
pubmed: 24410833
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324
pubmed: 28056090
Methods Mol Biol. 2017;1484:55-63
pubmed: 27787820
Nucleic Acids Res. 2017 Jan 4;45(D1):D289-D295
pubmed: 27899584
PLoS Comput Biol. 2007 Mar 23;3(3):e52
pubmed: 17381236
Bioinformatics. 2003 Aug 12;19(12):1589-91
pubmed: 12912846
Nucleic Acids Res. 2017 Jul 3;45(W1):W416-W421
pubmed: 28460136
Proteins. 1994 Apr;18(4):309-17
pubmed: 8208723
Bioinformatics. 2017 Jul 15;33(14):i23-i29
pubmed: 28881974