DeepCDpred: Inter-residue distance and contact prediction for improved prediction of protein structure.


Journal

PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081

Informations de publication

Date de publication:
2019
Historique:
received: 19 09 2018
accepted: 13 12 2018
entrez: 9 1 2019
pubmed: 9 1 2019
medline: 29 9 2019
Statut: epublish

Résumé

Rapid, accurate prediction of protein structure from amino acid sequence would accelerate fields as diverse as drug discovery, synthetic biology and disease diagnosis. Massively improved prediction of protein structures has been driven by improving the prediction of the amino acid residues that contact in their 3D structure. For an average globular protein, around 92% of all residue pairs are non-contacting, therefore accurate prediction of only a small percentage of inter-amino acid distances could increase the number of constraints to guide structure determination. We have trained deep neural networks to predict inter-residue contacts and distances. Distances are predicted with an accuracy better than most contact prediction techniques. Addition of distance constraints improved de novo structure predictions for test sets of 158 protein structures, as compared to using the best contact prediction methods alone. Importantly, usage of distance predictions allows the selection of better models from the structure pool without a need for an external model assessment tool. The results also indicate how the accuracy of distance prediction methods might be improved further.

Identifiants

pubmed: 30620738
doi: 10.1371/journal.pone.0205214
pii: PONE-D-18-27372
pmc: PMC6324825
doi:

Substances chimiques

Proteins 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

e0205214

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Références

Bioinformatics. 2010 Apr 1;26(7):889-95
pubmed: 20164152
Bioinformatics. 2013 Jul 15;29(14):1815-6
pubmed: 23658418
Proc Natl Acad Sci U S A. 2011 Dec 6;108(49):E1293-301
pubmed: 22106262
Science. 2017 Jan 20;355(6322):294-298
pubmed: 28104891
Proteins. 1999;Suppl 3:171-6
pubmed: 10526365
Theor Biol Med Model. 2015 Sep 04;12:15
pubmed: 26338054
Nat Biotechnol. 2012 Nov;30(11):1072-80
pubmed: 23138306
In Silico Biol. 2003;3(3):241-64
pubmed: 12954088
Bioinformatics. 2015 Apr 1;31(7):999-1006
pubmed: 25431331
Bioinformatics. 2012 Jan 15;28(2):184-90
pubmed: 22101153
Bioinformatics. 2017 Aug 1;33(15):2296-2306
pubmed: 28369334
BMC Struct Biol. 2009 Jan 30;9:5
pubmed: 19183478
PLoS Comput Biol. 2007 Nov;3(11):e211
pubmed: 17983264
Bioinformatics. 2018 May 1;34(9):1466-1472
pubmed: 29228185
Elife. 2015 Sep 03;4:e09248
pubmed: 26335199
BMC Bioinformatics. 2014 Jan 10;15:6
pubmed: 24410833
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324
pubmed: 28056090
Methods Mol Biol. 2017;1484:55-63
pubmed: 27787820
Nucleic Acids Res. 2017 Jan 4;45(D1):D289-D295
pubmed: 27899584
PLoS Comput Biol. 2007 Mar 23;3(3):e52
pubmed: 17381236
Bioinformatics. 2003 Aug 12;19(12):1589-91
pubmed: 12912846
Nucleic Acids Res. 2017 Jul 3;45(W1):W416-W421
pubmed: 28460136
Proteins. 1994 Apr;18(4):309-17
pubmed: 8208723
Bioinformatics. 2017 Jul 15;33(14):i23-i29
pubmed: 28881974

Auteurs

Shuangxi Ji (S)

School of Biosciences, University of Birmingham, Edgbaston Birmingham, B15 2TT, United Kingdom.

Tuğçe Oruç (T)

School of Biosciences, University of Birmingham, Edgbaston Birmingham, B15 2TT, United Kingdom.

Liam Mead (L)

School of Biosciences, University of Birmingham, Edgbaston Birmingham, B15 2TT, United Kingdom.

Muhammad Fayyaz Rehman (MF)

School of Biosciences, University of Birmingham, Edgbaston Birmingham, B15 2TT, United Kingdom.

Christopher Morton Thomas (CM)

School of Biosciences, University of Birmingham, Edgbaston Birmingham, B15 2TT, United Kingdom.

Sam Butterworth (S)

School of Biosciences, University of Birmingham, Edgbaston Birmingham, B15 2TT, United Kingdom.
Division of Pharmacy and Optometry, School of Health Sciences, Manchester Academic Health Sciences Centre, University of Manchester, Manchester, M13 9PL, United Kingdom.

Peter James Winn (PJ)

School of Biosciences, University of Birmingham, Edgbaston Birmingham, B15 2TT, United Kingdom.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Animals Hemiptera Insect Proteins Phylogeny Insecticides

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Classifications MeSH