How to choose templates for modeling of protein complexes: Insights from benchmarking template-based docking.
Amino Acid Sequence
Animals
Benchmarking
Binding Sites
Dogs
Escherichia coli
/ chemistry
Humans
Ligands
Molecular Docking Simulation
Peptides
/ chemistry
Protein Binding
Protein Conformation, alpha-Helical
Protein Conformation, beta-Strand
Protein Interaction Domains and Motifs
Protein Interaction Mapping
Protein Multimerization
Proteins
/ chemistry
Research Design
Software
Structural Homology, Protein
Thermodynamics
protein recognition
sequence homology
structure prediction
structure similarity
template detection
Journal
Proteins
ISSN: 1097-0134
Titre abrégé: Proteins
Pays: United States
ID NLM: 8700181
Informations de publication
Date de publication:
08 2020
08 2020
Historique:
received:
18
09
2019
revised:
07
01
2020
accepted:
22
01
2020
pubmed:
30
1
2020
medline:
26
1
2021
entrez:
30
1
2020
Statut:
ppublish
Résumé
Comparative docking is based on experimentally determined structures of protein-protein complexes (templates), following the paradigm that proteins with similar sequences and/or structures form similar complexes. Modeling utilizing structure similarity of target monomers to template complexes significantly expands structural coverage of the interactome. Template-based docking by structure alignment can be performed for the entire structures or by aligning targets to the bound interfaces of the experimentally determined complexes. Systematic benchmarking of docking protocols based on full and interface structure alignment showed that both protocols perform similarly, with top 1 docking success rate 26%. However, in terms of the models' quality, the interface-based docking performed marginally better. The interface-based docking is preferable when one would suspect a significant conformational change in the full protein structure upon binding, for example, a rearrangement of the domains in multidomain proteins. Importantly, if the same structure is selected as the top template by both full and interface alignment, the docking success rate increases 2-fold for both top 1 and top 10 predictions. Matching structural annotations of the target and template proteins for template detection, as a computationally less expensive alternative to structural alignment, did not improve the docking performance. Sophisticated remote sequence homology detection added templates to the pool of those identified by structure-based alignment, suggesting that for practical docking, the combination of the structure alignment protocols and the remote sequence homology detection may be useful in order to avoid potential flaws in generation of the structural templates library.
Identifiants
pubmed: 31994759
doi: 10.1002/prot.25875
pmc: PMC7375009
mid: NIHMS1551711
doi:
Substances chimiques
Ligands
0
Peptides
0
Proteins
0
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, Non-P.H.S.
Langues
eng
Sous-ensembles de citation
IM
Pagination
1070-1081Subventions
Organisme : NIGMS NIH HHS
ID : R01 GM074255
Pays : United States
Organisme : NIH HHS
ID : R01GM074255
Pays : United States
Informations de copyright
© 2020 Wiley Periodicals, Inc.
Références
Nature. 2005 Jul 21;436(7049):415-9
pubmed: 16034420
Protein Sci. 2013 Nov;22(11):1655-63
pubmed: 23996787
Cell. 1999 May 28;97(5):635-46
pubmed: 10367892
Proteins. 2015 Jul;83(7):1238-51
pubmed: 25917548
J Mol Biol. 2003 Oct 3;332(5):989-98
pubmed: 14499603
Nature. 2005 Jun 2;435(7042):693-6
pubmed: 15864302
Nat Methods. 2011 Dec 25;9(2):173-5
pubmed: 22198341
PLoS Biol. 2009 Apr 28;7(4):e94
pubmed: 19402751
Proc Natl Acad Sci U S A. 2012 Jun 12;109(24):9438-41
pubmed: 22645367
Structure. 2000 Nov 15;8(11):1147-55
pubmed: 11080637
J Mol Biol. 1995 Apr 7;247(4):536-40
pubmed: 7723011
J Mol Biol. 2008 Dec 19;384(3):652-62
pubmed: 18929577
Bioinformatics. 2005 Apr 1;21(7):951-60
pubmed: 15531603
Proteins. 1997 Apr;27(4):556-66
pubmed: 9141135
Proc Natl Acad Sci U S A. 2009 Jan 13;106(2):450-5
pubmed: 19129502
Proteins. 2018 Mar;86 Suppl 1:302-310
pubmed: 28905425
Nat Protoc. 2011 Aug 11;6(9):1341-54
pubmed: 21886100
Proteins. 2019 Dec;87(12):1200-1221
pubmed: 31612567
Mol Cell. 2008 Dec 26;32(6):791-802
pubmed: 19111659
J Mol Biol. 2011 Apr 29;408(2):356-65
pubmed: 21354429
Structure. 2008 Jul;16(7):1116-25
pubmed: 18611384
Science. 2010 Sep 3;329(5996):1210-4
pubmed: 20813955
Proc Natl Acad Sci U S A. 2007 Jun 19;104(25):10583-8
pubmed: 17563375
Proteins. 2019 Mar;87(3):245-253
pubmed: 30520123
Protein Sci. 2018 Jan;27(1):172-181
pubmed: 28891124
Proteins. 2012 Apr;80(4):1239-49
pubmed: 22275112
Proteins. 2013 Dec;81(12):2082-95
pubmed: 24115211
Nucleic Acids Res. 2015 Jan;43(Database issue):D376-81
pubmed: 25348408
Proc Natl Acad Sci U S A. 2009 Oct 13;106(41):17377-82
pubmed: 19805138
Proteins. 2010 Nov 15;78(15):3235-41
pubmed: 20715056
Cell. 2010 Aug 20;142(4):568-79
pubmed: 20723758
EMBO J. 2003 Mar 3;22(5):1125-33
pubmed: 12606577
Acta Crystallogr D Biol Crystallogr. 1999 Jul;55(Pt 7):1257-63
pubmed: 10393292
Mol Cell. 2003 Feb;11(2):507-17
pubmed: 12620237
Structure. 2007 Mar;15(3):341-54
pubmed: 17355869
PLoS One. 2009 Dec 30;4(12):e8514
pubmed: 20041127
Proteins. 2015 Sep;83(9):1563-70
pubmed: 25488330
J Mol Biol. 2008 Jan 25;375(4):1141-51
pubmed: 18076904
Proteins. 2017 Mar;85(3):470-478
pubmed: 27701777
Biophys J. 2014 Oct 21;107(8):1785-1793
pubmed: 25418159
Protein Eng Des Sel. 2005 Jan;18(1):11-24
pubmed: 15790576
Proc Natl Acad Sci U S A. 2009 Mar 24;106(12):4665-70
pubmed: 19255450
Nature. 2012 Oct 25;490(7421):556-60
pubmed: 23023127
Nat Struct Biol. 1997 Jan;4(1):64-9
pubmed: 8989326
Proc Natl Acad Sci U S A. 2011 Jun 21;108(25):10266-71
pubmed: 21642534
Proc Natl Acad Sci U S A. 2010 Jun 15;107(24):10896-901
pubmed: 20534496
Proc Natl Acad Sci U S A. 2014 Aug 26;111(34):E3514-23
pubmed: 25114243
Structure. 1998 Jan 15;6(1):63-73
pubmed: 9493268
EMBO Rep. 2005 Jul;6(7):675-80
pubmed: 15947784
PLoS Comput Biol. 2014 Dec 04;10(12):e1003926
pubmed: 25474468
Nucleic Acids Res. 2005 Apr 22;33(7):2302-9
pubmed: 15849316
PLoS One. 2012;7(2):e31349
pubmed: 22348074
Proteins. 2013 Dec;81(12):2137-42
pubmed: 23946125
Proteins. 2007 Dec 1;69(4):704-18
pubmed: 17918726