Easy Not Easy: Comparative Modeling with High-Sequence Identity Templates.

Comparative modeling Conformational diversity Conformational ensemble Homology modeling Native state Protein dynamics Protein structure Structural divergence

Journal

Methods in molecular biology (Clifton, N.J.)
ISSN: 1940-6029
Titre abrégé: Methods Mol Biol
Pays: United States
ID NLM: 9214969

Informations de publication

Date de publication:
2023
Historique:
entrez: 24 3 2023
pubmed: 25 3 2023
medline: 28 3 2023
Statut: ppublish

Résumé

Homology modeling is the most common technique to build structural models of a target protein based on the structure of proteins with high-sequence identity and available high-resolution structures. This technique is based on the idea that protein structure shows fewer changes than sequence through evolution. While in this scenario single mutations would minimally perturb the structure, experimental evidence shows otherwise: proteins with high conformational diversity impose a limit of the paradigm of comparative modeling as the same protein sequence can adopt dissimilar three-dimensional structures. These cases present challenges for modeling; at first glance, they may seem to be easy cases, but they have a complexity that is not evident at the sequence level. In this chapter, we address the following questions: Why should we care about conformational diversity? How to consider conformational diversity when doing template-based modeling in a practical way?

Identifiants

pubmed: 36959443
doi: 10.1007/978-1-0716-2974-1_5
doi:

Substances chimiques

Proteins 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

83-100

Informations de copyright

© 2023. Springer Science+Business Media, LLC, part of Springer Nature.

Références

Sali A, Blundell TL (1993) Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol 234:779–815
pubmed: 8254673 doi: 10.1006/jmbi.1993.1626
Consortium TU, The UniProt Consortium (2019) UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res 47:D506–D515
doi: 10.1093/nar/gky1049
Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE (2000) The Protein Data Bank. Nucleic Acids Res 28:235–242
pubmed: 10592235 pmcid: 102472 doi: 10.1093/nar/28.1.235
Dawson NL, Lewis TE, Das S, Lees JG, Lee D, Ashford P, Orengo CA, Sillitoe I (2017) CATH: an expanded resource to predict protein function through structure and sequence. Nucleic Acids Res 45:D289–D295
pubmed: 27899584 doi: 10.1093/nar/gkw1098
Andreeva A, Kulesha E, Gough J, Murzin AG (2020) The SCOP database in 2020: expanded classification of representative family and superfamily domains of known protein structures. Nucleic Acids Res 48:D376–D382
pubmed: 31724711 doi: 10.1093/nar/gkz1064
Marino-Buslje C, Monzon AM, Zea DJ, Fornasari MS, Parisi G (2017) On the dynamical incompleteness of the Protein Data Bank. Brief Bioinform 20:356–359
doi: 10.1093/bib/bbx084
Anfinsen CB (1973) Principles that govern the folding of protein chains. Science 181:223–230
pubmed: 4124164 doi: 10.1126/science.181.4096.223
Boehr DD, Nussinov R, Wright PE (2009) The role of dynamic conformational ensembles in biomolecular recognition. Nat Chem Biol 5:789–796
pubmed: 19841628 pmcid: 2916928 doi: 10.1038/nchembio.232
Monzon AM, Fornasari MS, Zea DJ, Parisi G (2019) Exploring protein conformational diversity. Methods Mol Biol 1851:353–365
pubmed: 30298408 doi: 10.1007/978-1-4939-8736-8_20
Salahuddin P, Distributed Information Sub-Centre (DISC), Interdisciplinary Biotechnology Unit, Aligarh Muslim University (A. M. U. ), Aligarh, India (2015) Protein folding, misfolding, aggregation and amyloid formation: mechanisms of Aβ oligomer mediated toxicities. J Biochem Mol Biol Res 1:36–45
doi: 10.17554/j.issn.2313-7177.2015.01.4
Lin J-C, Liu H-L (2006) Protein conformational diseases: from mechanisms to drug designs. Curr Drug Discov Technol 3:145–153
pubmed: 16925522 doi: 10.2174/157016306778108866
Ellisdon AM, Bottomley SP (2004) The role of protein misfolding in the pathogenesis of human diseases. IUBMB Life 56:119–123
pubmed: 15185744 doi: 10.1080/15216540410001674003
Sweeney P, Park H, Baumann M et al (2017) Protein misfolding in neurodegenerative diseases: implications and strategies. Transl Neurodegener 6:6
pubmed: 28293421 pmcid: 5348787 doi: 10.1186/s40035-017-0077-5
Tress M, Tai C-H, Wang G, Ezkurdia I, López G, Valencia A, Lee B, Dunbrack RL Jr (2005) Domain definition and target classification for CASP6. Proteins 61(Suppl 7):8–18
pubmed: 16187342 doi: 10.1002/prot.20717
Kinch LN, Kryshtafovych A, Monastyrskyy B, Grishin NV (2019) CASP13 target classification into tertiary structure prediction categories. Proteins 87:1021–1036
pubmed: 31294862 pmcid: 6851465 doi: 10.1002/prot.25775
Yassine W, Taib N, Federman S et al (2009) Reversible transition between alpha-helix and beta-sheet conformation of a transmembrane domain. Biochim Biophys Acta 1788:1722. https://doi.org/10.1016/j.bbamem.2009.05.014
doi: 10.1016/j.bbamem.2009.05.014 pubmed: 19482005
Koshland DE (1998) Conformational changes: how small is big enough? Nat Med 4:1112–1114
pubmed: 9771734 doi: 10.1038/2605
Mesecar AD, Stoddard BL, Koshland DE Jr (1997) Orbital steering in the catalytic power of enzymes: small structural changes with large catalytic consequences. Science 277:202–206
pubmed: 9211842 doi: 10.1126/science.277.5323.202
Monzon AM, Zea DJ, Fornasari MS, Saldaño TE, Fernandez-Alberti S, Tosatto SCE, Parisi G (2017) Conformational diversity analysis reveals three functional mechanisms in proteins. PLoS Comput Biol 13:e1005398
pubmed: 28192432 pmcid: 5330503 doi: 10.1371/journal.pcbi.1005398
Jurcik A, Bednar D, Byska J et al (2018) CAVER Analyst 2.0: analysis and visualization of channels and tunnels in protein structures and molecular dynamics trajectories. Bioinformatics 34:3586–3588
pubmed: 29741570 pmcid: 6184705 doi: 10.1093/bioinformatics/bty386
Goddard TD, Huang CC, Meng EC, Pettersen EF, Couch GS, Morris JH, Ferrin TE (2018) UCSF ChimeraX: meeting modern challenges in visualization and analysis. Protein Sci 27:14–25
pubmed: 28710774 doi: 10.1002/pro.3235
Olechnovič K, Monastyrskyy B, Kryshtafovych A, Venclovas Č (2019) Comparative analysis of methods for evaluation of protein models against native structures. Bioinformatics 35:937–944
pubmed: 30169622 doi: 10.1093/bioinformatics/bty760
Lupyan D, Leo-Macias A, Ortiz AR (2005) A new progressive-iterative algorithm for multiple structure alignment. Bioinformatics 21:3255–3263
pubmed: 15941743 doi: 10.1093/bioinformatics/bti527
Kabsch W (1976) A solution for the best rotation to relate two sets of vectors. Acta Crystallogr A 32:922–923
doi: 10.1107/S0567739476001873
Kabsch W (1978) A discussion of the solution for the best rotation to relate two sets of vectors. Acta Crystallogr A 34:827–828
doi: 10.1107/S0567739478001680
Burra PV, Zhang Y, Godzik A, Stec B (2009) Global distribution of conformational states derived from redundant models in the PDB points to non-uniqueness of the protein structure. Proc Natl Acad Sci U S A 106:10505–10510
pubmed: 19553204 pmcid: 2705611 doi: 10.1073/pnas.0812152106
Sikic K, Tomic S, Carugo O (2010) Systematic comparison of crystal and NMR protein structures deposited in the protein data bank. Open Biochem J 4:83–95
pubmed: 21293729 pmcid: 3032220 doi: 10.2174/1874091X01004010083
Kosloff M, Kolodny R (2008) Sequence-similar, structure-dissimilar protein pairs in the PDB. Proteins Struct Funct Bioinf 71:891–902
doi: 10.1002/prot.21770
Tramontano A, Morea V (2004) Assessment of homology-based predictions in CASP5. Proteins Struct Funct Bioinf 55:782–782
doi: 10.1002/prot.20187
Rataj K, Witek J, Mordalski S, Kosciolek T, Bojarski AJ (2014) Impact of template choice on homology model efficiency in virtual screening. J Chem Inf Model 54:1661–1668
pubmed: 24813470 doi: 10.1021/ci500001f
Parisi G, Zea DJ, Monzon AM, Marino-Buslje C (2015) Conformational diversity and the emergence of sequence signatures during evolution. Curr Opin Struct Biol 32:58–65
pubmed: 25749052 doi: 10.1016/j.sbi.2015.02.005
Ingram VM (1957) Gene mutations in human haemoglobin: the chemical difference between normal and sickle cell haemoglobin. Nature 180:326–328
pubmed: 13464827 doi: 10.1038/180326a0
Hunt JA, Ingram VM (1959) A terminal peptide sequence of human haemoglobin? Nature 184(Suppl 9):640–641
doi: 10.1038/184640b0
Molina-Vila MA, Nabau-Moretó N, Tornador C, Sabnis AJ, Rosell R, Estivill X, Bivona TG, Marino-Buslje C (2014) Activating mutations cluster in the “molecular brake” regions of protein kinases and do not associate with conserved or catalytic residues. Hum Mutat 35:318–328
pubmed: 24323975 doi: 10.1002/humu.22493
Huang Y-WA, Zhou B, Wernig M, Südhof TC (2017) ApoE2, ApoE3, and ApoE4 differentially stimulate APP transcription and Aβ secretion. Cell 168:427–441.e21
pubmed: 28111074 doi: 10.1016/j.cell.2016.12.044
Illergård K, Ardell DH, Elofsson A (2009) Structure is three to ten times more conserved than sequence—a study of structural response in protein cores. Proteins Struct Funct Bioinf 77:499–508
doi: 10.1002/prot.22458
Waterhouse A, Bertoni M, Bienert S et al (2018) SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res 46:W296–W303
pubmed: 29788355 pmcid: 6030848 doi: 10.1093/nar/gky427
Sander C, Schneider R (1991) Database of homology-derived protein structures and the structural meaning of sequence alignment. Proteins 9:56–68
pubmed: 2017436 doi: 10.1002/prot.340090107
El-Gebali S, Mistry J, Bateman A et al (2019) The Pfam protein families database in 2019. Nucleic Acids Res 47:D427–D432
pubmed: 30357350 doi: 10.1093/nar/gky995
Zea DJ, Monzon AM, Parisi G, Marino-Buslje C (2018) How is structural divergence related to evolutionary information? Mol Phylogenet Evol 127:859–866
pubmed: 29953938 doi: 10.1016/j.ympev.2018.06.033
Vetrivel I, de Brevern AG, Cadet F, Srinivasan N, Offmann B (2019) Structural variations within proteins can be as large as variations observed across their homologues. Biochimie 167:162–170
pubmed: 31560932 doi: 10.1016/j.biochi.2019.09.013
Monzon AM, Zea DJ, Marino-Buslje C, Parisi G (2017) Homology modeling in a dynamical world. Protein Sci 26:2195–2206
pubmed: 28815769 pmcid: 5654859 doi: 10.1002/pro.3274
Zea DJ, Anfossi D, Nielsen M, Marino-Buslje C (2017) MIToS.jl: mutual information tools for protein sequence analysis in the Julia language. Bioinformatics 33:564–565
pubmed: 27797756 doi: 10.1093/bioinformatics/btw646
Elber R, Karplus M (1987) Multiple conformational states of proteins: a molecular dynamics analysis of myoglobin. Science 235:318–321
pubmed: 3798113 doi: 10.1126/science.3798113
Narayanan C, Bernard DN, Doucet N (2016) Role of conformational motions in enzyme function: selected methodologies and case studies. Catalysts. https://doi.org/10.3390/catal6060081
Saldaño TE, Freixas VM, Tosatto SCE, Parisi G, Fernandez-Alberti S (2020) Exploring conformational space with thermal fluctuations obtained by normal-mode analysis. J Chem Inf Model 60:3068. https://doi.org/10.1021/acs.jcim.9b01136
doi: 10.1021/acs.jcim.9b01136 pubmed: 32216314
Bienert S, Waterhouse A, de Beer TAP, Tauriello G, Studer G, Bordoli L, Schwede T (2017) The SWISS-MODEL repository—new features and functionality. Nucleic Acids Res 45:D313–D319
pubmed: 27899672 doi: 10.1093/nar/gkw1132
Narunsky A, Nepomnyachiy S, Ashkenazy H, Kolodny R, Ben-Tal N (2015) ConTemplate suggests possible alternative conformations for a query protein of known structure. Structure 23:2162–2170
pubmed: 26455800 doi: 10.1016/j.str.2015.08.018
Palopoli N, Monzon AM, Parisi G, Fornasari MS (2016) Addressing the role of conformational diversity in protein structure prediction. PLoS One 11:e0154923
pubmed: 27159429 pmcid: 4861349 doi: 10.1371/journal.pone.0154923
Zhang Y, Skolnick J (2004) Scoring function for automated assessment of protein structure template quality. Proteins 57:702–710
pubmed: 15476259 doi: 10.1002/prot.20264
Ramachandran GN, Sasisekharan V (1968) Conformation of polypeptides and proteins **The literature survey for this review was completed in September 1967, with the journals which were then available in Madras and the preprinta which the authors had received. ††By the authors’ request, the publishers have left certain matters of usage and spelling in the form in which they wrote them. In: Anfinsen CB, Anson ML, Edsall JT, Richards FM (eds) Advances in protein chemistry. Academic Press, pp 283–437
Laskowski RA, MacArthur MW, Moss DS, Thornton JM (1993) PROCHECK: a program to check the stereochemical quality of protein structures. J Appl Crystallogr 26:283. https://doi.org/10.1107/S0021889892009944
doi: 10.1107/S0021889892009944
Zhou AQ, O’Hern CS, Regan L (2011) Revisiting the Ramachandran plot from a new angle. Protein Sci 20:1166–1171
pubmed: 21538644 pmcid: 3149190 doi: 10.1002/pro.644
Eisenberg D, Lüthy R, Bowie JU (1997) VERIFY3D: assessment of protein models with three-dimensional profiles. Methods Enzymol 277:396–404
pubmed: 9379925 doi: 10.1016/S0076-6879(97)77022-8
Gore S, Sanz García E, Hendrickx PMS et al (2017) Validation of structures in the Protein Data Bank. Structure 25:1916–1927
pubmed: 29174494 pmcid: 5718880 doi: 10.1016/j.str.2017.10.009
Schrodinger LLC (2010) The PyMOL molecular graphics system Version 1:0
Pettersen EF, Goddard TD, Huang CC (2004) UCSF Chimera—a visualization system for exploratory research and analysis. J Comput Chem 25:1605–1612
pubmed: 15264254 doi: 10.1002/jcc.20084
Léonard S, Joseph AP, Srinivasan N, Gelly J-C, de Brevern AG (2014) mulPBA: an efficient multiple protein structure alignment method based on a structural alphabet. J Biomol Struct Dyn 32:661–668
pubmed: 23659291 doi: 10.1080/07391102.2013.787026
Maiti R, Van Domselaar GH, Zhang H, Wishart DS (2004) SuperPose: a simple server for sophisticated structural superposition. Nucleic Acids Res 32:W590–W594
pubmed: 15215457 pmcid: 441615 doi: 10.1093/nar/gkh477
Jarnot P, Ziemska-Legiecka J, Dobson L et al (2020) PlaToLoCo: the first web meta-server for visualization and annotation of low complexity regions in proteins. Nucleic Acids Res 48:W77. https://doi.org/10.1093/nar/gkaa339
doi: 10.1093/nar/gkaa339 pubmed: 32421769 pmcid: 7319588
Potenza E, Di Domenico T, Walsh I, Tosatto SCE (2015) MobiDB 2.0: an improved database of intrinsically disordered and mobile proteins. Nucleic Acids Res 43:D315–D320
pubmed: 25361972 doi: 10.1093/nar/gku982
Mészáros B, Erdos G, Dosztányi Z (2018) IUPred2A: context-dependent prediction of protein disorder as a function of redox state and protein binding. Nucleic Acids Res 46:W329–W337
pubmed: 29860432 pmcid: 6030935 doi: 10.1093/nar/gky384

Auteurs

Diego Javier Zea (DJ)

Laboratory of Computational and Quantitative Biology, LCQB, UMR 7238 CNRS, IBPS, Sorbonne Université, Paris, France.

Elin Teppa (E)

Toulouse Biotechnology Institute, TBI, Université de Toulouse, CNRS, INRA, INSA, Toulouse, France.

Cristina Marino-Buslje (C)

Fundacion Instituto Leloir, Buenos Aires, Argentina. cmb@leloir.org.ar.

Articles similaires

Photosynthesis Ribulose-Bisphosphate Carboxylase Carbon Dioxide Molecular Dynamics Simulation Cyanobacteria
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Animals Hemiptera Insect Proteins Phylogeny Insecticides
Fucosyltransferases Drug Repositioning Molecular Docking Simulation Molecular Dynamics Simulation Humans

Classifications MeSH