Cryo-EM model validation recommendations based on outcomes of the 2019 EMDataResource challenge.


Journal

Nature methods
ISSN: 1548-7105
Titre abrégé: Nat Methods
Pays: United States
ID NLM: 101215604

Informations de publication

Date de publication:
02 2021
Historique:
received: 11 06 2020
accepted: 21 12 2020
pubmed: 6 2 2021
medline: 1 4 2021
entrez: 5 2 2021
Statut: ppublish

Résumé

This paper describes outcomes of the 2019 Cryo-EM Model Challenge. The goals were to (1) assess the quality of models that can be produced from cryogenic electron microscopy (cryo-EM) maps using current modeling software, (2) evaluate reproducibility of modeling results from different software developers and users and (3) compare performance of current metrics used for model evaluation, particularly Fit-to-Map metrics, with focus on near-atomic resolution. Our findings demonstrate the relatively high accuracy and reproducibility of cryo-EM models derived by 13 participating teams from four benchmark maps, including three forming a resolution series (1.8 to 3.1 Å). The results permit specific recommendations to be made about validating near-atomic cryo-EM structures both in the context of individual experiments and structure data archives such as the Protein Data Bank. We recommend the adoption of multiple scoring parameters to provide full and objective annotation and assessment of the model, reflective of the observed cryo-EM map density.

Identifiants

pubmed: 33542514
doi: 10.1038/s41592-020-01051-w
pii: 10.1038/s41592-020-01051-w
pmc: PMC7864804
mid: EMS114920
doi:

Substances chimiques

Proteins 0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S. Validation Study

Langues

eng

Sous-ensembles de citation

IM

Pagination

156-164

Subventions

Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/P000975/1
Pays : United Kingdom
Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/S005099/1
Pays : United Kingdom
Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/L006383/1
Pays : United Kingdom
Organisme : NIGMS NIH HHS
ID : R01 GM079429
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM095583
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM123055
Pays : United States
Organisme : NIAID NIH HHS
ID : R37 AI036040
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM123159
Pays : United States
Organisme : Wellcome Trust
Pays : United Kingdom
Organisme : NIGMS NIH HHS
ID : R01 GM133840
Pays : United States
Organisme : NIGMS NIH HHS
ID : R35 GM131883
Pays : United States
Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/P000517/1
Pays : United Kingdom
Organisme : Wellcome Trust
ID : 208398
Pays : United Kingdom
Organisme : NIGMS NIH HHS
ID : P01 GM063210
Pays : United States
Organisme : Medical Research Council
ID : MR/N009614/1
Pays : United Kingdom

Commentaires et corrections

Type : CommentIn

Références

Mitra, A. K. Visualization of biological macromolecules at near-atomic resolution: cryo-electron microscopy comes of age. Acta Cryst. F 75, 3–11 (2019).
doi: 10.1107/S2053230X18015133
Lawson, C. L., Berman, H. M. & Chiu, W. Evolving data standards for cryo-EM structures. Struct. Dyn. 7, 014701 (2020).
pubmed: 32002441 pmcid: 6980868 doi: 10.1063/1.5138589
Henderson, R. et al. Outcome of the first electron microscopy validation task force meeting. Structure 20, 205–214 (2012).
pubmed: 22325770 doi: 10.1016/j.str.2011.12.014
Read, R. J. et al. A new generation of crystallographic validation tools for the Protein Data Bank. Structure 19, 1395–1412 (2011).
pubmed: 22000512 pmcid: 3195755 doi: 10.1016/j.str.2011.08.006
Montelione, G. T. et al. Recommendations of the wwPDB NMR Validation Task Force. Structure 21, 1563–1570 (2013).
pubmed: 24010715 doi: 10.1016/j.str.2013.07.021
Lawson, C. L. & Chiu, W. Comparing cryo-EM structures. J. Struct. Biol. 204, 523–526 (2018).
pubmed: 30321594 pmcid: 6464812 doi: 10.1016/j.jsb.2018.10.004
wwPDB Consortium. Protein Data Bank: the single global archive for 3D macromolecular structure data. Nucleic Acids Res. 47, D520–D528 (2019).
doi: 10.1093/nar/gky949
Pintilie, G. et al. Measurement of atom resolvability in cryo-EM maps with Q-scores. Nat. Methods 17, 328–334 (2020).
pubmed: 32042190 pmcid: 7446556 doi: 10.1038/s41592-020-0731-1
Herzik, M. A. Jr, Wu, M. & Lander, G. C. High-resolution structure determination of sub-100 kDa complexes using conventional cryo-EM. Nat. Commun. 10, 1032 (2019).
pubmed: 30833564 pmcid: 6399227 doi: 10.1038/s41467-019-08991-8
Masuda, T., Goto, F., Yoshihara, T. & Mikami, B. The universal mechanism for iron translocation to the ferroxidase site in ferritin, which is mediated by the well conserved transit site. Biochem. Biophys. Res. Commun. 400, 94–99 (2010).
pubmed: 20705053 doi: 10.1016/j.bbrc.2010.08.017
Kryshtafovych, A., Adams, P. D., Lawson, C. L. & Chiu, W. Evaluation system and web infrastructure for the second cryo-EM Model Challenge. J. Struct. Biol. 204, 96–108 (2018).
pubmed: 30017700 pmcid: 6205695 doi: 10.1016/j.jsb.2018.07.006
Rosenthal, P. B. & Henderson, R. Optimal determination of particle orientation, absolute hand, and contrast loss in single-particle electron cryomicroscopy. J. Mol. Biol. 333, 721–745 (2003).
pubmed: 14568533 doi: 10.1016/j.jmb.2003.07.013
Brown, A. et al. Tools for macromolecular model building and refinement into electron cryo-microscopy reconstructions. Acta Cryst. D 71, 136–153 (2015).
doi: 10.1107/S1399004714021683
Lagerstedt, I. et al. Web-based visualisation and analysis of 3D electron-microscopy data from EMDB and PDB. J. Struct. Biol. 184, 173–181 (2013).
pubmed: 24113529 pmcid: 3898923 doi: 10.1016/j.jsb.2013.09.021
Barad, B. A. et al. EMRinger: side chain-directed model and map validation for 3D cryo-electron microscopy. Nat. Methods 12, 943–946 (2015).
pubmed: 26280328 pmcid: 4589481 doi: 10.1038/nmeth.3541
Vasishtan, D. & Topf, M. Scoring functions for cryoEM density fitting. J. Struct. Biol. 174, 333–343 (2011).
pubmed: 21296161 doi: 10.1016/j.jsb.2011.01.012
Farabella, I. et al. TEMPy: a Python library for assessment of three-dimensional electron microscopy density fits. J. Appl. Crystallogr. 48, 1314–1323 (2015).
pubmed: 26306092 pmcid: 4520291 doi: 10.1107/S1600576715010092
Joseph, A. P., Lagerstedt, I., Patwardhan, A., Topf, M. & Winn, M. Improved metrics for comparing structures of macromolecular assemblies determined by 3D electron-microscopy. J. Struct. Biol. 199, 12–26 (2017).
pubmed: 28552721 pmcid: 5479444 doi: 10.1016/j.jsb.2017.05.007
Afonine, P. V. et al. New tools for the analysis and validation of cryo-EM maps and atomic models. Acta Cryst. D 74, 814–840 (2018).
doi: 10.1107/S2059798318009324
Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Cryst. D 66, 12–21 (2010).
doi: 10.1107/S0907444909042073
Liebschner, D. et al. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix. Acta Cryst. D 75, 861–877 (2019).
doi: 10.1107/S2059798319011471
Williams, C. J. et al. MolProbity: more and better reference data for improved all-atom structure validation. Protein Sci. 27, 293–315 (2018).
pubmed: 29067766 doi: 10.1002/pro.3330
Zemla, A. LGA: a method for finding 3D similarities in protein structures. Nucleic Acids Res. 31, 3370–3374 (2003).
pubmed: 12824330 pmcid: 168977 doi: 10.1093/nar/gkg571
Mariani, V., Biasini, M., Barbato, A. & Schwede, T. LDDT: a local superposition-free score for comparing protein structures and models using distance difference tests. Bioinformatics 29, 2722–2728 (2013).
pubmed: 23986568 pmcid: 3799472 doi: 10.1093/bioinformatics/btt473
Bertoni, M., Kiefer, F., Biasini, M., Bordoli, L. & Schwede, T. Modeling protein quaternary structure of homo- and hetero-oligomers beyond binary interactions by homology. Sci. Rep. 7, 10480 (2017).
pubmed: 28874689 pmcid: 5585393 doi: 10.1038/s41598-017-09654-8
Olechnovic, K., Kulberkyte, E. & Venclovas, C. CAD-score: a new contact area difference-based function for evaluation of protein structural models. Proteins 81, 149–162 (2013).
pubmed: 22933340 doi: 10.1002/prot.24172
Kryshtafovych, A., Monastyrskyy, B. & Fidelis, K. CASP prediction center infrastructure and evaluation measures in CASP10 and CASP ROLL. Proteins 82, 7–13 (2014).
pubmed: 24038551 doi: 10.1002/prot.24399
Prisant, M. G., Williams, C. J., Chen, V. B., Richardson, J. S. & Richardson, D. C. New tools in MolProbity validation: CaBLAM for CryoEM backbone, UnDowser to rethink ‘waters,’ and NGL Viewer to recapture online 3D graphics. Protein Sci. 29, 315–329 (2020).
pubmed: 31724275 doi: 10.1002/pro.3786
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Cryst. D 66, 486–501 (2010).
doi: 10.1107/S0907444910007493
Headd, J. J. et al. Use of knowledge-based restraints in phenix.refine to improve macromolecular refinement at low resolution. Acta Cryst. D 68, 381–390 (2012).
doi: 10.1107/S0907444911047834
Kryshtafovych, A. et al. Cryo-electron microscopy targets in CASP13: overview and evaluation of results. Proteins 87, 1128–1140 (2019).
pubmed: 31576602 pmcid: 7197460 doi: 10.1002/prot.25817
Gore, S. et al. Validation of structures in the Protein Data Bank. Structure 25, 1916–1927 (2017).
pubmed: 29174494 pmcid: 5718880 doi: 10.1016/j.str.2017.10.009
DiMaio, F., Zhang, J., Chiu, W. & Baker, D. Cryo-EM model validation using independent map reconstructions. Protein Sci. 22, 865–868 (2013).
pubmed: 23592445 pmcid: 3690725 doi: 10.1002/pro.2267
Pintilie, G., Chen, D. H., Haase-Pettingell, C. A., King, J. A. & Chiu, W. Resolution and probabilistic models of components in cryoEM maps of mature P22 bacteriophage. Biophys. J. 110, 827–839 (2016).
pubmed: 26743049 doi: 10.1016/j.bpj.2015.11.3522
Hryc, C. F. et al. Accurate model annotation of a near-atomic resolution cryo-EM map. Proc. Natl Acad. Sci. USA 114, 3103–3108 (2017).
pubmed: 28270620 pmcid: 5373346 doi: 10.1073/pnas.1621152114
Terwilliger, T. C., Sobolev, O. V., Afonine, P. V. & Adams, P. D. Automated map sharpening by maximization of detail and connectivity. Acta Cryst. D 74, 545–559 (2018).
doi: 10.1107/S2059798318004655
Hoh, S., Burnley, T. & Cowtan, K. Current approaches for automated model building into cryo-EM maps using Buccaneer with CCP-EM. Acta Cryst. D 76, 531–541 (2020).
doi: 10.1107/S2059798320005513
Pettersen, E. F. et al. UCSF Chimera–a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
pubmed: 15264254 doi: 10.1002/jcc.20084
Igaev, M., Kutzner, C., Bock, L. V., Vaiana, A. C. & Grubmuller, H. Automated cryo-EM structure refinement using correlation-driven molecular dynamics. eLife 8, https://doi.org/10.7554/eLife.43542 (2019).
Frenz, B., Walls, A. C., Egelman, E. H., Veesler, D. & DiMaio, F. RosettaES: a sampling strategy enabling automated interpretation of difficult cryo-EM maps. Nat. Methods 14, 797–800 (2017).
pubmed: 28628127 pmcid: 6009829 doi: 10.1038/nmeth.4340
Chojnowski, G., Pereira, J. & Lamzin, V. S. Sequence assignment for low-resolution modelling of protein crystal structures. Acta Cryst. D 75, 753–763 (2019).
doi: 10.1107/S2059798319009392
Burnley, T., Palmer, C. M. & Winn, M. Recent developments in the CCP-EM software suite. Acta Cryst. D 73, 469–477 (2017).
doi: 10.1107/S2059798317007859
Wang, Z. & Schröder, G. F. Real-space refinement with DireX: from global fitting to side-chain improvements. Biopolymers 97, 687–697 (2012).
pubmed: 22696405 doi: 10.1002/bip.22046
Trabuco, L. G., Villa, E., Mitra, K., Frank, J. & Schulten, K. Flexible fitting of atomic structures into electron microscopy maps using molecular dynamics. Structure 16, 673–683 (2008).
pubmed: 18462672 pmcid: 2430731 doi: 10.1016/j.str.2008.03.005
Terashi, G. & Kihara, D. De novo main-chain modeling for EM maps using MAINMAST. Nat. Commun. 9, 1618 (2018).
pubmed: 29691408 pmcid: 5915429 doi: 10.1038/s41467-018-04053-7
Si, D. et al. Deep learning to predict protein backbone structure from high-resolution cryo-EM density maps. Sci. Rep. 10, 4282 (2020).
pubmed: 32152330 pmcid: 7063051 doi: 10.1038/s41598-020-60598-y
Singharoy, A. et al. Molecular dynamics-based refinement and validation for sub-5 A cryo-electron microscopy maps. eLife 5, https://doi.org/10.7554/eLife.16105 (2016).
MacCallum, J. L., Perez, A. & Dill, K. A. Determining protein structures by combining semireliable data with atomistic physical models by Bayesian inference. Proc. Natl Acad. Sci. USA 112, 6985–6990 (2015).
pubmed: 26038552 pmcid: 4460504 doi: 10.1073/pnas.1506788112
Chen, M. & Baker, M. L. Automation and assessment of de novo modeling with Pathwalking in near atomic resolution cryoEM density maps. J. Struct. Biol. 204, 555–563 (2018).
pubmed: 30237066 doi: 10.1016/j.jsb.2018.09.005
McDonald, I. K. & Thornton, J. M. Satisfying hydrogen bonding potential in proteins. J. Mol. Biol. 238, 777–793 (1994).
pubmed: 8182748 doi: 10.1006/jmbi.1994.1334
Yang, H. et al. Automated and accurate deposition of structures solved by X-ray diffraction to the Protein Data Bank. Acta Cryst. D 60, 1833–1839 (2004).
doi: 10.1107/S0907444904019419
Brünger, A. T. Version 1.2 of the crystallography and NMR system. Nat. Protoc. 2, 2728–2733 (2007).
pubmed: 18007608 doi: 10.1038/nprot.2007.406
Hsin, J., Arkhipov, A., Yin, Y., Stone, J. E. & Schulten, K. Using VMD: an introductory tutorial. Curr. Protoc. Bioinformatics 24, https://doi.org/10.1002/0471250953.bi0507s24 (2008).
Lawson, C. L. et al. 2019 EMDataresource model metrics challenge dataset. Zenodo https://doi.org/10.5281/zenodo.4148789 (2020).
Chen, V. B., Davis, I. W. & Richardson, D. C. KING (Kinemage, Next Generation): a versatile interactive molecular and scientific visualization program. Protein Sci. 18, 2403–2409 (2009).
pubmed: 19768809 pmcid: 2788294 doi: 10.1002/pro.250

Auteurs

Catherine L Lawson (CL)

Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA. cathy.lawson@rutgers.edu.

Andriy Kryshtafovych (A)

Genome Center, University of California, Davis, CA, USA.

Paul D Adams (PD)

Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
Department of Bioengineering, University of California Berkeley, Berkeley, CA, USA.

Pavel V Afonine (PV)

Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.

Matthew L Baker (ML)

Department of Biochemistry and Molecular Biology, The University of Texas Health Science Center at Houston, Houston, TX, USA.

Benjamin A Barad (BA)

Department of Integrated Computational Structural Biology, The Scripps Research Institute, La Jolla, CA, USA.

Paul Bond (P)

York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK.

Tom Burnley (T)

Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK.

Renzhi Cao (R)

Department of Computer Science, Pacific Lutheran University, Tacoma, WA, USA.

Jianlin Cheng (J)

Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA.

Grzegorz Chojnowski (G)

European Molecular Biology Laboratory, c/o DESY, Hamburg, Germany.

Kevin Cowtan (K)

York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK.

Ken A Dill (KA)

Laufer Center, Stony Brook University, Stony Brook, NY, USA.

Frank DiMaio (F)

Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA, USA.

Daniel P Farrell (DP)

Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA, USA.

James S Fraser (JS)

Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA.

Mark A Herzik (MA)

Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA, USA.

Soon Wen Hoh (SW)

York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK.

Jie Hou (J)

Department of Computer Science, Saint Louis University, St. Louis, MO, USA.

Li-Wei Hung (LW)

Los Alamos National Laboratory, Los Alamos, NM, USA.

Maxim Igaev (M)

Theoretical and Computational Biophysics, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany.

Agnel P Joseph (AP)

Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK.

Daisuke Kihara (D)

Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.
Department of Computer Science, Purdue University, West Lafayette, IN, USA.

Dilip Kumar (D)

Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX, USA.

Sumit Mittal (S)

Biodesign Institute, Arizona State University, Tempe, AZ, USA.
School of Advanced Sciences and Languages, VIT Bhopal University, Bhopal, India.

Bohdan Monastyrskyy (B)

Genome Center, University of California, Davis, CA, USA.

Mateusz Olek (M)

York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK.

Colin M Palmer (CM)

Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK.

Ardan Patwardhan (A)

The European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, UK.

Alberto Perez (A)

Department of Chemistry, University of Florida, Gainesville, FL, USA.

Jonas Pfab (J)

Division of Computing & Software Systems, University of Washington, Bothell, WA, USA.

Grigore D Pintilie (GD)

Department of Bioengineering, Stanford University, Stanford, CA, USA.

Jane S Richardson (JS)

Department of Biochemistry, Duke University, Durham, NC, USA.

Peter B Rosenthal (PB)

Structural Biology of Cells and Viruses Laboratory, Francis Crick Institute, London, UK.

Daipayan Sarkar (D)

Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.
Biodesign Institute, Arizona State University, Tempe, AZ, USA.

Luisa U Schäfer (LU)

Institute of Biological Information Processing (IBI-7: Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany.

Michael F Schmid (MF)

Division of CryoEM and Biomaging, SSRL, SLAC National Accelerator Laboratory, Stanford University, Menlo Park, CA, USA.

Gunnar F Schröder (GF)

Institute of Biological Information Processing (IBI-7: Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany.
Physics Department, Heinrich Heine University Düsseldorf, Düsseldorf, Germany.

Mrinal Shekhar (M)

Biodesign Institute, Arizona State University, Tempe, AZ, USA.
Center for Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Dong Si (D)

Division of Computing & Software Systems, University of Washington, Bothell, WA, USA.

Abishek Singharoy (A)

Biodesign Institute, Arizona State University, Tempe, AZ, USA.

Genki Terashi (G)

Theoretical and Computational Biophysics, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany.

Thomas C Terwilliger (TC)

New Mexico Consortium, Los Alamos, NM, USA.

Andrea Vaiana (A)

Theoretical and Computational Biophysics, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany.

Liguo Wang (L)

Department of Biological Structure, University of Washington, Seattle, WA, USA.

Zhe Wang (Z)

The European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, UK.

Stephanie A Wankowicz (SA)

Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA.
Biophysics Graduate Program, University of California, San Francisco, CA, USA.

Christopher J Williams (CJ)

Department of Biochemistry, Duke University, Durham, NC, USA.

Martyn Winn (M)

Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK.

Tianqi Wu (T)

Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA.

Xiaodi Yu (X)

SMPS, Janssen Research and Development, Spring House, PA, USA.

Kaiming Zhang (K)

Department of Bioengineering, Stanford University, Stanford, CA, USA.

Helen M Berman (HM)

Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA.
Department of Biological Sciences and Bridge Institute, University of Southern California, Los Angeles, CA, USA.

Wah Chiu (W)

Department of Bioengineering, Stanford University, Stanford, CA, USA. wahc@stanford.edu.
Division of CryoEM and Biomaging, SSRL, SLAC National Accelerator Laboratory, Stanford University, Menlo Park, CA, USA. wahc@stanford.edu.

Articles similaires

Photosynthesis Ribulose-Bisphosphate Carboxylase Carbon Dioxide Molecular Dynamics Simulation Cyanobacteria
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
alpha-Synuclein Humans Animals Mice Lewy Body Disease
Cryoelectron Microscopy Algorithms Image Processing, Computer-Assisted Consensus Software

Classifications MeSH