Assessment of three-dimensional RNA structure prediction in CASP15.

CASP15 conformational ensembles cryogenic electron microscopy deep learning molecular replacement ribonucleic acid structure prediction

Journal

Proteins
ISSN: 1097-0134
Titre abrégé: Proteins
Pays: United States
ID NLM: 8700181

Informations de publication

Date de publication:
Dec 2023
Historique:
revised: 21 08 2023
received: 25 04 2023
accepted: 07 09 2023
medline: 24 11 2023
pubmed: 25 10 2023
entrez: 25 10 2023
Statut: ppublish

Résumé

The prediction of RNA three-dimensional structures remains an unsolved problem. Here, we report assessments of RNA structure predictions in CASP15, the first CASP exercise that involved RNA structure modeling. Forty-two predictor groups submitted models for at least one of twelve RNA-containing targets. These models were evaluated by the RNA-Puzzles organizers and, separately, by a CASP-recruited team using metrics (GDT, lDDT) and approaches (Z-score rankings) initially developed for assessment of proteins and generalized here for RNA assessment. The two assessments independently ranked the same predictor groups as first (AIchemy_RNA2), second (Chen), and third (RNAPolis and GeneSilico, tied); predictions from deep learning approaches were significantly worse than these top ranked groups, which did not use deep learning. Further analyses based on direct comparison of predicted models to cryogenic electron microscopy (cryo-EM) maps and x-ray diffraction data support these rankings. With the exception of two RNA-protein complexes, models submitted by CASP15 groups correctly predicted the global fold of the RNA targets. Comparisons of CASP15 submissions to designed RNA nanostructures as well as molecular replacement trials highlight the potential utility of current RNA modeling approaches for RNA nanotechnology and structural biology, respectively. Nevertheless, challenges remain in modeling fine details such as noncanonical pairs, in ranking among submitted models, and in prediction of multiple structures resolved by cryo-EM or crystallography.

Identifiants

pubmed: 37876231
doi: 10.1002/prot.26602
doi:

Substances chimiques

RNA 63231-63-0
Proteins 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

1747-1770

Subventions

Organisme : NIH HHS
ID : R35 GM122579
Pays : United States
Organisme : NIH HHS
ID : R35 GM122579
Pays : United States
Organisme : Howard Hughes Medical Institute
Pays : United States
Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/S007105/1
Pays : United Kingdom

Commentaires et corrections

Type : UpdateOf

Informations de copyright

© 2023 The Authors. Proteins: Structure, Function, and Bioinformatics published by Wiley Periodicals LLC.

Références

Holley RW, Apgar J, Everett GA, et al. Structure of a ribonucleic acid. Science. 1965;147:1462-1465.
Madison JT, Everett GA, Kung H. Nucleotide sequence of a yeast tyrosine transfer RNA. Science. 1966;153:531-534.
Fuller W, Hodgson A. Conformation of the anticodon loop in tRNA. Nature. 1967;215:817-821.
Levitt M. Detailed molecular model for transfer ribonucleic acid. Nature. 1969;224:759-763.
Hingerty B, Brown RS, Jack A. Further refinement of the structure of yeast tRNAPhe. J Mol Biol. 1978;124:523-534.
Sussman JL, Holbrook SR, Warrant RW, Church GM, Kim SH. Crystal structure of yeast phenylalanine transfer RNA. I. Crystallographic refinement. J Mol Biol. 1978;123:607-630.
Westhof E, Leontis NB. An RNA-centric historical narrative around the protein data bank. J Biol Chem. 2021;296:100555.
Cruz JA, Blanchet M-F, Boniecki M, et al. RNA-puzzles: a CASP-like evaluation of RNA three-dimensional structure prediction. RNA. 2012;18:610-625.
Miao Z, Adamiak RW, Blanchet M-F, et al. RNA-puzzles round II: assessment of RNA structure prediction programs applied to three large RNA structures. RNA. 2015;21:1066-1084.
Miao Z, Adamiak RW, Antczak M, et al. RNA-puzzles round III: 3D RNA structure prediction of five riboswitches and one ribozyme. RNA. 2017;23:655-672.
Miao Z, Adamiak RW, Antczak M, et al. RNA-puzzles round IV: 3D structure predictions of four ribozymes and two aptamers. RNA. 2020;26:982-995.
Das R. RNA structure: a renaissance begins? Nat Methods. 2021;18:439.
Pereira J, Simpkin AJ, Hartmann MD, Rigden DJ, Keegan RM, Lupas AN. High-accuracy protein structure prediction in CASP14. Proteins. 2021;89:1687-1699.
Hogan MJ, Pardi N. mRNA vaccines in the COVID-19 pandemic and beyond. Annu Rev Med. 2022;73:17-39.
Lensink MF, Brysbaert G, Mauri T, et al. Prediction of protein assemblies, the next frontier: the CASP14-CAPRI experiment. Proteins. 2021;89:1800-1823.
Lensink MF, Brysbaert G, Nadzirin N, et al. Blind prediction of homo- and hetero-protein complexes: the CASP13-CAPRI experiment. Proteins. 2019;87:1200-1221.
Lensink MF, Velankar S, Baek M, Heo L, Seok C, Wodak SJ. The challenge of modeling protein assemblies: the CASP12-CAPRI experiment. Proteins. 2018;86(Suppl 1):257-273.
Lensink MF, Velankar S, Kryshtafovych A, et al. Prediction of homoprotein and heteroprotein complexes by protein docking and template-based modeling: a CASP-CAPRI experiment. Proteins. 2016;84(Suppl 1):323-348.
Ozden B, Kryshtafovych A, Karaca E. Assessment of the CASP14 assembly predictions. Proteins. 2021;89:1787-1799.
Kretsch RC, Andersen ES, Bujnicki JM, et al. RNA target highlights in CASP15: Evaluation of predicted models by structure providers. Proteins. 2023;91(12):1600-1615. doi:10.1002/prot.26550
Mulvaney T, Kretsch RC, Elliott L, et al. CASP15 cryo-EM protein and RNA targets: Refinement and analysis using experimental maps. Proteins. 2023;91(12):1935-1951. doi:10.1002/prot.26644
Parisien M, Cruz JA, Westhof E, Major F. New metrics for comparing and assessing discrepancies between RNA 3D structures and models. RNA. 2009;15:1875-1885.
Gendron P, Lemieux S, Major F. Quantitative analysis of nucleic acid three-dimensional structures. J Mol Biol. 2001;308:919-936.
Mathews BW. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim Biophys Acta (BBA) - Protein Struct. 1975;405:442-451.
Gorodkin J, Stricklin SL, Stormo GD. Discovering common stem-loop motifs in unaligned RNA sequences. Nucleic Acids Res. 2001;29:2135-2144.
Davis IW, Leaver-Fay A, Chen VB, et al. MolProbity: all-atom contacts and structure validation for proteins and nucleic acids. Nucleic Acids Res. 2007;35:W375-W383.
Murray LJW, Arendall WB 3rd, Richardson DC, Richardson JS. RNA backbone is rotameric. Proc Natl Acad Sci U S A. 2003;100:13904-13909.
Word JM, Lovell SC, LaBean TH, et al. Visualizing and quantifying molecular goodness-of-fit: small-probe contact dots with explicit hydrogen atoms. J Mol Biol. 1999;285:1711-1733.
Gong S, Zhang C, Zhang Y. RNA-align: quick and accurate alignment of RNA 3D structures based on size-independent TM-scoreRNA. Bioinformatics. 2019;35:4459-4461.
Zok T, Popenda M, Szachniuk M. MCQ4Structures to compute similarity of molecule structures. CEJOR Cent Eur J Oper Res. 2014;22:457-473.
Zhang C, Shine M, Pyle AM, Zhang Y. US-align: universal structure alignments of proteins, nucleic acids, and macromolecular complexes. Nat Methods. 2022;19:1109-1115.
Zemla A. LGA: a method for finding 3D similarities in protein structures. Nucleic Acids Res. 2003;31:3370-3374.
Magnus M, Antczak M, Zok T, et al. RNA-puzzles toolkit: a computational resource of RNA 3D structure benchmark datasets, structure manipulation, and evaluation tools. Nucleic Acids Res. 2020;48:576-588.
Waleń T, Chojnowski G, Gierski P, Bujnicki JM. ClaRNA: a classifier of contacts in RNA 3D structures based on a comparative analysis of various classification schemes. Nucleic Acids Res. 2014;42:e151.
Mariani V, Biasini M, Barbato A, Schwede T. lDDT: a local superposition-free score for comparing protein structures and models using distance difference tests. Bioinformatics. 2013;29:2722-2728.
Biasini M, Mariani V, Haas J, et al. OpenStructure: a flexible software framework for computational structural biology. Bioinformatics. 2010;26:2626-2628.
Liebschner D, Afonine PV, Baker ML, et al. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in phenix. Acta Crystallogr D Struct Biol. 2019;75:861-877.
Kwon S, Won J, Kryshtafovych A, Seok C. Assessment of protein model structure accuracy estimation in CASP14: old and new challenges. Proteins. 2021;89:1940-1948.
Beton JG, Cragnolini T, Kaleel M, Mulvaney T, Sweeney A, Topf M. Integrating model simulation tools and cryo-electron microscopy. Wiley Interdiscip Rev Comput Mol Sci. 2023;13:e1642.
Watkins AM, Rangan R, Das R. Using Rosetta for RNA homology modeling. Methods Enzymol. 2019;623:177-207.
Pettersen EF, Goddard TD, Huang CC, et al. UCSF ChimeraX: structure visualization for researchers, educators, and developers. Protein Sci. 2021;30:70-82.
Cragnolini T, Sahota H, Joseph AP, et al. TEMPy2: a python library with improved 3D electron microscopy density-fitting and validation workflows. Acta Crystallogr D Struct Biol. 2021;77:41-47.
Lagerstedt I, Moore WJ, Patwardhan A, et al. Web-based visualisation and analysis of 3D electron-microscopy data from EMDB and PDB. J Struct Biol. 2013;184:173-181.
Pintilie G, Zhang K, Su Z, Li S, Schmid MF, Chiu W. Measurement of atom resolvability in cryo-EM maps with Q-scores. Nat Methods. 2020;17:328-334.
McCoy AJ, Grosse-Kunstleve RW, Adams PD, Winn MD, Storoni LC, Read RJ. Phaser crystallographic software. J Appl Cryst. 2007;40:658-674.
Winn MD, Ballard CC, Cowtan KD, et al. Overview of the CCP4 suite and current developments. Acta Crystallogr D Biol Crystallogr. 2011;67:235-242.
Krissinel E, Lebedev AA, Uski V, et al. CCP4 cloud for structure determination and project management in macromolecular crystallography. Acta Crystallogr D Struct Biol. 2022;78:1079-1089.
Vagin A, Teplyakov A. Molecular replacement with MOLREP. Acta Crystallogr D Biol Crystallogr. 2010;66:22-25.
Theobald DL, Wuttke DS. THESEUS: maximum likelihood superpositioning and analysis of macromolecular structures. Bioinformatics. 2006;22:2171-2172.
Jumper J, Evans R, Pritzel A, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583-589.
Mirdita M, Schütze K, Moriwaki Y, Heo L, Ovchinnikov S, Steinegger M. ColabFold: making protein folding accessible to all. Nat Methods. 2022;19:679-682.
Simpkin AJ, Elliott LG, Stevenson K, Krissinel E, Rigden D, Keegan RM. Slice'N'Dice: maximising the value of predicted models for structural biologists. bioRxiv. 2022. doi:10.1101/2022.06.30.497974
Pedregosa F, Varoquaux G, Gramfort A, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825-2830.
Salehi-Ashtiani K, Lupták A, Litovchick A, Szostak JW. A genomewide search for ribozymes reveals an HDV-like sequence in the human CPEB3 gene. Science. 2006;313(5794):1788-1792.
Schroeder GM, Cavender CE, Blau ME, Jenkins JL, Mathews DH, Wedekind JE. A small RNA that cooperatively senses two stacked metabolites in one pocket for gene control. Nat Commun. 2022;13(1):199.
Jenkins JL, Krucinska J, McCarty RM, Bandarian V, Wedekind JE. Comparison of a preQ1 riboswitch aptamer in metabolite-bound and free states with implications for gene regulation. J Biol Chem. 2011;286:24626-24637.
Read RJ, Chavali G. Assessment of CASP7 predictions in the high accuracy template-based modeling category. Proteins. 2007;69(Suppl 8):27-37.
Kinch LN, Pei J, Kryshtafovych A, Schaeffer RD, Grishin NV. Topology evaluation of models for difficult targets in the 14th round of the critical assessment of protein structure prediction (CASP14). Proteins. 2021;89:1673-1686.
Haas J, Barbato A, Behringer D, et al. Continuous Automated Model EvaluatiOn (CAMEO) complementing the critical assessment of structure prediction in CASP12. Proteins. 2018;86:387-398.
Robin X, Haas J, Gumienny R, Smolinski A, Tauriello G, Schwede T. Continuous Automated Model EvaluatiOn (CAMEO)-perspectives on the future of fully automated evaluation of structure prediction methods. Proteins. 2021;89:1977-1986.
Przytula-Mally AI, Engilberge S, Johannsen S, Olieric V, Masquida B, Sigel RKO. Anticodon-like loop-mediated dimerization in the crystal structures of HdV-like CPEB3 ribozymes. bioRxiv. 2022. doi:10.1101/2022.09.22.508989
Sampedro Vallina N, McRae EKS, Hansen BK, Boussebayle A, Andersen ES. RNA origami scaffolds facilitate cryo-EM characterization of a Broccoli-Pepper aptamer FRET pair. Nucleic Acids Res. 2023;51(9):4613-4624. doi:10.1093/nar/gkad224
McRae EKS, Rasmussen HØ, Liu J, et al. Structure, folding and flexibility of co-transcriptional RNA origami. Nat Nanotechnol. 2023;1-10:808-817.
Chen J-H, Yajima R, Chadalavada DM, Chase E, Bevilacqua PC, Golden BL. A 1.9 A crystal structure of the HDV ribozyme precleavage suggests both Lewis acid and general acid mechanisms contribute to phosphodiester cleavage. Biochemistry. 2010;49:6508-6518.
Chen K, Zhou Y, Wang S, Xiong P. RNA tertiary structure modeling with BRiQ potential in CASP15. Proteins. 2023;91(12):1771-1778. doi:10.1002/prot.26574
Sarzynska J, Popenda M, Antczak M, Szachniuk M. RNA tertiary structure prediction using RNAComposer in CASP15. Proteins. 2023;91(12):1790-1799. doi:10.1002/prot.26578
Li J, Zhang S, Chen S-J. Advancing RNA 3D structure prediction: Exploring hierarchical and hybrid approaches in CASP15. Proteins. 2023;91(12):1779-1789. doi:10.1002/prot.26583
Baulin EF, Mukherjee S, Moafinejad SN, et al. RNA tertiary structure prediction in CASP15 by the GeneSilico group: Folding simulations based on statistical potentials and spatial restraints. Proteins. 2023;91(12):1800-1810. doi:10.1002/prot.26575
Cragnolini T, Kryshtafovych A, Topf M. Cryo-EM targets in CASP14. Proteins. 2021;89:1949-1958.
Vasishtan D, Topf M. Scoring functions for cryoEM density fitting. J Struct Biol. 2011;174:333-343.
Joseph AP, Lagerstedt I, Patwardhan A, Topf M, Winn M. Improved metrics for comparing structures of macromolecular assemblies determined by 3D electron-microscopy. J Struct Biol. 2017;199:12-26.
Leontis NB, Westhof E. Geometric nomenclature and classification of RNA base pairs. RNA. 2001;7:499-512.
Vallina NS, McRae EKS, Geary C, Andersen ES. An RNA paranemic crossover triangle as a 3D module for cotranscriptional nanoassembly. Small. 2022;19:2204651.
Bonilla SL, Vicens Q, Kieft JS. Cryo-EM reveals an entangled kinetic trap in the folding of a catalytic RNA. Sci Adv. 2022;8:eabq4144.
Li S, Palo MZ, Pintilie G, et al. Topological crossing in the misfolded Tetrahymena ribozyme resolved by cryo-EM. Proc Natl Acad Sci U S A. 2022;119:e2209146119.
Huang L, Wang J, Watkins AM, Das R, Lilley DMJ. Structure and ligand binding of the glutamine-II riboswitch. Nucleic Acids Res. 2019;47:7666-7675.
Murshudov GN, Skubák P, Lebedev AA, et al. REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr D Biol Crystallogr. 2011;67:355-367.
Theobald DL, Wuttke DS. Accurate structural correlations from maximum likelihood superpositions. PLoS Comput Biol. 2008;4:e43.
Bibby J, Keegan RM, Mayans O, Winn MD, Rigden DJ. AMPLE: a cluster-and-truncate approach to solve the crystal structures of small proteins using rapidly computed ab initio models. Acta Crystallogr D Biol Crystallogr. 2012;68:1622-1631.
Oeffner RD, Afonine PV, Millán C, et al. On the application of the expected log-likelihood gain to decision making in molecular replacement. Acta Crystallogr D Struct Biol. 2018;74:245-255.
Shen T, Hu Z, Peng Z, et al. E2Efold-3D: end-to-end deep learning method for accurate de novo RNA 3D structure prediction. 2022. http://arxiv.org/abs/2207.01586
Pearce R, Omenn GS, De Zhang Y. De novo RNA tertiary structure prediction at atomic resolution using geometric potentials from deep learning. bioRxiv. 2022. doi:10.1101/2022.05.15.491755
Strobel EJ, Yu AM, Lucks JB. High-throughput determination of RNA structures. Nat Rev Genet. 2018;19:615-634.
Cossio P, Rohr D, Baruffa F, Rampp M, Lindenstruth V, Hummer G. BioEM: GPU-accelerated computing of Bayesian inference of electron microscopy images. Comput Phys Commun. 2017;210:163-171.
Kryshtafovych A, Antczak M, Szachniuk M, et al. New prediction categories in CASP15. Proteins. 2023;91(12):1550-1557. doi:10.1002/prot.26515

Auteurs

Rhiju Das (R)

Department of Biochemistry, Stanford University School of Medicine, Stanford, California, USA.
Biophysics Program, Stanford University School of Medicine, Stanford, California, USA.
Howard Hughes Medical Institute, Stanford University, Stanford, California, USA.

Rachael C Kretsch (RC)

Biophysics Program, Stanford University School of Medicine, Stanford, California, USA.

Adam J Simpkin (AJ)

Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, Liverpool, UK.

Thomas Mulvaney (T)

Centre for Structural Systems Biology (CSSB), Leibniz-Institut für Virologie (LIV), Hamburg, Germany.
University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany.

Phillip Pham (P)

Department of Biochemistry, Stanford University School of Medicine, Stanford, California, USA.

Ramya Rangan (R)

Biophysics Program, Stanford University School of Medicine, Stanford, California, USA.

Fan Bu (F)

Guangzhou Laboratory, Guangzhou International Bio Island, Guangzhou, China.
Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China.

Ronan M Keegan (RM)

Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, Liverpool, UK.
UKRI-STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot, UK.

Maya Topf (M)

Centre for Structural Systems Biology (CSSB), Leibniz-Institut für Virologie (LIV), Hamburg, Germany.
University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany.

Daniel J Rigden (DJ)

Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, Liverpool, UK.

Zhichao Miao (Z)

GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou National Laboratory, Guangzhou Medical University, Guangzhou, China.
Shanghai Key Laboratory of Anesthesiology and Brain Functional Modulation, Clinical Research Center for Anesthesiology and Perioperative Medicine, Translational Research Institute of Brain and Brain-Like Intelligence, Shanghai Fourth People's Hospital, School of Medicine, Tongji University, Shanghai, China.

Eric Westhof (E)

Architecture et Réactivité de l'ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Université de Strasbourg, Strasbourg, France.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
1.00
Humans Magnetic Resonance Imaging Brain Infant, Newborn Infant, Premature
Humans Colorectal Neoplasms Biomarkers, Tumor Prognosis Gene Expression Regulation, Neoplastic

Classifications MeSH