RNA 3D structure prediction guided by independent folding of homologous sequences.


Journal

BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194

Informations de publication

Date de publication:
22 Oct 2019
Historique:
received: 28 05 2019
accepted: 01 10 2019
entrez: 24 10 2019
pubmed: 24 10 2019
medline: 20 11 2019
Statut: epublish

Résumé

The understanding of the importance of RNA has dramatically changed over recent years. As in the case of proteins, the function of an RNA molecule is encoded in its tertiary structure, which in turn is determined by the molecule's sequence. The prediction of tertiary structures of complex RNAs is still a challenging task. Using the observation that RNA sequences from the same RNA family fold into conserved structure, we test herein whether parallel modeling of RNA homologs can improve ab initio RNA structure prediction. EvoClustRNA is a multi-step modeling process, in which homologous sequences for the target sequence are selected using the Rfam database. Subsequently, independent folding simulations using Rosetta FARFAR and SimRNA are carried out. The model of the target sequence is selected based on the most common structural arrangement of the common helical fragments. As a test, on two blind RNA-Puzzles challenges, EvoClustRNA predictions ranked as the first of all submissions for the L-glutamine riboswitch and as the second for the ZMP riboswitch. Moreover, through a benchmark of known structures, we discovered several cases in which particular homologs were unusually amenable to structure recovery in folding simulations compared to the single original target sequence. This work, for the first time to our knowledge, demonstrates the importance of the selection of the target sequence from an alignment of an RNA family for the success of RNA 3D structure prediction. These observations prompt investigations into a new direction of research for checking 3D structure "foldability" or "predictability" of related RNA sequences to obtain accurate predictions. To support new research in this area, we provide all relevant scripts in a documented and ready-to-use form. By exploring new ideas and identifying limitations of the current RNA 3D structure prediction methods, this work is bringing us closer to the near-native computational RNA 3D models.

Sections du résumé

BACKGROUND BACKGROUND
The understanding of the importance of RNA has dramatically changed over recent years. As in the case of proteins, the function of an RNA molecule is encoded in its tertiary structure, which in turn is determined by the molecule's sequence. The prediction of tertiary structures of complex RNAs is still a challenging task.
RESULTS RESULTS
Using the observation that RNA sequences from the same RNA family fold into conserved structure, we test herein whether parallel modeling of RNA homologs can improve ab initio RNA structure prediction. EvoClustRNA is a multi-step modeling process, in which homologous sequences for the target sequence are selected using the Rfam database. Subsequently, independent folding simulations using Rosetta FARFAR and SimRNA are carried out. The model of the target sequence is selected based on the most common structural arrangement of the common helical fragments. As a test, on two blind RNA-Puzzles challenges, EvoClustRNA predictions ranked as the first of all submissions for the L-glutamine riboswitch and as the second for the ZMP riboswitch. Moreover, through a benchmark of known structures, we discovered several cases in which particular homologs were unusually amenable to structure recovery in folding simulations compared to the single original target sequence.
CONCLUSION CONCLUSIONS
This work, for the first time to our knowledge, demonstrates the importance of the selection of the target sequence from an alignment of an RNA family for the success of RNA 3D structure prediction. These observations prompt investigations into a new direction of research for checking 3D structure "foldability" or "predictability" of related RNA sequences to obtain accurate predictions. To support new research in this area, we provide all relevant scripts in a documented and ready-to-use form. By exploring new ideas and identifying limitations of the current RNA 3D structure prediction methods, this work is bringing us closer to the near-native computational RNA 3D models.

Identifiants

pubmed: 31640563
doi: 10.1186/s12859-019-3120-y
pii: 10.1186/s12859-019-3120-y
pmc: PMC6806525
doi:

Substances chimiques

Riboswitch 0
RNA 63231-63-0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

512

Subventions

Organisme : NIGMS NIH HHS
ID : R35 GM122579
Pays : United States

Références

EMBO J. 2014 Nov 18;33(22):2692-703
pubmed: 25271255
RNA. 2010 Jul;16(7):1340-9
pubmed: 20498460
Nat Methods. 2010 Apr;7(4):291-4
pubmed: 20190761
Nucleic Acids Res. 2015 Jan;43(Database issue):D130-7
pubmed: 25392425
RNA. 2015 Apr;21(4):486-7
pubmed: 25780106
Methods. 2019 Jun 1;162-163:68-73
pubmed: 31028927
Proteins. 1999 Jan 1;34(1):82-95
pubmed: 10336385
Nature. 2016 Sep 14;537(7620):339-46
pubmed: 27629640
Methods Mol Biol. 2009;540:115-28
pubmed: 19381556
Nucleic Acids Res. 2013 Apr;41(7):4307-23
pubmed: 23435231
BMC Bioinformatics. 2011 Aug 18;12:348
pubmed: 21851628
Bioinformatics. 2009 Jun 1;25(11):1422-3
pubmed: 19304878
Proc Natl Acad Sci U S A. 2011 May 10;108(19):7757-62
pubmed: 21518891
Nucleic Acids Res. 2018 Jan 4;46(D1):D202-D205
pubmed: 29069520
Mol Cell. 2015 Jan 22;57(2):317-28
pubmed: 25616067
Protein Sci. 2018 Jan;27(1):259-268
pubmed: 28960691
Bioinformatics. 2004 Dec 12;20(18):3702-4
pubmed: 15284097
RNA. 2017 May;23(5):655-672
pubmed: 28138060
Curr Protoc Nucleic Acid Chem. 2001 May;Chapter 7:Unit 7.6
pubmed: 18428874
Nucleic Acids Res. 2016 Apr 20;44(7):e63
pubmed: 26687716
Chem Biol. 2004 Dec;11(12):1729-41
pubmed: 15610857
Nucleic Acids Res. 2000 Jan 1;28(1):235-42
pubmed: 10592235
Chembiochem. 2003 Oct 6;4(10):936-62
pubmed: 14523911
Chem Biol. 2015 Jul 23;22(7):829-37
pubmed: 26144884
Chem Biol. 2014 Feb 20;21(2):205-16
pubmed: 24388757
Nat Rev Genet. 2016 Oct 14;17(11):679-692
pubmed: 27739534
Nat Chem Biol. 2014 Sep;10(9):787-92
pubmed: 25086507
Nucleic Acids Res. 2015 Dec 2;43(21):10444-55
pubmed: 26420827
Proteins. 1999;Suppl 3:171-6
pubmed: 10526365
Nucleic Acids Res. 2008 Nov;36(20):6355-62
pubmed: 18836192
RNA. 2012 Apr;18(4):610-25
pubmed: 22361291
Bioinformatics. 2009 May 1;25(9):1189-91
pubmed: 19151095
Genome Biol. 2010;11(3):108
pubmed: 20236470
Cell Rep. 2015 Dec 1;13(9):1800-13
pubmed: 26655897
Methods Enzymol. 2015;553:35-64
pubmed: 25726460
Cell. 2018 Jan 25;172(3):393-407
pubmed: 29373828
Nature. 2006 Jun 29;441(7097):1167-71
pubmed: 16728979
Methods Enzymol. 2009;469:119-39
pubmed: 20946787
Nucleic Acids Res. 2015 Jul 1;43(W1):W425-30
pubmed: 25977296
Nucleic Acids Res. 2014 Oct 29;42(19):e151
pubmed: 25159614
Proteins. 2001 Apr 1;43(1):1-11
pubmed: 11170209
Cell. 2014 Mar 27;157(1):77-94
pubmed: 24679528
Nucleic Acids Res. 2016 Jul 8;44(W1):W315-9
pubmed: 27095203
Nat Chem Biol. 2015 Aug;11(8):606-10
pubmed: 26167874
Nat Chem Biol. 2016 Sep;12(9):702-8
pubmed: 27398999
Nat Chem Biol. 2014 Sep;10(9):780-6
pubmed: 25086509
RNA. 2015 Jun;21(6):1066-84
pubmed: 25883046
RNA. 2000 Oct;6(10):1347-55
pubmed: 11073212
Cell. 2016 May 5;165(4):963-75
pubmed: 27087444

Auteurs

Marcin Magnus (M)

Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland. m.magnus@cent.uw.edu.pl.

Kalli Kappel (K)

Biophysics Program, Stanford University, Stanford, CA, USA.

Rhiju Das (R)

Biophysics Program, Stanford University, Stanford, CA, USA. rhiju@stanford.edu.
Department of Biochemistry, Stanford University, Stanford, CA, USA. rhiju@stanford.edu.
Department of Physics, Stanford University, Stanford, CA, USA. rhiju@stanford.edu.

Janusz M Bujnicki (JM)

Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland.
Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Poznan, Poland.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
1.00
Humans Magnetic Resonance Imaging Brain Infant, Newborn Infant, Premature
alpha-Synuclein Humans Animals Mice Lewy Body Disease

Classifications MeSH