Evolutionary Analyses of Base-Pairing Interactions in DNA and RNA Secondary Structures.
coevolution
evolution
nucleic acid structure
probabilistic model
Journal
Molecular biology and evolution
ISSN: 1537-1719
Titre abrégé: Mol Biol Evol
Pays: United States
ID NLM: 8501455
Informations de publication
Date de publication:
01 02 2020
01 02 2020
Historique:
pubmed:
31
10
2019
medline:
21
10
2020
entrez:
31
10
2019
Statut:
ppublish
Résumé
Pairs of nucleotides within functional nucleic acid secondary structures often display evidence of coevolution that is consistent with the maintenance of base-pairing. Here, we introduce a sequence evolution model, MESSI (Modeling the Evolution of Secondary Structure Interactions), that infers coevolution associated with base-paired sites in DNA or RNA sequence alignments. MESSI can estimate coevolution while accounting for an unknown secondary structure. MESSI can also use graphics processing unit parallelism to increase computational speed. We used MESSI to infer coevolution associated with GC, AU (AT in DNA), GU (GT in DNA) pairs in noncoding RNA alignments, and in single-stranded RNA and DNA virus alignments. Estimates of GU pair coevolution were found to be higher at base-paired sites in single-stranded RNA viruses and noncoding RNAs than estimates of GT pair coevolution in single-stranded DNA viruses. A potential biophysical explanation is that GT pairs do not stabilize DNA secondary structures to the same extent that GU pairs do in RNA. Additionally, MESSI estimates the degrees of coevolution at individual base-paired sites in an alignment. These estimates were computed for a SHAPE-MaP-determined HIV-1 NL4-3 RNA secondary structure. We found that estimates of coevolution were more strongly correlated with experimentally determined SHAPE-MaP pairing scores than three nonevolutionary measures of base-pairing covariation. To assist researchers in prioritizing substructures with potential functionality, MESSI automatically ranks substructures by degrees of coevolution at base-paired sites within them. Such a ranking was created for an HIV-1 subtype B alignment, revealing an excess of top-ranking substructures that have been previously identified as having structure-related functional importance, among several uncharacterized top-ranking substructures.
Identifiants
pubmed: 31665393
pii: 5610083
doi: 10.1093/molbev/msz243
pmc: PMC6993869
doi:
Substances chimiques
DNA, Viral
0
RNA, Untranslated
0
RNA, Viral
0
RNA
63231-63-0
DNA
9007-49-2
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
576-592Informations de copyright
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Références
BMC Bioinformatics. 2011 Apr 18;12:103
pubmed: 21501497
Bioinformatics. 1999 Jun;15(6):446-54
pubmed: 10383470
Nat Protoc. 2006;1(3):1610-6
pubmed: 17406453
Syst Biol. 2004 Oct;53(5):685-92
pubmed: 15545249
J Gen Virol. 2004 Oct;85(Pt 10):3037-3047
pubmed: 15448367
Nat Methods. 2014 Sep;11(9):959-65
pubmed: 25028896
Virus Evol. 2015 May 26;1(1):vev003
pubmed: 27774277
RNA. 1995 Oct;1(8):807-14
pubmed: 7493326
J Mol Evol. 1994 Sep;39(3):306-14
pubmed: 7932792
J Comput Biol. 2000 Feb-Apr;7(1-2):277-92
pubmed: 10890402
Bioinformatics. 2012 Oct 15;28(20):2691-2
pubmed: 22877864
Nat Struct Mol Biol. 2010 Nov;17(11):1337-42
pubmed: 20953181
Nucleic Acids Res. 2004 Mar 19;32(5):1792-7
pubmed: 15034147
Cell. 1990 Feb 23;60(4):685-93
pubmed: 1689218
RNA. 2002 Jun;8(6):824-41
pubmed: 12088154
Nucleic Acids Res. 2014 Jan;42(Database issue):D7-17
pubmed: 24259429
PLoS Pathog. 2013;9(4):e1003294
pubmed: 23593004
Genetics. 1995 Mar;139(3):1429-39
pubmed: 7768450
EMBO J. 1985 Dec 16;4(13A):3617-23
pubmed: 4092690
J Mol Evol. 1981;17(6):368-76
pubmed: 7288891
Proc Natl Acad Sci U S A. 1991 Nov 15;88(22):10032-6
pubmed: 1946420
Nucleic Acids Res. 1994 Jun 11;22(11):2079-88
pubmed: 8029015
Science. 2007 Mar 16;315(5818):1583-6
pubmed: 17363674
J Virol. 2014 Feb;88(4):1972-89
pubmed: 24284329
J Mol Biol. 1999 May 21;288(5):911-40
pubmed: 10329189
Bioessays. 2003 Oct;25(10):930-9
pubmed: 14505360
PLoS One. 2012;7(9):e44376
pubmed: 22973441
Curr Protoc Bioinformatics. 2009 Jun;Chapter 12:12.2.1-12.2.16
pubmed: 19496057
Nucleic Acids Res. 1980 May 24;8(10):2275-93
pubmed: 6159576
Nat Methods. 2017 Jan;14(1):45-48
pubmed: 27819659
Methods Mol Biol. 2008;453:3-31
pubmed: 18712296
Proc Natl Acad Sci U S A. 2009 Jan 6;106(1):97-102
pubmed: 19109441
RNA. 2004 Sep;10(9):1337-51
pubmed: 15273323
Nucleic Acids Res. 2013 Jan;41(Database issue):D226-32
pubmed: 23125362
Nucleic Acids Res. 1978 Jun;5(6):1955-70
pubmed: 673842
PLoS One. 2010 Mar 10;5(3):e9490
pubmed: 20224823
Bioinformatics. 2009 Aug 1;25(15):1974-5
pubmed: 19398448
J Mol Biol. 2004 Feb 13;336(2):369-79
pubmed: 14757051
EMBO Rep. 2000 Jul;1(1):18-23
pubmed: 11256617
Mol Biol Evol. 1993 Nov;10(6):1396-401
pubmed: 8277861
Algorithms Mol Biol. 2006 Mar 16;1(1):3
pubmed: 16722605
Nature. 1953 Apr 25;171(4356):737-8
pubmed: 13054692
Biochemistry. 1974 Jul 30;13(16):3405-10
pubmed: 4601435
Nature. 2009 Aug 6;460(7256):711-6
pubmed: 19661910
Bioinformatics. 2006 Dec 15;22(24):2988-95
pubmed: 17038338