Widespread selection for extremely high and low levels of secondary structure in coding sequences across all domains of life.


Journal

Open biology
ISSN: 2046-2441
Titre abrégé: Open Biol
Pays: England
ID NLM: 101580419

Informations de publication

Date de publication:
31 05 2019
Historique:
entrez: 30 5 2019
pubmed: 30 5 2019
medline: 28 4 2020
Statut: ppublish

Résumé

Codon composition, GC content and local RNA secondary structures can have a profound effect on gene expression, and mutations affecting these parameters, even though they do not alter the protein sequence, are not neutral in terms of selection. Although evidence exists that, in some cases, selection favours more stable RNA secondary structures, we currently lack a concrete idea of how many genes are affected within a species, and whether this is a universal phenomenon in nature. We searched for signs of structural selection in a global manner, analysing a set of 1 million coding sequences from 73 species representing all domains of life, as well as viruses, by means of our newly developed software PACKEIS. We show that codon composition and amino acid identity are main determinants of RNA secondary structure. In addition, we show that the arrangement of synonymous codons within coding sequences is non-random, yielding extremely high, but also extremely low, RNA structuredness significantly more often than expected by chance. Taken together, we demonstrate that selection for high and low levels of secondary structure is a widespread phenomenon. Our results provide another line of evidence that synonymous mutations are less neutral than commonly thought, which is of importance for many evolutionary models.

Identifiants

pubmed: 31138098
doi: 10.1098/rsob.190020
pmc: PMC6544989
doi:

Substances chimiques

RNA 63231-63-0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

190020

Références

Nucleic Acids Res. 1987 Oct 12;15(19):8023-40
pubmed: 3118331
Nucleic Acids Res. 1982 Nov 25;10(22):7055-74
pubmed: 6760125
Am J Trop Med Hyg. 2018 Jun;98(6):1563-1565
pubmed: 29557341
Proteomics. 2015 Sep;15(18):3163-8
pubmed: 25656970
Proc Natl Acad Sci U S A. 1999 Apr 13;96(8):4482-7
pubmed: 10200288
J Biol Chem. 1982 Mar 25;257(6):3026-31
pubmed: 7037777
RNA. 2007 Oct;13(10):1631-40
pubmed: 17684233
J Mol Evol. 1997 Nov;45(5):514-23
pubmed: 9342399
J Mol Biol. 1999 May 21;288(5):911-40
pubmed: 10329189
RNA. 2019 Jan;25(1):60-69
pubmed: 30327333
Genome Res. 2003 Sep;13(9):2042-51
pubmed: 12952875
Trends Genet. 2000 Jul;16(7):287-9
pubmed: 10858656
J Gen Virol. 2018 Dec;99(12):1551-1562
pubmed: 30372405
Genetics. 2001 Oct;159(2):623-33
pubmed: 11606539
Nucleic Acids Res. 2018 Jan 4;46(D1):D754-D761
pubmed: 29155950
Nat Commun. 2018 Oct 18;9(1):4328
pubmed: 30337527
PLoS Genet. 2006 Nov 3;2(11):e176
pubmed: 17083275
PLoS Biol. 2006 Jun;4(6):e180
pubmed: 16700628
Proc Natl Acad Sci U S A. 2010 Feb 23;107(8):3645-50
pubmed: 20133581
Algorithms Mol Biol. 2011 Nov 24;6:26
pubmed: 22115189
RNA. 2017 Sep;23(9):1352-1364
pubmed: 28630141
J Mol Evol. 2001 Oct-Nov;53(4-5):290-8
pubmed: 11675589
Curr Opin Virol. 2014 Aug;7:19-28
pubmed: 24732439
Biochim Biophys Acta. 2009 Sep-Oct;1789(9-10):495-517
pubmed: 19781674
Nat Plants. 2018 Mar;4(3):136-137
pubmed: 29497160
Genome Biol. 2005;6(9):R75
pubmed: 16168082
Philos Trans R Soc Lond B Biol Sci. 1995 Sep 29;349(1329):241-7
pubmed: 8577834
Curr Opin Virol. 2011 Dec;1(6):533-44
pubmed: 22180767
Science. 2009 Apr 10;324(5924):255-8
pubmed: 19359587
RNA. 2015 May;21(5):911-22
pubmed: 25802409
Immunity. 2017 Jun 20;46(6):992-1004.e5
pubmed: 28636969
Plant J. 2015 Nov;84(4):704-17
pubmed: 26402748
Proc Natl Acad Sci U S A. 2016 Mar 8;113(10):E1362-71
pubmed: 26903634
RNA. 2006 May;12(5):851-61
pubmed: 16540693
Nature. 1977 May 19;267(5608):275-6
pubmed: 865622
Curr Top Microbiol Immunol. 2013;371:153-81
pubmed: 23686235
Mol Biol Evol. 2009 Feb;26(2):451-61
pubmed: 19033257
Cell. 2007 Aug 10;130(3):413-26
pubmed: 17693253
Bioinformatics. 2019 Feb 15;35(4):579-583
pubmed: 30101307
Nucleic Acids Res. 2014 Apr;42(8):4813-22
pubmed: 24561808
Mol Biol Evol. 1988 Nov;5(6):704-16
pubmed: 3146682
Nucleic Acids Res. 2016 Jan 4;44(D1):D746-52
pubmed: 26481351

Auteurs

Daniel Gebert (D)

Institute of Organismic and Molecular Evolution iOME, Anthropology, Johannes Gutenberg University Mainz , Anselm-Franz-von-Bentzel-Weg 7, 55099 Mainz , Germany.

Julia Jehn (J)

Institute of Organismic and Molecular Evolution iOME, Anthropology, Johannes Gutenberg University Mainz , Anselm-Franz-von-Bentzel-Weg 7, 55099 Mainz , Germany.

David Rosenkranz (D)

Institute of Organismic and Molecular Evolution iOME, Anthropology, Johannes Gutenberg University Mainz , Anselm-Franz-von-Bentzel-Weg 7, 55099 Mainz , Germany.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Cephalometry Humans Anatomic Landmarks Software Internet

Classifications MeSH