Long first exons and epigenetic marks distinguish conserved pachytene piRNA clusters from other mammalian genes.
5-Methylcytosine
/ analogs & derivatives
Acetylation
Animals
DNA Methylation
/ genetics
DNA-Binding Proteins
/ metabolism
Epigenesis, Genetic
Evolution, Molecular
Exons
/ genetics
Histones
/ metabolism
Introns
/ genetics
Male
Mammals
/ genetics
Mice, Inbred C57BL
Nuclear Proteins
/ metabolism
Organ Specificity
/ genetics
Pachytene Stage
/ genetics
Promoter Regions, Genetic
/ genetics
RNA Splicing
/ genetics
RNA, Messenger
/ genetics
RNA, Small Interfering
/ metabolism
RNA-Binding Proteins
/ metabolism
Signal Transduction
/ genetics
Testis
/ metabolism
Transcription, Genetic
Journal
Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555
Informations de publication
Date de publication:
04 01 2021
04 01 2021
Historique:
received:
14
05
2020
accepted:
17
11
2020
entrez:
5
1
2021
pubmed:
6
1
2021
medline:
15
1
2021
Statut:
epublish
Résumé
In the male germ cells of placental mammals, 26-30-nt-long PIWI-interacting RNAs (piRNAs) emerge when spermatocytes enter the pachytene phase of meiosis. In mice, pachytene piRNAs derive from ~100 discrete autosomal loci that produce canonical RNA polymerase II transcripts. These piRNA clusters bear 5' caps and 3' poly(A) tails, and often contain introns that are removed before nuclear export and processing into piRNAs. What marks pachytene piRNA clusters to produce piRNAs, and what confines their expression to the germline? We report that an unusually long first exon (≥ 10 kb) or a long, unspliced transcript correlates with germline-specific transcription and piRNA production. Our integrative analysis of transcriptome, piRNA, and epigenome datasets across multiple species reveals that a long first exon is an evolutionarily conserved feature of pachytene piRNA clusters. Furthermore, a highly methylated promoter, often containing a low or intermediate level of CG dinucleotides, correlates with germline expression and somatic silencing of pachytene piRNA clusters. Pachytene piRNA precursor transcripts bind THOC1 and THOC2, THO complex subunits known to promote transcriptional elongation and mRNA nuclear export. Together, these features may explain why the major sources of pachytene piRNA clusters specifically generate these unique small RNAs in the male germline of placental mammals.
Identifiants
pubmed: 33397987
doi: 10.1038/s41467-020-20345-3
pii: 10.1038/s41467-020-20345-3
pmc: PMC7782496
doi:
Substances chimiques
BTBD18 protein, mouse
0
DNA-Binding Proteins
0
Histones
0
Nuclear Proteins
0
RNA, Messenger
0
RNA, Small Interfering
0
RNA-Binding Proteins
0
Thoc1 protein, mouse
0
5-hydroxymethylcytosine
1123-95-1
5-Methylcytosine
6R795CQT4H
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
73Subventions
Organisme : NICHD NIH HHS
ID : P01 HD078253
Pays : United States
Organisme : NICHD NIH HHS
ID : R01 HD049116
Pays : United States
Organisme : NIGMS NIH HHS
ID : R35 GM136275
Pays : United States
Références
Mol Cell. 2015 Apr 16;58(2):203-15
pubmed: 25818647
Mol Cell. 2008 Sep 26;31(6):785-99
pubmed: 18922463
Mol Cell. 2016 Jul 7;63(1):97-109
pubmed: 27292797
Cell Rep. 2013 Jun 27;3(6):2179-90
pubmed: 23791531
J Biol Chem. 2013 Oct 25;288(43):31350-6
pubmed: 24052263
Genes Dev. 2008 Apr 1;22(7):908-17
pubmed: 18381894
Bioinformatics. 2015 Jan 15;31(2):166-9
pubmed: 25260700
Science. 2007 May 4;316(5825):744-7
pubmed: 17446352
Genes Dev. 2016 Apr 1;30(7):840-55
pubmed: 27036967
Development. 2011 Aug;138(15):3319-30
pubmed: 21750041
Dev Cell. 2007 Apr;12(4):503-14
pubmed: 17395546
Genes Dev. 2015 May 15;29(10):1032-44
pubmed: 25995188
Nat Rev Genet. 2011 Nov 15;13(1):7-13
pubmed: 22083101
Nucleic Acids Res. 2020 May 7;48(8):4115-4138
pubmed: 32182340
Cell. 2014 Jun 5;157(6):1364-1379
pubmed: 24906153
Int J Biochem Cell Biol. 2017 Oct;91(Pt B):145-155
pubmed: 28673892
Silence. 2012 Dec 28;3(1):9
pubmed: 23273270
PLoS Genet. 2018 Feb 20;14(2):e1007233
pubmed: 29462142
Science. 2001 Aug 10;293(5532):1068-70
pubmed: 11498573
RNA. 2011 Jun;17(6):1048-56
pubmed: 21525145
Curr Protoc Bioinformatics. 2010 Dec;Chapter 11:Unit 11.7
pubmed: 21154709
Genes Dev. 2002 Nov 1;16(21):2792-9
pubmed: 12414732
Nucleic Acids Res. 2014 Oct 29;42(19):12249-60
pubmed: 25274738
Science. 2012 Dec 21;338(6114):1593-9
pubmed: 23258891
Genes Dev. 2014 Oct 1;28(19):2103-19
pubmed: 25223896
Cell Rep. 2015 Oct 27;13(4):854-863
pubmed: 26489470
Nature. 2006 Jul 13;442(7099):203-7
pubmed: 16751777
Genes Dev. 2006 Jul 1;20(13):1709-14
pubmed: 16766680
Nat Struct Mol Biol. 2012 Aug;19(8):773-81
pubmed: 22842725
Gene Expr. 2000;9(1-2):63-75
pubmed: 11097425
Dev Cell. 2013 Jan 14;24(1):13-25
pubmed: 23328397
Cell. 2019 Oct 17;179(3):632-643.e12
pubmed: 31607510
Cell Death Dis. 2013 Oct 24;4:e879
pubmed: 24157873
Science. 2006 Jul 21;313(5785):363-7
pubmed: 16778019
Nature. 2014 Nov 20;515(7527):355-64
pubmed: 25409824
RNA. 2000 Apr;6(4):638-50
pubmed: 10786854
Cell Rep. 2012 Jul 26;2(1):62-8
pubmed: 22840397
J Biol Chem. 2003 Aug 1;278(31):28750-7
pubmed: 12754203
Cell Rep. 2015 Mar 31;10(12):2069-82
pubmed: 25818294
Cell. 2013 Feb 28;152(5):957-68
pubmed: 23415457
Cell Res. 2015 Feb;25(2):193-207
pubmed: 25582079
Mol Cell. 2013 Apr 11;50(1):67-81
pubmed: 23523368
Nat Rev Genet. 2019 Feb;20(2):89-108
pubmed: 30446728
Nature. 2008 May 22;453(7194):534-8
pubmed: 18404147
Genes Dev. 2013 Apr 1;27(7):749-66
pubmed: 23592795
Proc Natl Acad Sci U S A. 2006 Sep 26;103(39):14423-8
pubmed: 16983072
Genome Biol. 2008;9(9):R137
pubmed: 18798982
Nat Methods. 2012 Mar 04;9(4):357-9
pubmed: 22388286
Mol Cell. 2008 Feb 1;29(2):271-8
pubmed: 18243121
Genes Dev. 2005 Jul 1;19(13):1512-7
pubmed: 15998806
Genes Dev. 2015 Jul 1;29(13):1403-15
pubmed: 26115953
Dev Cell. 2002 Jun;2(6):819-30
pubmed: 12062093
Bioinformatics. 2013 Jan 1;29(1):15-21
pubmed: 23104886
Dev Cell. 2017 Mar 13;40(5):453-466.e5
pubmed: 28292424
J Cell Sci. 2005 Dec 15;118(Pt 24):5825-34
pubmed: 16317046
Nature. 2017 Sep 7;549(7670):54-59
pubmed: 28847004
Nature. 2013 Jul 18;499(7458):360-3
pubmed: 23792564
Nucleic Acids Res. 2018 Jan 25;46(2):593-608
pubmed: 29126117
Mol Cell. 2018 Sep 6;71(5):775-790.e5
pubmed: 30193099
Cell Stem Cell. 2014 Aug 7;15(2):239-53
pubmed: 24835570
Cell. 2011 Sep 16;146(6):1016-28
pubmed: 21925322
Nat Ecol Evol. 2020 Jan;4(1):156-168
pubmed: 31900453
Nat Genet. 2007 Apr;39(4):457-66
pubmed: 17334365
Cell. 2014 Jun 5;157(6):1353-1363
pubmed: 24906152
Mol Cell. 2016 Apr 21;62(2):169-180
pubmed: 27105113
Elife. 2018 Jan 29;7:
pubmed: 29376823
PLoS One. 2017 Jan 26;12(1):e0170779
pubmed: 28125731
Nat Rev Genet. 2013 Mar;14(3):204-20
pubmed: 23400093
EMBO J. 2000 Feb 1;19(3):410-20
pubmed: 10722314
Bioinformatics. 2011 Jun 1;27(11):1571-2
pubmed: 21493656
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
Nat Genet. 2020 Jul;52(7):728-739
pubmed: 32601478
Nature. 2006 Jul 13;442(7099):199-202
pubmed: 16751776
EMBO J. 2011 Nov 16;30(22):4601-15
pubmed: 21952049
Cell Res. 2014 Jun;24(6):680-700
pubmed: 24787618
Development. 2011 Sep;138(17):3835-45
pubmed: 21828100
Nature. 2008 May 22;453(7194):539-43
pubmed: 18404146
Genome Res. 2010 Jul;20(7):890-8
pubmed: 20501695
Bioinformatics. 2010 Mar 15;26(6):841-2
pubmed: 20110278
Cell Rep. 2018 Sep 25;24(13):3413-3422.e4
pubmed: 30257203
Cell. 2009 Sep 18;138(6):1137-49
pubmed: 19732946
PLoS One. 2015 May 07;10(5):e0124860
pubmed: 25950437
Nat Rev Mol Cell Biol. 2017 Feb;18(2):90-101
pubmed: 27924077