Deciphering the complexity of human non-coding promoter-proximal transcriptome.
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944
Informations de publication
Date de publication:
01 08 2019
01 08 2019
Historique:
received:
03
07
2018
revised:
08
11
2018
accepted:
07
12
2018
pubmed:
12
12
2018
medline:
17
6
2020
entrez:
12
12
2018
Statut:
ppublish
Résumé
Long non-coding RNAs (lncRNAs) have gained increasing relevance in epigenetic regulation and nuclear functional organization. High-throughput sequencing approaches have revealed frequent non-coding transcription in promoter-proximal regions. However, a comprehensive catalogue of promoter-associated RNAs (paRNAs) and an analysis of the possible interactions with neighboring genes and genomic regulatory elements are missing. Integrating data from multiple cell types and experimental platforms we identified thousands of paRNAs in the human genome. paRNAs are transcribed in both sense and antisense orientation, are mostly non-polyadenylated and retained in the cell nucleus. Transcriptional regulators, epigenetic effectors and activating chromatin marks are enriched in paRNA-positive promoters. Furthermore, paRNA-positive promoters exhibit chromatin signatures of both active promoters and enhancers. Promoters with paRNAs reside preferentially at chromatin loop boundaries, suggesting an involvement in anchor site recognition and chromatin looping. Importantly, these features are independent of the transcriptional state of neighboring genes. Thus, paRNAs may act as cis-regulatory modules with an impact on local recruitment of transcription factors, epigenetic state and chromatin loop organization. This study provides a comprehensive analysis of the promoter-proximal transcriptome and offers novel insights into the roles of paRNAs in epigenetic processes and human diseases. Genomic coordinates of predicted paRNAs are available at https://figshare.com: https://doi.org/10.6084/m9.figshare.7392791.v1 and https://doi.org/10.6084/m9.figshare.4856630.v2. Supplementary data are available at Bioinformatics online.
Identifiants
pubmed: 30535182
pii: 5237560
doi: 10.1093/bioinformatics/bty981
pmc: PMC6662291
doi:
Substances chimiques
Chromatin
0
RNA, Long Noncoding
0
Banques de données
figshare
['10.6084/m9.figshare.7392791.v1', '10.6084/m9.figshare.4856630.v2']
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
2529-2534Subventions
Organisme : Medical Research Council
ID : MR/P000711/1
Pays : United Kingdom
Informations de copyright
© The Author(s) 2018. Published by Oxford University Press.
Références
Science. 2007 Jun 8;316(5830):1484-8
pubmed: 17510325
Science. 2008 Dec 19;322(5909):1851-4
pubmed: 19056938
Science. 2008 Dec 19;322(5909):1849-51
pubmed: 19056940
Science. 2008 Dec 19;322(5909):1845-8
pubmed: 19056941
Mol Cell. 2010 May 28;38(4):576-89
pubmed: 20513432
Nat Biotechnol. 2010 Aug;28(8):817-25
pubmed: 20657582
Nat Biotechnol. 2010 Oct;28(10):1057-68
pubmed: 20944598
Proc Natl Acad Sci U S A. 2011 Jun 28;108(26):10460-5
pubmed: 21670248
Nat Struct Mol Biol. 2012 Nov;19(11):1068-75
pubmed: 23132386
Proc Natl Acad Sci U S A. 2013 Feb 19;110(8):2876-81
pubmed: 23382218
Nat Struct Mol Biol. 2013 Mar;20(3):300-7
pubmed: 23463315
Nature. 2013 Jun 27;498(7455):516-20
pubmed: 23728302
Nature. 2013 Aug 29;500(7464):598-602
pubmed: 23945587
BMC Genomics. 2013 Dec 23;14:914
pubmed: 24365181
Elife. 2014 May 27;3:e02200
pubmed: 24867637
Cell. 2014 Dec 18;159(7):1665-80
pubmed: 25497547
Science. 2015 Nov 20;350(6263):978-81
pubmed: 26516199
Nucleic Acids Res. 2016 Jan 29;44(2):524-37
pubmed: 26612865
Cell Stem Cell. 2016 May 5;18(5):637-52
pubmed: 26996597
Nature. 2016 Nov 17;539(7629):452-455
pubmed: 27783602
Nat Commun. 2017 May 30;8:15622
pubmed: 28555645
Nat Genet. 2017 Jul;49(7):1073-1081
pubmed: 28581502
Nat Rev Genet. 2016 Dec;17(12):772
pubmed: 28704353