HOT or not: examining the basis of high-occupancy target regions.
Amino Acid Motifs
Animals
Artifacts
Binding Sites
Caenorhabditis elegans
Chromatin Immunoprecipitation
/ methods
DNA
/ chemistry
Drosophila melanogaster
False Positive Reactions
G-Quadruplexes
Genome
Genome, Human
Genomics
Humans
Mice
Promoter Regions, Genetic
Protein Binding
Protein Domains
RNA
/ chemistry
Sequence Analysis, DNA
Transcription Factors
/ chemistry
Journal
Nucleic acids research
ISSN: 1362-4962
Titre abrégé: Nucleic Acids Res
Pays: England
ID NLM: 0411011
Informations de publication
Date de publication:
20 06 2019
20 06 2019
Historique:
accepted:
13
05
2019
revised:
02
05
2019
received:
12
11
2018
pubmed:
23
5
2019
medline:
4
12
2019
entrez:
23
5
2019
Statut:
ppublish
Résumé
High-occupancy target (HOT) regions are segments of the genome with unusually high number of transcription factor binding sites. These regions are observed in multiple species and thought to have biological importance due to high transcription factor occupancy. Furthermore, they coincide with house-keeping gene promoters and consequently associated genes are stably expressed across multiple cell types. Despite these features, HOT regions are solely defined using ChIP-seq experiments and shown to lack canonical motifs for transcription factors that are thought to be bound there. Although, ChIP-seq experiments are the golden standard for finding genome-wide binding sites of a protein, they are not noise free. Here, we show that HOT regions are likely to be ChIP-seq artifacts and they are similar to previously proposed 'hyper-ChIPable' regions. Using ChIP-seq data sets for knocked-out transcription factors, we demonstrate presence of false positive signals on HOT regions. We observe sequence characteristics and genomic features that are discriminatory of HOT regions, such as GC/CpG-rich k-mers, enrichment of RNA-DNA hybrids (R-loops) and DNA tertiary structures (G-quadruplex DNA). The artificial ChIP-seq enrichment on HOT regions could be associated to these discriminatory features. Furthermore, we propose strategies to deal with such artifacts for the future ChIP-seq studies.
Identifiants
pubmed: 31114922
pii: 5494779
doi: 10.1093/nar/gkz460
pmc: PMC6582337
doi:
Substances chimiques
Transcription Factors
0
RNA
63231-63-0
DNA
9007-49-2
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
5735-5745Informations de copyright
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.
Références
Mol Immunol. 2005 Feb;42(3):327-33
pubmed: 15589321
Mol Immunol. 1996 Mar-Apr;33(4-5):471-83
pubmed: 8676898
PLoS Comput Biol. 2011 Oct;7(10):e1002195
pubmed: 22039361
Genes Dev. 2016 Jun 1;30(11):1327-38
pubmed: 27298336
Autoimmunity. 2010 Dec;43(8):682-9
pubmed: 20828249
Nucleic Acids Res. 1986 Jun 25;14(12):5049-65
pubmed: 3725591
Bioinformatics. 2015 Apr 1;31(7):1127-9
pubmed: 25417204
Genes Dev. 2004 Jul 1;18(13):1618-29
pubmed: 15231739
Nucleic Acids Res. 2014 Dec 1;42(21):13051-60
pubmed: 25378309
Clin Exp Immunol. 2002 Dec;130(3):541-7
pubmed: 12452847
Mol Cell. 2012 Mar 30;45(6):814-25
pubmed: 22387027
Nature. 2014 Mar 27;507(7493):462-70
pubmed: 24670764
Adv Biochem Eng Biotechnol. 2007;104:65-85
pubmed: 17290819
J Immunol. 1993 Aug 1;151(3):1614-26
pubmed: 8393048
J Stat Softw. 2010;33(1):1-22
pubmed: 20808728
Nature. 2014 Aug 28;512(7515):453-6
pubmed: 25164757
Genome Res. 2014 Jul;24(7):1138-46
pubmed: 24653213
Autoimmunity. 2007 Feb;40(1):38-47
pubmed: 17364496
Protein Sci. 2004 Dec;13(12):3222-9
pubmed: 15557264
Mol Immunol. 2009 Apr;46(7):1467-80
pubmed: 19181386
Nat Methods. 2016 Oct;13(10):823-7
pubmed: 27595404
F1000Res. 2015 Jul 13;4:235
pubmed: 26594335
Proc Natl Acad Sci U S A. 2013 Nov 12;110(46):18602-7
pubmed: 24173036
PLoS One. 2013 Dec 09;8(12):e83506
pubmed: 24349523
Genes Dev. 2014 Jul 1;28(13):1384-96
pubmed: 24990962
Nat Rev Genet. 2010 Nov;11(11):751-60
pubmed: 20877328
Genes Dev. 2011 May 15;25(10):1010-22
pubmed: 21576262
J Biol Chem. 2011 Jul 22;286(29):26220-7
pubmed: 21566138
Nucleic Acids Res. 2015 Aug 18;43(14):6959-68
pubmed: 26117547
Nat Immunol. 2011 Sep 20;12(10):918-22
pubmed: 21934668
J Immunol. 2009 Sep 1;183(5):3109-17
pubmed: 19648272
Genome Biol. 2012 Sep 26;13(9):R48
pubmed: 22950945
Biochem Cell Biol. 2013 Dec;91(6):378-94
pubmed: 24219279
Nat Rev Genet. 2015 Oct;16(10):583-97
pubmed: 26370899
Proc Natl Acad Sci U S A. 1997 Mar 4;94(5):1955-60
pubmed: 9050886
Mol Cell. 2016 Jul 7;63(1):167-78
pubmed: 27373332
Database (Oxford). 2010 Oct 12;2010:baq023
pubmed: 20940177
Mol Immunol. 2002 Jan;38(7):505-14
pubmed: 11750652
Clin Exp Rheumatol. 1995 Jan-Feb;13(1):7-10
pubmed: 7774106
J Nucleic Acids. 2011;2011:720798
pubmed: 22013507
Cell. 2013 Oct 24;155(3):713-24
pubmed: 24243024
Nat Genet. 2016 Oct;48(10):1267-72
pubmed: 27618450
Nucleic Acids Res. 2017 Jan 4;45(D1):D158-D169
pubmed: 27899622
Nature. 2015 Feb 19;518(7539):317-30
pubmed: 25693563
Genome Res. 2013 Oct;23(10):1590-600
pubmed: 23868195
Int Immunol. 2001 Sep;13(9):1099-107
pubmed: 11526090
Genes Dev. 2006 Jul 15;20(14):1838-47
pubmed: 16847344
Proc Natl Acad Sci U S A. 1995 Dec 19;92(26):12393-7
pubmed: 8618908
Nature. 2009 Jun 18;459(7249):927-30
pubmed: 19536255
Proc Natl Acad Sci U S A. 1995 Mar 28;92(7):2529-33
pubmed: 7708679
Nature. 2012 Sep 6;489(7414):57-74
pubmed: 22955616
Lupus. 2002;11(12):865-71
pubmed: 12529052
Immunol Lett. 2000 Jul 3;73(1):29-34
pubmed: 10963808
J Clin Invest. 1994 Jan;93(1):443-9
pubmed: 8282818
Nat Methods. 2018 Jul;15(7):499-504
pubmed: 29941872
Proc Natl Acad Sci U S A. 2014 Dec 2;111(48):17140-5
pubmed: 25313048
Clin Immunol. 2003 Dec;109(3):278-87
pubmed: 14697742
Nat Biotechnol. 2010 May;28(5):495-501
pubmed: 20436461
Lupus. 2005;14(7):543-50
pubmed: 16130511
Elife. 2015 Jul 16;4:
pubmed: 26182405
Nucleic Acids Res. 2012 Jan;40(Database issue):D290-301
pubmed: 22127870
Nat Genet. 2016 Nov;48(11):1385-1395
pubmed: 27668659
Science. 2010 Dec 24;330(6012):1775-87
pubmed: 21177976
Nucleic Acids Res. 2016 Jun 20;44(11):5299-312
pubmed: 27131367
Bioinformatics. 2014 Mar 15;30(6):775-83
pubmed: 24162561
J Immunol. 1985 Dec;135(6):4080-5
pubmed: 3877763
J Immunol Methods. 1994 Jul 12;173(1):7-10
pubmed: 8034988