Morten Rye1, Pål Sætrom, Tony Håndstad, Finn Drabløs. 1. Department of Cancer Research and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway. morten.rye@ntnu.no
Abstract
BACKGROUND: Transcription factor binding to DNA requires both an appropriate binding element and suitably open chromatin, which together help to define regulatory elements within the genome. Current methods of identifying regulatory elements, such as promoters or enhancers, typically rely on sequence conservation, existing gene annotations or specific marks, such as histone modifications and p300 binding methods, each of which has its own biases. RESULTS: Herein we show that an approach based on clustering of transcription factor peaks from high-throughput sequencing coupled with chromatin immunoprecipitation (Chip-Seq) can be used to evaluate markers for regulatory elements. We used 67 data sets for 54 unique transcription factors distributed over two cell lines to create regulatory element clusters. By integrating the clusters from our approach with histone modifications and data for open chromatin, we identified general methylation of lysine 4 on histone H3 (H3K4me) as the most specific marker for transcription factor clusters. Clusters mapping to annotated genes showed distinct patterns in cluster composition related to gene expression and histone modifications. Clusters mapping to intergenic regions fall into two groups either directly involved in transcription, including miRNAs and long noncoding RNAs, or facilitating transcription by long-range interactions. The latter clusters were specifically enriched with H3K4me1, but less with acetylation of lysine 27 on histone 3 or p300 binding. CONCLUSION: By integrating genomewide data of transcription factor binding and chromatin structure and using our data-driven approach, we pinpointed the chromatin marks that best explain transcription factor association with different regulatory elements. Our results also indicate that a modest selection of transcription factors may be sufficient to map most regulatory elements in the human genome.
BACKGROUND: Transcription factor binding to DNA requires both an appropriate binding element and suitably open chromatin, which together help to define regulatory elements within the genome. Current methods of identifying regulatory elements, such as promoters or enhancers, typically rely on sequence conservation, existing gene annotations or specific marks, such as histone modifications and p300 binding methods, each of which has its own biases. RESULTS: Herein we show that an approach based on clustering of transcription factor peaks from high-throughput sequencing coupled with chromatin immunoprecipitation (Chip-Seq) can be used to evaluate markers for regulatory elements. We used 67 data sets for 54 unique transcription factors distributed over two cell lines to create regulatory element clusters. By integrating the clusters from our approach with histone modifications and data for open chromatin, we identified general methylation of lysine 4 on histone H3 (H3K4me) as the most specific marker for transcription factor clusters. Clusters mapping to annotated genes showed distinct patterns in cluster composition related to gene expression and histone modifications. Clusters mapping to intergenic regions fall into two groups either directly involved in transcription, including miRNAs and long noncoding RNAs, or facilitating transcription by long-range interactions. The latter clusters were specifically enriched with H3K4me1, but less with acetylation of lysine 27 on histone 3 or p300 binding. CONCLUSION: By integrating genomewide data of transcription factor binding and chromatin structure and using our data-driven approach, we pinpointed the chromatin marks that best explain transcription factor association with different regulatory elements. Our results also indicate that a modest selection of transcription factors may be sufficient to map most regulatory elements in the human genome.
Authors: D Karolchik; R Baertsch; M Diekhans; T S Furey; A Hinrichs; Y T Lu; K M Roskin; M Schwartz; C W Sugnet; D J Thomas; R J Weber; D Haussler; W J Kent Journal: Nucleic Acids Res Date: 2003-01-01 Impact factor: 16.971
Authors: Bradley E Bernstein; Tarjei S Mikkelsen; Xiaohui Xie; Michael Kamal; Dana J Huebert; James Cuff; Ben Fry; Alex Meissner; Marius Wernig; Kathrin Plath; Rudolf Jaenisch; Alexandre Wagschal; Robert Feil; Stuart L Schreiber; Eric S Lander Journal: Cell Date: 2006-04-21 Impact factor: 41.582
Authors: Mathieu Blanchette; Alain R Bataille; Xiaoyu Chen; Christian Poitras; Josée Laganière; Céline Lefèbvre; Geneviève Deblois; Vincent Giguère; Vincent Ferretti; Dominique Bergeron; Benoit Coulombe; François Robert Journal: Genome Res Date: 2006-04-10 Impact factor: 9.043
Authors: Len A Pennacchio; Nadav Ahituv; Alan M Moses; Shyam Prabhakar; Marcelo A Nobrega; Malak Shoukry; Simon Minovitsky; Inna Dubchak; Amy Holt; Keith D Lewis; Ingrid Plajzer-Frick; Jennifer Akiyama; Sarah De Val; Veena Afzal; Brian L Black; Olivier Couronne; Michael B Eisen; Axel Visel; Edward M Rubin Journal: Nature Date: 2006-11-05 Impact factor: 49.962
Authors: Alexei A Sharov; Akira Nishiyama; Yong Qian; Dawood B Dudekula; Dan L Longo; David Schlessinger; Minoru S H Ko Journal: J Comput Biol Date: 2014-06-11 Impact factor: 1.479
Authors: Erin R Ewald; Gary S Wand; Fayaz Seifuddin; Xiaoju Yang; Kellie L Tamashiro; James B Potash; Peter Zandi; Richard S Lee Journal: Psychoneuroendocrinology Date: 2014-03-20 Impact factor: 4.905
Authors: Christopher M Vockley; Anthony M D'Ippolito; Ian C McDowell; William H Majoros; Alexias Safi; Lingyun Song; Gregory E Crawford; Timothy E Reddy Journal: Cell Date: 2016-08-25 Impact factor: 41.582