| Literature DB >> 27009150 |
Joshua Starmer1,2, Terry Magnuson3,4.
Abstract
BACKGROUND: Correctly identifying genomic regions enriched with histone modifications and transcription factors is key to understanding their regulatory and developmental roles. Conceptually, these regions are divided into two categories, narrow peaks and broad domains, and different algorithms are used to identify each one. Datasets that span these two categories are often analyzed with a single program for peak calling combined with an ad hoc method for domains.Entities:
Keywords: ChIP-seq; Computational analysis; Histone modifications; hiddenDomains
Mesh:
Substances:
Year: 2016 PMID: 27009150 PMCID: PMC4806451 DOI: 10.1186/s12859-016-0991-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Broad Domains in H3K27me3 ChIP-seq Data. a A UCSC Genome Browser screenshot of the ChIP-seq and domains called by the various methods. b The number of domains called for each method used and the average domain width. c The sensitivity and specificity for the original ChIP-seq dataset and down-sampled versions of it. The colors used in the graphs represent the same programs listed in the legend for (b)
Fig. 2Evaluating Domain Widths with H3K36me3 ChIP-seq Data. a A UCSC Genome Browser screenshot of the ChIP-seq and domains called by the various methods. b The percentage of domains from each method that overlapped gene bodies, the average domain width (with the average gene body with for transcribed genes indicated with a red line) and the percent of highly expressed genes larger than 3 kb that were overlapped by domains called by the various programs
Fig. 3Narrow Peaks in two GABP ChIP-seq Datasets. a A UCSC Genome Browser screenshot of the Jurkat ChIP-seq and peaks called by the various methods. b The number of peaks called for each method and the average peak width. c The number of peaks called, sensitivity and the percentage of peaks that overlapped predicted binding sites for the original ChIP-seq dataset and down-sampled versions of it. The colors used in the graphs represent the same programs listed in the legend for (b). d A UCSC Genome Browser screenshot of the ENCODE ChIP-seq and peaks called by the various methods. e The number of qPCR validated sites and predicted binding sites overlapped by the peaks and the percentage of peaks that overlapped predicted binding sites called by the various methods
Fig. 4Narrow Peaks in ESR1 and FOXA1 ChIP-seq Data. a A UCSC Genome Browser screenshot of the ESR1 ChIP-seq and peaks called by the various methods. b The number of peaks called, the number of predicted binding sites overlapped by peaks, and the percentage of peaks that overlapped predicted binding sites for the various methods. c A UCSC Genome Browser screenshot of the FOXA1 ChIP-seq and peaks called by the various methods. d The number of peaks called, the number of predicted binding sites overlapped by peaks, and the percentage of peaks that overlapped predicted binding sites for the various methods