| Literature DB >> 28592290 |
Poppy Simmonds1,2, Erick Loomis3, Edward Curry4.
Abstract
BACKGROUND: Profiles of DNA methylation of many tissues relevant in human disease have been obtained from microarrays and are publicly available. These can be used to generate maps of chromatin compartmentalization, demarcating open and closed chromatin across the genome. Additionally, large sets of genome-wide transcription factor binding profiles have been made available thanks to ChIP-seq technology.Entities:
Keywords: Chromatin compartments; Epigenetics; Prostate cancer; Transcriptional regulation
Mesh:
Substances:
Year: 2017 PMID: 28592290 PMCID: PMC5463361 DOI: 10.1186/s13073-017-0443-z
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Fig. 1Schematic illustrating the overall analysis procedure, using three input datasets: a PRAD and normal prostate tissue DNA methylation data from The Cancer Genome Atlas (TCGA); b transcription factor (TF) DNA binding profiles from Encyclopedia of DNA Elements (ENCODE) ChIP-seq experiments; c PRAD and normal prostate tissue gene expression data from TCGA. Analytical steps are numbered (i–vii), with the final intersection between lists of TFs from steps (iii) and (vii) giving our candidate driver TFs
Fig. 2Genome-wide profiles of chromatin compartmentalization from PRAD and normal prostate tissues. a Genome-wide profiles showing assignment of 100-kb windows to open or closed chromatin compartments in PRADs (top panel) and in normal prostate tissue (bottom panel). Horizontal axis denotes rank of each 100-kb window by genomic coordinate, with each chromosome separated by vertical dashed lines. Vertical axis represents the confidence of assignment of each window to the closed compartment (+ve values) or the open compartment (–ve values). Confidence of assignment is based on the normalized value of the first eigenvector of the DNA methylation correlation matrix from each set of samples. b Scatter-plot showing the relationship between compartment assignment in PRAD tumours and in normal prostate tissue. Horizontal axis gives the closed-ness score (value in first eigenvector of the DNA methylation correlation matrix) for each 100-kb genomic window in the normal prostate tissues. Vertical axis gives the closed-ness score for the same genomic windows in the PRAD tumour tissues. After excluding windows with a low confidence assignment (defined as having a value of less than 0.1 in the first eigenvector of the DNA methylation correlation matrix), windows assigned to the open chromatin compartment in tumours but the closed chromatin compartment in normal tissue are plotted in red, while windows assigned to the closed chromatin compartment in tumours but the open chromatin compartment in normal tissue are plotted in blue. c Genome-wide profile of the differences in compartment assignment between PRAD tumours and normal prostate tissues, as in a but featuring only 100-kb windows with a different compartment assignment in the two tissue types. Genomic windows assigned to the open chromatin compartment in tumours but the closed chromatin compartment in normal tissue are plotted in red, while windows assigned to the closed chromatin compartment in tumours but the open chromatin compartment in normal tissue are plotted in blue. d DNA accessibility in LNCaP prostate cancer cell line, shown for genomic regions classified according to inferred compartment call from normal prostate and PRAD tumour DNA methylation data. Numerical values plotted show the distribution of the average DNAse I sequencing signal across each 100-kb window. e Accessible chromatin histone modification H3K27ac in LNCaP prostate cancer cell line, shown for genomic regions classified according to inferred compartment call from normal prostate and PRAD tumour DNA methylation data. Numerical values plotted show the distribution of the proportion of each 100-kb window that is covered by an H3K27ac ChIP-seq peak
Fig. 3Analysis of transcription factor DNA binding profiles in the context of aberrant chromatin compartmentalization in prostate adenocarcinoma. a Degree of enrichment or depletion of TF ChIP-seq peak sites in the set of regions with PRAD-specific open chromatin compartmentalization. Each ENCODE ChIP-seq study has a corresponding data point, and they are ranked along the horizontal axis by the degree of enrichment (from most depleted on the left to most enriched on the right). Vertical axis gives log ratio of the observed over expected number of peaks lying in the regions of PRAD-specific open chromatin (i.e. windows assigned to the open chromatin compartment in tumours but the closed chromatin compartment in normal tissue). Dotted horizontal line at 0 indicates number of peaks in PRAD-specific open chromatin is exactly that expected by chance. b Heatmap showing overlap between peak locations for each of the most PRAD-specific open chromatin enriched TF ChIP-seq studies. Black indicates no overlap, white indicates total overlap. c Scatter-plot showing the relationship between a TF’s enrichment to PRAD-specific open chromatin and target gene overexpression in PRAD relative to normal prostate tissue. Horizontal axis gives log ratio of observed over expected number of peaks lying in PRAD-specific open chromatin (as in a) for each ENCODE TF ChIP-seq study. Vertical axis gives –log10 p value from test of systematic overexpression of inferred target genes of the corresponding TF (genes with ChIP-seq peak lying within 5 kb of TSS) in PRAD tumours relative to normal prostate tissues. Dashed red lines indicate the median value in each axis across all the included ENCODE TF ChIP-seq studies. d Volcano plot showing differential expression of candidate driver TFs in the context of all genes’ differential expression between PRAD tumours and normal prostate tissue. Horizontal axis gives log2 ratio between median expression in PRAD tumours and median expression across normal prostate tissues. Vertical axis gives –log10 p value of differential expression as evaluated through empirical Bayes moderated t test. Candidate drivers with ChIP-seq studies suggesting the TF’s DNA binding sites are enriched in regions of PRAD-specific open chromatin are shown in red, candidate driver (TRIM28) with ChIP-seq studies suggesting the TF’s DNA binding sites are enriched in regions of PRAD-specific closed chromatin is shown in blue