| Literature DB >> 30872630 |
Haitham Sobhy1, Rajendra Kumar2,3, Jacob Lewerentz4, Ludvig Lizana5,6, Per Stenberg7,8.
Abstract
In specific cases, chromatin clearly forms long-range loops that place distant regulatory elements in close proximity to transcription start sites, but we have limited understanding of many loops identified by Chromosome Conformation Capture (such as Hi-C) analyses. In efforts to elucidate their characteristics and functions, we have identified highly interacting regions (HIRs) using intra-chromosomal Hi-C datasets with a new computational method based on looking at the eigenvector that corresponds to the smallest eigenvalue (here unity). Analysis of these regions using ENCODE data shows that they are in general enriched in bound factors involved in DNA damage repair and have actively transcribed genes. However, both highly transcribed regions as well as transcriptionally inactive regions can form HIRs. The results also indicate that enhancers and super-enhancers in particular form long-range interactions within the same chromosome. The accumulation of DNA repair factors in most identified HIRs suggests that protection from DNA damage in these regions is essential for avoidance of detrimental rearrangements.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30872630 PMCID: PMC6418152 DOI: 10.1038/s41598-019-40770-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(A) A genome browser screenshot (gcMapExplorer[26]) of the Hi-C contact map for GM12878 cells at 10 kb resolution, the lower panel shows the stationary distribution (SD). Highly interacting regions (HIRs) are indicated by light blue arrows, Table S1. (B) Spearman rank correlations between Hi-C stationary distribution and ENCODE factors at 100 kb resolution. For the most strongly correlated factors, see Supplementary Information SI-4. (C) The boxplot shows enrichment of factors in the HIRs compared to the two flanking regions. The ChIP-seq values were normalized to the corresponding percentile values (for additional factors see Fig. S7). The factors are significantly more enriched in HIRs than in flanking regions (student t-test, all p-values < 0.005).
Figure 2(A) Numbers of double stranded breaks (aDSBs and nDSBs denote aphidicolin- and neocarzinostatin-induced DSBs, respectively) within the HIRs, the two flanking regions and the expected genomic densities. (B) Percentages of regions based on four cut-offs of Hi-C stationary distribution (HiC-SD) overlapping with DSBs. HIRs have significantly more overlap with DSBs than the other three types of regions. Indicated p-values are based on student t-tests.
Figure 3(A) Hierarchical clustering of HIRs (rows) and ENCODE factors (columns) based on Euclidian distances. HIR-F1 and HIR-F2 rows and columns are sorted as HIRs. (B) The boxplot shows enrichment of the most strongly enriched factors in the HIRs, relative to the two flanking regions for each class. The ChIP-seq values were normalized to the corresponding percentile values. (C) Average numbers of transcripts per HIR in each class. (D) Average gene expression levels (RPKM) in each HIR class (Fig. S11). (E) Percentages of the HIRs overlapping with TAD borders or completely localized within TADs.
Figure 4Numbers of (A) typical-enhancers and (B) super-enhancers overlapping HIRs, the two flanking regions and expected densities. (C) Percentages of regions, based on four cut-offs of Hi-C stationary distribution (HiC-SD) overlapping typical- and super-enhancers (combined). (D) Average numbers of typical-enhancers and (E) super-enhancers overlapping each HIR class. (F) Average numbers of ncRNAs overlapping each HIR class. P-values derived from Student t-tests are indicated.