| Literature DB >> 33489005 |
Cheynna Crowley1,2, Yuchen Yang1, Yunjiang Qiu3,4, Benxia Hu1,5, Armen Abnousi6, Jakub Lipiński7, Dariusz Plewczyński7,8, Di Wu2,9, Hyejung Won1,5, Bing Ren3,10,11, Ming Hu6, Yun Li1,2,12.
Abstract
Hi-C experiments have been widely adopted to study chromatin spatial organization, which plays an essential role in genome function. We have recently identified frequently interacting regions (FIREs) and found that they are closely associated with cell-type-specific gene regulation. However, computational tools for detecting FIREs from Hi-C data are still lacking. In this work, we present FIREcaller, a stand-alone, user-friendly R package for detecting FIREs from Hi-C data. FIREcaller takes raw Hi-C contact matrices as input, performs within-sample and cross-sample normalization, and outputs continuous FIRE scores, dichotomous FIREs, and super-FIREs. Applying FIREcaller to Hi-C data from various human tissues, we demonstrate that FIREs and super-FIREs identified, in a tissue-specific manner, are closely related to gene regulation, are enriched for enhancer-promoter (E-P) interactions, tend to overlap with regions exhibiting epigenomic signatures of cis-regulatory roles, and aid the interpretation or GWAS variants. The FIREcaller package is implemented in R and freely available at https://yunliweb.its.unc.edu/FIREcaller.Entities:
Keywords: Chromatin spatial organization; Frequently Interacting Regions (FIREs); Hi-C
Year: 2020 PMID: 33489005 PMCID: PMC7788093 DOI: 10.1016/j.csbj.2020.12.026
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Flow chart of calling FIREs using the FIREcaller software. * indicates when > 1 replicate per condition exists. Further detailed in Section 2.8.
Fig. 2Super-FIRE detection. A) Flow chart for super-FIRE identification. B) Scatterplot of clustered FIREs ranked by their super-FIRE scores for the Hi-C data from hippocampus [14], ordered from the least interactive regions (left) to the most interactive regions (right). Blue dashed line highlights the inflection point of the curve and the red dots highlight super-FIREs, which are clusters of contiguous FIREs to the right of the inflection point. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 3An example of a super-FIRE in human hippocampus tissue. Virtual 4C plot of a 1 Mb region (chr18:52,665,002–53,665,002) anchored at the schizophrenia-associated GWAS SNP rs9960767 (black vertical line), visualized by HUGIn [35]. The solid black, red and blue lines represent the observed contact frequency, expected contact frequency, and –log10(p-value) from Fit-Hi-C [40], respectively. The dashed purple and green lines represent significant thresholds corresponding to Bonferroni correction and 5% FDR, respectively. The yellow horizontal bar in the “FIREs” track depicts the 400 Kb super-FIRE region. The two orange horizontal bars in the “Enhancers” track mark the two hippocampus super-enhancers in the region. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 4Distribution of expression for genes overlapping fetal or adult brain FIREs. The leftmost pair of violin boxplots shows the expression profile of the 587 genes mapped to adult brain-specific FIREs, with expression measured in fetal brain cortex (blue) and adult brain cortex (red), respectively. The second pair of violin boxplots shows the expression profile of the 412 genes mapped to fetal brain-specific FIREs, again in fetal brain cortex (blue) and adult brain cortex (red), respectively. The third pair shows the expression profile of the 295 genes mapped to FIREs shared between fetal and adult brain, yet again in fetal brain cortex (blue) and in adult brain cortex (red). The fourth pair shows the expression profile of genes not overlapping any FIREs, with a total of 15,640 such genes (labelled “Not FIRE bins”). To the farthest right shows the expression profile of 816 genes overlapping with “permuted-FIREs” with fetal cortex gene expression (blue) and adult brain cortex gene expression (red). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Tissue-specific FIREs and shared FIREs, and overlapping genes.
| # FIREs | # FIREs overlapping with a gene | # of genes overlapping FIREs | |
|---|---|---|---|
| Adult-specific | 2,408 | 488 | 587 |
| Fetal-specific | 2,407 | 338 | 412 |
| Shared | 1,518 | 258 | 295 |
Tissue-Specific FIREs and Tissue-Specific E-P interactions in Liver and Left Ventricle tissues. In the table, we count the numbers of tissue-specific E-P interactions involving tissue-specific FIREs. For example, 1,093 means there are 1,093 left ventricle specific E-P interactions involving left ventricle-specific FIREs. Similarly, for the remaining three counts.
| Left Ventricle-Specific E-P | Liver-Specific E-P | |
|---|---|---|
| Left Ventricle-Specific FIRE | 1,093 | 416 |
| Liver-Specific FIRE | 951 | 1,392 |
Fig. 5H3K4me3 and H3K27ac ChIP-seq peaks are enriched at FIREs. X axis is the distance from a bin, with the bins grouped into FIRE bins and non-FIRE bins. Y axis is fold enrichment quantified by MACS [42] when applied to the corresponding histone ChIP-seq data.
Fig. 6Relationship between differential FIREs and cell-type-specific enhancers in GM12878 and H1 cells. The size of the dots corresponds to the OR and the color of the dots corresponds to the p-value.