| Literature DB >> 24253304 |
Qixuan Wang1, Jinyan Huang, Hanfei Sun, Jing Liu, Juan Wang, Qian Wang, Qian Qin, Shenglin Mei, Chengchen Zhao, Xiaoqin Yang, X Shirley Liu, Yong Zhang.
Abstract
Diversified histone modifications (HMs) are essential epigenetic features. They play important roles in fundamental biological processes including transcription, DNA repair and DNA replication. Chromatin regulators (CRs), which are indispensable in epigenetics, can mediate HMs to adjust chromatin structures and functions. With the development of ChIP-Seq technology, there is an opportunity to study CR and HM profiles at the whole-genome scale. However, no specific resource for the integration of CR ChIP-Seq data or CR-HM ChIP-Seq linkage pairs is currently available. Therefore, we constructed the CR Cistrome database, available online at http://compbio.tongji.edu.cn/cr and http://cistrome.org/cr/, to further elucidate CR functions and CR-HM linkages. Within this database, we collected all publicly available ChIP-Seq data on CRs in human and mouse and categorized the data into four cohorts: the reader, writer, eraser and remodeler cohorts, together with curated introductions and ChIP-Seq data analysis results. For the HM readers, writers and erasers, we provided further ChIP-Seq analysis data for the targeted HMs and schematized the relationships between them. We believe CR Cistrome is a valuable resource for the epigenetics community.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24253304 PMCID: PMC3965064 DOI: 10.1093/nar/gkt1151
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Content of the CR Cistrome database. For each CR collected, we provide basic information and ChIP-Seq data analysis results. For each reader, writer and eraser (if there are available ChIP-Seq data for the related HMs from the same cell line/pop, or the cell type, when the same cell line is not available), we also provide the ChIP-Seq analysis results for these HMs, including the same results as provided for the CR data, except that HMs don’t have motif scan results. Furthermore, the resultant Venn diagram, the genomic distribution of overlap peaks between CR and HM, average CR and HM profile in CR’s binding sites and reads density plot of CR and HM in CR’s peaks are shown to illustrate the relationship between the CRs and related HMs.
The statistics of CR ChIP-Seq data set
| CR type | CR number | CR name | ChIP-seq dataset number (human/mouse) |
|---|---|---|---|
| Reader | 5 | CHD1, RAG2, TAF3, PHF8, WDR5 | 18/7 |
| Writer | 11 | CREBBP, Ep300, EZH1, EZH2, KAT7, PCAF, SETDB1, WHSC1, WDR5, RAC3, KAT2A | 71/21 |
| Eraser | 14 | PHF8, KDM5B, KDM5A, KDM2A, KDM1A, HDAC6, HDAC3, HDAC2, HDAC1, SIRT6, SIRT1, KDM5C, KDM4A, KDM6B | 49/10 |
| Remodeler | 9 | CHD1, CHD2, CHD4, CHD7, MTA3, SMARCA4, SMARCB1, SMARCC1, SMARCC2 | 26/14 |
| Total | 36 | 146/48 |
After the quality control, we finally got 5 readers with 18 ChIP-Seq data sets in human and 7 ChIP-Seq data sets in mouse, 11 writers with 71 ChIP-Seq data sets in human and 21 ChIP-Seq data sets in mouse, 14 erasers with 49 ChIP-Seq data sets in human and 10 ChIP-Seq data sets in mouse, 9 remodelers with 26 ChIP-Seq data sets in human and 14 ChIP-Seq data sets in mouse, in total, that is 146 ChIP-Seq data sets in human and 48 ChIP-Seq data sets in mouse.
The statistics of HM ChIP-Seq data set
| HM type | HM number | HM name | ChIP-seq data set number (human/ mouse) |
|---|---|---|---|
| Methylation | 7 | H3K4me1, H3K4me2, H3K4me3, H4K20me1, H3K9me3, H3K27me3, H3K36me3 | 96/25 |
| Acetylation | 6 | H3K9ac, H3K56ac, H3K27ac, H3K18ac, H4K8ac, H4K5ac | 45/11 |
| Total | 13 | 141/36 |
After the quality control, we finally got 7 kinds of histone methylation with 96 ChIP-Seq data sets in human and 25 ChIP-Seq data sets in mouse and 6 kinds of acetylation with 45 ChIP-Seq data sets in human and 11 ChIP-Seq data sets in mouse.
The statistics of CR-histone linkage data set pair
| CR-HM link type | CR-HM number | ChIP-seq data set number |
|---|---|---|
| Writer-HM | 13 | 165 |
| Eraser-HM | 12 | 171 |
| Reader-HM | 5 | 122 |
| Total | 30 | 458 |
There are 13 pairs and 165 data sets of Writer-HM linkage, 12 pairs and 171 data sets of Eraser-HM linkage, 5 pairs and 122 data sets of Reader-HM linkage.
Figure 2.Screenshot depicting an example run of the CR Cistrome database. Assuming the user is interested in PHF8 in human fibroblasts, step 1 will return a page containing the basic information on PHF8, including its alias, introduction, functions and disease associations, which are manually generated from the literature and also external links to NCBI, UniProt, Wikipedia and GeneCards are provided, as well as ChIP-Seq data information and ChIP-Seq data information for the related HMs in human fibroblasts. Step 2 will provide a page containing the top three motifs for PHF8 in human fibroblasts and the average profile near the TSS and TTS as well as the average gene profile, genome enrichment, average conservation profile across PHF8 ChIP-Seq peaks and a brief data summary, which could help indicate the general quality of this data set. Step 3 will generate PHF8 data set id and H3K4me3 data set id for each comparison as well as the Venn diagram, distribution of their overlap peaks, the average PHF8 and H3K4me3 ChIP-Seq signal profile in PHF8’s binding sites and the reads density plot of PHF8 and H3K4me3 in PHF8’s binding sites.