| Literature DB >> 23656909 |
Chia-Chun Yang1, Michael J Buck, Min-Hsuan Chen, Yun-Fan Chen, Hsin-Chi Lan, Jeremy J W Chen, Chao Cheng, Chun-Chi Liu.
Abstract
BACKGROUND: Specific chromatin structures are associated with active or inactive gene transcription. The gene regulatory elements are intrinsically dynamic and alternate between inactive and active states through the recruitment of DNA binding proteins, such as chromatin-remodeling proteins.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23656909 PMCID: PMC3655836 DOI: 10.1186/1471-2164-14-310
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Overview of the algorithm. (A) To identify accessible and inaccessible chromatin regions, we mapped FAIRE-seq reads to a diploid genome (GM12878) using Bowtie without any mismatch. (B) We identified differential chromatin-state regions (DCSRs) and then separated DCSRs into accessible and inaccessible chromatin groups. (C) To discover chromatin motifs, we used all combinations of DNA sequences from 6mer to 10mer (motif candidates) to scan all of the DCSRs, each of which has 41 bases in total. For each motif candidate, we calculated occurrence frequency, and then performed paired t-tests between frequency vectors of the inaccessible chromatin group and accessible chromatin group. We selected chromatin motifs with P values < 0.01 by the paired t-tests. (D) To annotate the motifs, we used the motif sequences and then performed BLAST alignment against the entire transcription factor binding site sequences in both the TFe and hPDI databases. For each motif, we annotated the motif using the best E-value of alignment and homology > 0.8. (E) To provide more accurate annotation, we filtered out the transcript factors that are not expressed (FPKM < 1) by the RNA-seq data (SRP007417).
Chromatin motifs in genome-wide, intergenic, genic, and promoter regions (P value < 0.01)
| Genome-wide | 7829 | 245 | 163 (66.5%) | CACGTG |
| Intergenic regions | 5464 | 166 | 113 (68.1%) | CACGTG |
| Genic regions | 2262 | 156 | 100 (64.1%) | CAGGCTGGA |
| Promoter regions | 103 | 0 | N/A | N/A |
Figure 2The hypothesis of chromatin motifs. (A) A chromatin motif in the inaccessible state. The TF may bind to the motif with co-activators and then make the chromatin accessible. (B) A chromatin motif in the accessible state. The TF may bind to the motif with co-repressors and then make the chromatin inaccessible.
Top ten chromatin motifs and TF annotation
| CACGTG | CACGTG | 7.61E-08 | | [ | |||
| | CTCCTGAC | GTCAGGAG | 2.36E-06 | | | [ | |
| | CTGCAG | CTGCAG | 3.34E-06 | | | [ | |
| | TTTAGTAGAG | CTCTACTAAA | 3.42E-06 | | | | |
| | AGTAGAGA | TCTCTACT | 5.71E-06 | | | | |
| | GTGAAACCC | GGGTTTCAC | 7.98E-06 | | | [ | |
| | GGTGAAACCC | GGGTTTCACC | 1.23E-05 | | | [ | |
| | CTCCTGACCT | AGGTCAGGAG | 1.33E-05 | [ | |||
| | AGGTCAGGA | TCCTGACCT | 1.41E-05 | [ | |||
| | TCAGGA | TCCTGA | 1.85E-05 | | | | |
| CACGTG | CACGTG | 1.53E-07 | | [ | |||
| | TGTATACA | TGTATACA | 1.59E-05 | | | | |
| | ATCACAA | TTGTGAT | 2.45E-05 | | | [ | |
| | ATGTATACA | TGTATACAT | 3.20E-05 | | | | |
| | GGGTTTCAC | GTGAAACCC | 3.63E-05 | | | [ | |
| | GGTGAAACCC | GGGTTTCACC | 5.67E-05 | | | [ | |
| | ATGTATACAT | ATGTATACAT | 5.87E-05 | | | | |
| | GGTGAAAC | GTTTCACC | 8.17E-05 | | ZNF193 RAB18 | | |
| | CCCGGG | CCCGGG | 1.24E-04 | | | | |
| | GGTGAAACC | GGTTTCACC | 1.37E-04 | | ZNF193 RAB18 | | |
| CAGGCTGGA | TCCAGCCTG | 1.58E-06 | | | | | |
| | ACTCCAGCCT | AGGCTGGAGT | 2.46E-06 | | | | |
| | CTCCAGCCTG | CAGGCTGGAG | 2.66E-06 | | | | |
| | TCCAGCCTGG | CCAGGCTGGA | 6.96E-06 | | PIR | [ | |
| | AGGCTGGA | TCCAGCCT | 6.96E-06 | | | | |
| | GGAGGA | TCCTCC | 1.16E-05 | | | | |
| | CTCCAGCCT | AGGCTGGAG | 1.17E-05 | | | | |
| | TCAGAT | ATCTGA | 1.57E-05 | | | | |
| | GGAGTG | CACTCC | 1.57E-05 | | | | |
| CACTCCA | TGGAGTG | 2.17E-05 | [ | ||||
MYC allele-specific binding associates and DCSR/NDCSR
| DCSR | 31 | 36 |
| NDCSR | 21 | 61 |
Figure 3An example motif CTC. Assume that a motif candidate is the sequence CTC. We calculated the number of hits (frequency of sequences) among all sliding windows for each DCSR. The frequency vectors of the inaccessible and accessible chromatin groups are [2, 2, 0] and [1, 0, 0], respectively. Next, we performed a paired t-test between these two frequency vectors to determine the significance of this motif in terms of differential chromatin states.