| Literature DB >> 26425553 |
Guohua Wang1, Fang Wang2, Qian Huang2, Yu Li3, Yunlong Liu4, Yadong Wang2.
Abstract
Transcription factors are proteins that bind to DNA sequences to regulate gene transcription. The transcription factor binding sites are short DNA sequences (5-20 bp long) specifically bound by one or more transcription factors. The identification of transcription factor binding sites and prediction of their function continue to be challenging problems in computational biology. In this study, by integrating the DNase I hypersensitive sites with known position weight matrices in the TRANSFAC database, the transcription factor binding sites in gene regulatory region are identified. Based on the global gene expression patterns in cervical cancer HeLaS3 cell and HelaS3-ifnα4h cell (interferon treatment on HeLaS3 cell for 4 hours), we present a model-based computational approach to predict a set of transcription factors that potentially cause such differential gene expression. Significantly, 6 out 10 predicted functional factors, including IRF, IRF-2, IRF-9, IRF-1 and IRF-3, ICSBP, belong to interferon regulatory factor family and upregulate the gene expression levels responding to the interferon treatment. Another factor, ISGF-3, is also a transcriptional activator induced by interferon alpha. Using the different transcription factor binding sites selected criteria, the prediction result of our model is consistent. Our model demonstrated the potential to computationally identify the functional transcription factors in gene regulation.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26425553 PMCID: PMC4573618 DOI: 10.1155/2015/757530
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Overlapping between transcription factors binding regions and DHS sites. The blue bar and red bar represent the percentage of transcription factors that overlap and do not overlap with the DNase I hypersensitive sites, respectively.
Figure 2The histogram of TFCV scores for 2182 known PWMs. The x-axis is TFCV score and the y-axis is the frequency of the occurrence of TFCV for all known PWM.
Transcription factor's contribution value (TFCV) and estimated TFs' functional levels (TFL) of top 10 selected PWMs.
| Index | ID | TF name | PWM description | TFCV | TFL |
|---|---|---|---|---|---|
| 1 | M00772 | IRF | Interferon regulatory factor family | 326.928 | 14830.189 |
| 2 | M01882 | IRF-2 | Interferon regulatory factor 2 | 325.779 | 14680.555 |
| 3 | M02771 | IRF-9 | Interferon regulatory factor 9 | 322.969 | 15127.858 |
| 4 | M00258 | ISGF-3 | Interferon-stimulated response element | 320.613 | 9914.496 |
| 5 | M01881 | IRF-1 | Interferon regulatory factor 1 | 320.501 | 15363.707 |
| 6 | M02767 | IRF-3 | Interferon regulatory factor 3 | 317.408 | 11305.011 |
| 7 | M00699 | ICSBP | Interferon consensus sequence-binding protein | 314.717 | 7242.987 |
| 8 | M00248 | Oct-1 | Octamer factor 1 | 313.642 | 6287.612 |
| 9 | M01235 | IPF1 | Homeodomain-containing transactivator | 310.253 | 6593.312 |
| 10 | M01857 | AP-2 alpha | Activating enhancer binding protein 2 alpha | 309.403 | −3725.557 |
Figure 3TFCV profile of 5 selected highest TFBS candidate models. The spectra of TFCV of all the PWMs while the threshold of potential TFBS is the 5000th, 4000th, 3000th, 2000th, or 1000th highest similarity score for each PWM. The x-axis corresponds to 2188 PWMs and the y-axis corresponds to TFCV scores.
Figure 4The cross-correlation coefficients of TFCV score among 5 selected highest TFBS candidate models.
The top 10 transcription factors with the highest TFCV score in 5 selected highest TFBS candidate model.
| Index | Top 1000 | Top 2000 | Top 3000 | Top 4000 | Top 5000 |
|---|---|---|---|---|---|
| 1 | ICSBP | IRF-9 | IRF-2 | IRF-2 | IRF |
| 2 | IRF | IRF | IRF | IRF | IRF-2 |
| 3 | IRF-3 | ICSBP | IRF-9 | IRF-9 | IRF-9 |
| 4 | ISGF-3 | IRF-3 | IRF-1 | ISGF-3 | ISGF-3 |
| 5 | IRF-9 | IRF-2 | IRF-3 | IRF-1 | IRF-1 |
| 6 | IRF-1 | ISGF-3 | ISGF-3 | IRF-3 | IRF-3 |
| 7 | IRF | IRF-1 | ICSBP | ICSBP | ICSBP |
| 8 | EAR2 | IRF-7 | EAR2 | Oct-1 | Oct-1 |
| 9 | IRF-5 | IRF-1 | IRF-1 | IPF1 | IPF1 |
| 10 | RREB-1 | EWSR1-FLI1 | Lim1 | AP-2 alpha | AP-2 alpha |