| Literature DB >> 24238150 |
Shan Zhong, Xin He, Ziv Bar-Joseph1.
Abstract
BACKGROUND: Studies of gene regulation often utilize genome-wide predictions of transcription factor (TF) binding sites. Most existing prediction methods are based on sequence information alone, ignoring biological contexts such as developmental stages and tissue types. Experimental methods to study in vivo binding, including ChIP-chip and ChIP-seq, can only study one transcription factor in a single cell type and under a specific condition in each experiment, and therefore cannot scale to determine the full set of regulatory interactions in mammalian transcriptional regulatory networks.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24238150 PMCID: PMC3898213 DOI: 10.1186/1471-2164-14-796
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Overview of PIPES. (a, b) Starting with raw PBM data for a TF represented as fluorescence intensities to each individual probes, we first infer binding probabilities to individual short k-mers, and then a given sequence can be scored by such inferred binding probabilities. (c, d) To predict in vivo TF binding, we take as input the tissue-specific DNase I hypersensitivity data (tag counts) and convert them to probabilities that represent chromatin accessibility for each position in the genome at each tissue/cell types. (e) The PBM data, DNase data and other types of data including sequence conservation are combined using an integrative model. See Methods for details.
Figure 2Top inferred -mer binding probabilities for (a) Sox12, (b) Esrra, (c) Klf7 and (d) Pou2f1. The S&W PWMs and TRANSFAC motifs (when available) for these four factors are also shown. k-mers are colored according to whether they match the consensus sequences of the primary S&W PWM (red) or secondary S&W PWM (blue). k-mers matching the TRANSFAC motifs are indicated by a "*" in the front. Only k-mers with coefficients above 0.5 (normalized so that the maximum is equal to 1) are shown.
Figure 3AUCs of different integrative methods for predicting TF binding. The first four bars are from our k-mer PBM data analysis method (black), simple baseline using only DNase data (red), or from PIPES (green and purple). Cyan bars: Simple overlapping approach that overlays PWM matches with DNase data. Orange bars: CENTIPEDE using S&W PWMs to scan sequences and an odd log score of log2(1000) as cutoff. See main text and also Additional file 1 for more details.
Figure 4Results for tissue-specificity TF activity prediction. (a) Predicted tissue activity scores for 4 TFs across 18 representative tissue/cell types. Arrows indicate known functions of the TF in the corresponding tissue as supported by literature evidence. In all cases, the highest activity score matches a known tissue for the factor. (b) Pearson correlation coefficients between tissue specific expression experiments and the activity level predicted by our method. The distribution is based on 108 permutations of the activity scores. The value from the real predictions is indicated by the arrow on the right.
Top five predicted TFs for liver, retina and B cell
| | ||
| Hnf4a | 2.22 | Essential for maintaining hepatic gene expression and lipid homeostasis
[ |
| Rara | 1.90 | Important in maintaining liver homeostasis, and its disruption is linked to hepatocarcinogenesis
[ |
| Nr2f2 | 1.56 | Expressed in liver, and known to regulate liver-specific genes
[ |
| Rxra | 1.45 | Important role in liver metabolism
[ |
| Tcf7 | 1.44 | Downstream regulator in Wnt signaling which is critical in liver physiology and pathology
[ |
| | ||
| Crx | 4.28 | Regulates photoreceptor gene expression
[ |
| Pitx3 | 4.26 | Required for normal retina formation in Xenopus and zebrafish
[ |
| E2F3 | 4.06 | Involved in retina progenitor cell development
[ |
| Pitx2 | 3.92 | Pitx2-deficient mouse exhibits ocular abnormalities
[ |
| Gsc | 3.89 | Unknown function in retina. |
| | ||
| Sfpi1 | 2.09 | Essential regulator of B-cell differentiation
[ |
| Pou2f2 | 2.08 | Required for T-cell independent B cell activation
[ |
| Spic | 1.98 | Promotes B cell differentiation
[ |
| Pou2f3 | 1.94 | Unknown function in B cell, but has almost the same binding preference as Pou2f2 |
| Elf4 | 1.70 | Regulates proliferation of B cells
[ |