| Literature DB >> 20594356 |
Denis C Bauer1, Fabian A Buske, Timothy L Bailey.
Abstract
BACKGROUND: Quantitative models for transcriptional regulation have shown great promise for advancing our understanding of the biological mechanisms underlying gene regulation. However, all of the models to date assume a transcription factor (TF) to have either activating or repressing function towards all the genes it is regulating.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20594356 PMCID: PMC2912886 DOI: 10.1186/1471-2105-11-366
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Ability of the Segal model and Reinitz model to fit the Segal single-time data.
| Segal | 344 | 0.59 (0.009) |
| Reinitz | 18 | 0.27 (0.008) |
The second column shows the number of free parameters used in the models. The third column shows the average CC over all 44 developmental CRMs. The standard error is given in parentheses.
Figure 1Accuracy achieved using the configurations suggested by the role-determining methods. The figure shows the average CC achieved when a Reinitz model was allowed to use as many different configurations as suggested by the different role-determining methods. BEST-N means the n ∈ [1-4] configurations where chosen that have the best overall accuracy. SMALLEST-OPTIMAL means the model was allowed to use as many different configurations as necessary to fit each CRM optimally (here 17). Also shown is the average CC achieved over the 44 CRMs when the literature configuration was used. The standard error is ≤ 0.009 in all cases, hence no error bars are shown.
Assigning TF roles for different CRMs.
| Kr_CD1_ru | - (Δ) | + (∪) | - (∪) | - ( | - ( | + | - | NA |
| eve_37ext_ru | - | + | + ( | + | + (Δ) | NA | - | - |
| eve_stripe2 | + | + (Δ) | - (Δ) | - | - | - | - | NA |
| hb_anterior_actv | + | NA | + ( | - | - (Δ) | - | - | NA |
| kni_+1 | + ( | - (Δ) | - ( | - | + | - (Δ) | + | - |
| run_stripe5 | + | + | - | - | - | - | - | - ( |
| s | + (∪) | s ( | - (∪) | s | s ( | - (Δ) | s ( | |
| 111 | 110 | 127 | 94 | 110 | 117 | 103 | 104 | |
The first rows give the predictions for the TF roles for six of the 44 CRMs (see Additional file 1 Section 1 for the complete set of predictions). The roles for each TF (columns) are determined by majority vote of the three methods: "∪" - SMALLEST-OPTIMAL, "β" - BEST-N and "Δ" - SENSITIVITY. "+" means activator, "-" repressor, respectively and "NA" indicates that no "strong" role prediction could be made for the CRM. The disagreeing method, if any, is shown in brackets. The second last row gives the overall prediction of the role of the TF: activator, repressor or switcher, "s" (see main text for method). The last row shows the number of "strong" role predictions summed over the 44 CRMs and three methods (total of 132).
Improvement in the ability to fit the data when dual function for Hb and Kr are allowed.
| Reinitz | 1 | 18 | 0.27 (0.008) |
| Reinitz KrDual | 2 | 19 | 0.35 (0.009) |
| Reinitz HbDual | 2 | 19 | 0.37 (0.007) |
| Reinitz HbKrDual | 4 | 20 | 0.38 (0.007) |
| Segal | 1 | 344 | 0.59 (0.009) |
Each row shows the CC for the Reinitz model using the literature configuration, and additionally Hb, Kr or Hb and Kr as switching TFs, respectively. The results are contrasted to the Segal model given in the last row. The second column shows the number of different configurations in the approach. The third column indicates the number of free parameters in the model. The fourth column shows the average CC when training on all 44 CRMs simultaneously. The reported results for the Reinitz model are averaged over five independent runs.
Figure 2Performance improvement of models using dual functioning TFs. Each panel shows for six representative CRMs the observed output of the CRM (solid grey) compared to the normalized predicted shape using different TF roles in the Reinitz model. The grey dashed line shows the prediction for the best role assignment and individual training on the CRM. In dashed red is shown the performance when using the literature configuration and trained simultaneously on all 44 CRMs. The solid coloured lines show the prediction of HbDual KrDual and HbKrDual, respectively, trained simultaneously. The model with the best over-all performance is displayed from the five independent repeats.
TFs with enriched TFBSs in CRMs where Hb or Kr are activators or repressors.
| # of CRMs | 17 | 27 | 11 | 33 |
| Abd-B | 0.0 | 0.0 | 0.0 | 0.0 |
| Deaf1 | 0.079 | 0.005 | ||
| His2B | 0.048 | 0.0 | 0.0 | 0.058 |
| Hsf* | 0.085 | 0.008 | ||
| Kr | 0.0 | 0.0 | 0.0 | 0.0 |
| Bcd | 0.0 | 0.0 | 0.0 | 0.0 |
| Br-Z4+ | ||||
| Cad | 0.002 | 0.0 | 0.0 | 0.0 |
| Hb | 0.032 | 0.0 | 0.0 | 0.0 |
| Kni | 0.032 | 0.01 | ||
| Tll | 0.003 | 0.0 | 0.0 | 0.0 |
| Ttk | 0.02 | 0.0 | 0.003 | 0.0 |
The first row shows the number of CRMs in each set. Each following row shows a TF out of the 76 tested for which a Clover [19] analysis resulted in a significant over- or under-representation in the sequences where Hb is preferred as activator (second column) or repressor (third column) or Kr preferred as activator (fourth column) or repressor (fifth column), respectively. Highlighted in bold are the cases with differential enrichment between the activator and repressor set. The TF marked with "*" is not expressed during the developmental time points C13 and C14 as determined from in-situ staining images http://www.flyexpress.net. The TF marked with "+" has a binding profile that is very similar to Bcd. Empty cells indicate that the TF was not significant for the sequence sets of Hb.
Figure 3Motifs found in protein sequences according to . Each panel shows the logo representation of a motif found in the protein sequence of all eight regulatory TFs (left column) or all TFs predicted to have dual function (right column). Motifs were found using MEME [22] in "OOPS" mode with a minimum sequence length of four and a maximum of six.
Role of TFs in comparison with presence of SUMOylation consensus motif and the role reported in the literature.
| Perkins | + | + | s | - | s | s | - | NA |
| Schroeder | + | + | s | - | - | (s) | - | + |
| Rivera-Pomar | + | + | s | - | - | s | - | NA |
| Sanchez | + | + | s | - | - | - | - | NA |
| Jaeger | + | + | s | - | s | s | s | NA |
The first row indicates the role we assign to the TFs, where "+" indicate activators, "-" indicate repressors, and "s" indicates a TF switching roles. The second row shows the number of sites found in the protein sequence of the TF that match the SUMO-consensus motif. The last rows summarizes the roles for the TFs previously reported in the literature.