| Literature DB >> 19536199 |
Anthony Gitter1, Zehava Siegfried, Michael Klutstein, Oriol Fornes, Baldo Oliva, Itamar Simon, Ziv Bar-Joseph.
Abstract
The complementarity of gene expression and protein-DNA interaction data led to several successful models of biological systems. However, recent studies in multiple species raise doubts about the relationship between these two datasets. These studies show that the overwhelming majority of genes bound by a particular transcription factor (TF) are not affected when that factor is knocked out. Here, we show that this surprising result can be partially explained by considering the broader cellular context in which TFs operate. Factors whose functions are not backed up by redundant paralogs show a fourfold increase in the agreement between their bound targets and the expression levels of those targets. In addition, we show that incorporating protein interaction networks provides physical explanations for knockout effects. New double knockout experiments support our conclusions. Our results highlight the robustness provided by redundant TFs and indicate that in the context of diverse cellular systems, binding is still largely functional.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19536199 PMCID: PMC2710864 DOI: 10.1038/msb.2009.33
Source DB: PubMed Journal: Mol Syst Biol ISSN: 1744-4292 Impact factor: 11.429
Figure 1Change in overlap for a range of P-value thresholds. The figure plots the overlap as the percentage of the binding interactions and knockout effects (right y axis) compared with the expected binding and knockout overlaps. The figure is overlaid with the significance of the overlap calculated using the hypergeometric distribution (left y axis). Note the significance peak between P-values of 0.001 and 0.005 (inset).
Figure 2Improved overlap between binding and knockout experiments. (A) A schematic view of our analysis. Both sequence homology and shared interactions may lead to one TF compensating for another. Here, the yellow TF can replace the green TF when it is knocked out and is able to recruit the transcription machinery leading to only small overlap between binding and knockout results. (B) The binding and knockout overlap for various subsets of the data. The P-value of the overlap is given above the columns, which indicate percentage overlap for the whole network analyses. Cleaning the data by removing genes reacting non-specifically to the stress of a knockout (‘general KO genes') and interactions not supported by sequence conservation improves the overlap and its significance. In addition, TFs without redundant paralogs have greater overlap.
Analysis of overlap based on paralogs and shared PPIs
Transcription factors were divided into four groups based on their most similar TF homolog as determined by the BLASTP E-values. These sets were further divided based on the percentage of PPI a TF shared with its paralog. TFs with a putative paralog that share at least 20% PPI are more likely to be redundant and thus exhibit lower overlap.
Figure 3Influence of physical interaction networks. TFs that do not directly bind a gene can exert influence through pathways of PPI and protein–DNA interactions. (A) A network consisting of YHR206W (red), its knockout targets (shades of blue), and 20% of all other yeast genes selected at random (shades of gray). Genes are arranged around YHR206W according to the shortest number of interaction edges needed to reach them. The black and dark blue nodes correspond to genes that are three or more interactions away, the medium gray and medium blue genes are two interactions away, and the light gray and light blue genes are a single interaction from YHR206W. In all, 85% of YHR206W's knockout-affected genes are either directly bound by YHR206W or another TF that can be reached through paths of length 1 or 2. (B) As longer paths in the network are examined, a much higher percentage of the knockout-affected genes are connected to the deleted TF. The P-value of the overlap is given above the columns, which indicate percentage overlap.