| Literature DB >> 18811975 |
Abstract
BACKGROUND: A gene regulatory module (GRM) is a set of genes that is regulated by the same set of transcription factors (TFs). By organizing the genome into GRMs, a living cell can coordinate the activities of many genes in response to various internal and external stimuli. Therefore, identifying GRMs is helpful for understanding gene regulation.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18811975 PMCID: PMC2575218 DOI: 10.1186/1471-2164-9-439
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1The flowchart of HIMIA.
Figure 2The 29 GRMs identified in this study. Each rectangle stands for a module and the ovals in a rectangle indicate the TFs that regulate the module. A TF name is colored blue if it is known to be involved in heat shock response but black otherwise. The periphery of a rectangle is colored purple if the module has at least one enriched MIPS functional category but black otherwise. An oval is colored green if the TF's function is consistent with at least one of the module's enriched MIPS functional categories. Two ovals are connected by an undirected red line if these two TFs have physical interactions indicated by the current protein-protein interaction data [22]. Two ovals are connected by a directed black line if the two TFs have genetic interactions indicated by mutant or ChIP-chip data [20,21]. For example, Msn2→Yap1 means that either TF Msn2 binds to the promoter of gene YAP1 or the disruption of TF Msn2 results in a significant change of the expression of gene YAP1.
The number of inferred heat-inducible target genes regulated by each of the 12 heat-responsive TFs
| TF name | # of inferred heat-inducible target genes |
| 73 | |
| 67 | |
| 64 | |
| 48 | |
| 29 | |
| 23 | |
| Cst6 | 14 |
| Ste12 | 14 |
| Rox1 | 13 |
| Ume6 | 8 |
| Dig1 | 7 |
| 6 |
Seven well-known heat-responsive TFs are bold-faced and colored blue. The TFs are ordered by the number of their inferred heat-inducible target genes.
Figure 3The interaction network formed by the 12 identified heat-responsive TFs. An oval indicates an identified heat-responsive TF. A TF name is colored blue if it is known to be involved in heat shock response but black otherwise. Two ovals are connected by an undirected red line if the two TFs have physical interactions indicated by the current protein-protein interaction data [22]. Two ovals are connected by a directed green line if these two TFs have genetic interactions indicated by the mutant or ChIP-chip data [20,21]. Note that the 12 identified heat-responsive TFs form a highly connected network of interactions.
Identified activators of GRMs
| Activator identified | Module # ( |
| Cst6 | #2 (3.7 × 10-4), #3 (3.7 × 10-4), #15 (6.4 × 10-5), #18 (1 × 10-3) |
| Hsf1 | #14 (6.8 × 10-5), #21 (3.7 × 10-4), #27 (2.4 × 10-13) |
| Msn2 | #12 (8.5 × 10-5), #13 (4.1 × 10-5), #17 (3.3 × 10-5), #19 (8.6 × 10-4), #25 (5.8 × 10-13) |
| Msn4 | #13 (1.2 × 10-8), #26 (<1 × 10-13) |
| Rlm1 | #24 (1.9 × 10-4), #29 (2 × 10-10) |
| Yap1 | #1 (9.7 × 10-6), #12 (9 × 10-9), #14 (5.3 × 10-7), #20 (3.3 × 10-7) |
A TF is said to be an activator of a module if the p-value for observing so many TF-gene pairs in the module each with a significant positive correlation is less than 0.001.
Figure 4Refining clusters of genes involved in the same cellular process. (A) Refining the cluster of genes involved in protein (re)folding. HIMIA assigns the 20 identified genes that are involved in protein (re)folding into five modules. (B) Refining the cluster of genes involved in protein degradation. HIMIA assigns the 16 identified genes that are involved in protein degradation into five modules.
Performance comparison of five module inference tools
| Algorithm name | TP | FP | FN | Jaccard similarity score |
| HIMIA | 7 | 5 | 25 | 0.19 |
| ReMoDiscoery | 11 | 38 | 21 | 0.16 |
| GRAM | 13 | 56 | 19 | 0.15 |
| MOFA | 8 | 32 | 24 | 0.13 |
| SAMBA | 10 | 54 | 22 | 0.12 |
Performance comparison was based on the Jaccard similarity score [12,34], which scores the overlap between an algorithm's output and the list of known stress-responsive TFs. Specifically, the score is defined as TP/(TP+FP+FN) [12,34], where TP stands for true positives, FP for false positives, and FN for false negatives. Note that the higher the Jaccard similarity score, the better the ability of an algorithm to retrieve the known stress-responsive TFs.
Randomization tests of HIMIA
| Jaccard similarity score | |
| HIMIA (using original input data) | 0.19 |
| HIMIA (using randomized gene expression data) | 0.025 (mean), 0.023 (standard deviation) |
| HIMIA (using randomized ChIP-chip data) | 0.009 (mean), 0.016 (standard deviation) |
Two randomization tests were performed. First, we randomly permuted the gene expression data 1000 times. Second, we randomly permuted the ChIP-chip data 1000 times. When either of the two randomized data sets was used as the input, HIMIA could no longer effectively identify the known stress-responsive TFs, indicating that the output of HIMIA is statistically meaningful and is far beyond random expectation.