| Literature DB >> 20047660 |
Matthew Hansen1, Logan Everett1, Larry Singh1, Sridhar Hannenhalli1.
Abstract
BACKGROUND: Functionally related genes tend to be correlated in their expression patterns across multiple conditions and/or tissue-types. Thus co-expression networks are often used to investigate functional groups of genes. In particular, when one of the genes is a transcription factor (TF), the co-expression-based interaction is interpreted, with caution, as a direct regulatory interaction. However, any particular TF, and more importantly, any particular regulatory interaction, is likely to be active only in a subset of experimental conditions. Moreover, the subset of expression samples where the regulatory interaction holds may be marked by presence or absence of a modifier gene, such as an enzyme that post-translationally modifies the TF. Such subtlety of regulatory interactions is overlooked when one computes an overall expression correlation.Entities:
Year: 2010 PMID: 20047660 PMCID: PMC2826332 DOI: 10.1186/1748-7188-5-4
Source DB: PubMed Journal: Algorithms Mol Biol ISSN: 1748-7188 Impact factor: 1.405
Figure 1The figure illustrates the intuition behind Mimosa. Consider a TF gene X and a potential target gene Y. The expression values of X and Y for all expression samples are shown as a heat plot and as a scatter plot. We presume that X and Y expression are correlated only in an unknown subset of samples (depicted by "+") and not in the remaining samples (denoted by "-"). Mimosa computes the maximum likelihood partition of samples. Then given the sample partition, a third gene Z with differential expression between the two partitions may represent a potential modifier. To be precise, we assign a partition probability to each sample as opposed to a binary partition.
Performance of Mimosa on synthetic data.
| 0.1 | 0.25 | 0.5 | 0.75 | 0.9 | |
|---|---|---|---|---|---|
| 0.6 (2) | 44, 14% | 1, 53% | 1, 76% | 7, 32% | 215, 5% |
| 0.8 (3) | 1, 70% | 1, 99% | 1, 100% | 1, 83% | 35, 10% |
| 0.923 (5) | 1, 99% | 1, 100% | 1, 100% | 1, 99% | 4, 30% |
Columns represent f ranges and rows represent a ranges (corresponding aspect ratio is shown in parenthesis; see §Methods). Figures in each cell are based on 100 TF-Gene pairs, and shows (1) the median rank of the correct modifier, and (2) the fraction of 100 cases where the correct modifier was top ranked based on the t-test p-value.
Figure 2Distribution of percentile ranks of the correct modifier predicted from among 6000 candidate modifiers, for the 510 experimentally determined TF-Gene-Modifier triplets. Mimosa ranks the correct modifier among the top 5% in 23% of the cases.
Figure 3The distribution of correlations among . The data used is taken from yeast TF-Gene pairs; specifically, the 6960 yeast TF-Gene pairs detected by Mimosa (see text).
GO molecular functions enriched in the putative modifiers detected for the TF-Gene pairs in S. cerevisiae based on ChIP-chip data and 314 expression samples.
| Molecular function term | % Coverage | p-value | FDR (%) |
|---|---|---|---|
| catalytic activity | 43 | 1.32E-04 | 0.22 |
| nucleotide binding | 14 | 1.27E-05 | 0.02 |
| purine nucleotide binding | 13 | 4.74E-05 | 0.08 |
| purine ribonucleotide binding | 12 | 1.49E-05 | 0.02 |
| ribonucleotide binding | 12 | 1.49E-05 | 0.02 |
| RNA binding | 11 | 1.60E-09 | 2.71E-06 |
| structural molecule activity | 10 | 4.34E-12 | 7.36E-09 |
| structural constituentof ribosome | 9 | 4.38E-22 | 7.43E-19 |
| GTP binding | 3 | 7.43E-06 | 0.01 |
| guanyl nucleotide binding | 3 | 7.43E-06 | 0.01 |
| guanyl ribonucleotide binding | 3 | 7.43E-06 | 0.01 |
| oxidoreductase activity, acting on CH-OH group of donors | 3 | 4.66E-05 | 0.08 |
| translation regulator activity | 3 | 8.71E-07 | 1.48E-03 |
| oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor | 3 | 1.10E-04 | 0.19 |
| translation factor activity, nucleic acid | 3 | 2.14E-07 | 3.62E-04 |
| GTPase activity | 2 | 2.39E-03 | 3.98 |
| rRNA binding | 2 | 8.85E-11 | 1.50E-07 |
| snoRNA binding | 2 | 8.58E-09 | 1.45E-05 |
| ligase activity, forming aminoacyl-tRNA and related compounds | 2 | 1.33E-06 | 2.25E-03 |
| ligase activity, forming carbon- oxygen bonds | 2 | 1.33E-06 | 2.25E-03 |
| aminoacyl-tRNA ligase activity | 2 | 1.33E-06 | 2.25E-03 |
| RNA helicase activity | 2 | 4.29E-05 | 0.07 |
| ATP-dependent RNA Helicase activity | 2 | 9.12E-07 | 1.54E-03 |
| RNA-dependent ATPase activity | 2 | 9.12E-07 | 1.54E-03 |
| translation initiation factor activity | 1 | 4.61E-04 | 0.78 |
Potential modulators of STAT1 activity detected by Mimosa using the known STAT1 targets and gene expression data from normal B cell and B cell cancers.
| Gene Name | Evidence |
|---|---|
| Refseq Id | [Pubmed Id for the references are provided in square brackets] |
| A Ser/Ther protein kinase that functions upstream of the JAK-STAT signal transduction pathway according to the KEGG pathway database | |
| An E2 SUMO-conjugating enzyme implicated in SUMOylation of STAT1 in conjunction with PIAS1 [12855578, 12764129]. | |
| A dual specificity protein phosphatase. STAT1 is known to be primarily regulated by reversible tyrosine phosphorylation. DUSP1 has been shown to function in a JAK2-dependent manner [14551204] and the members of the JAK family are the canonical regulators of STATs, thus suggesting DUSP1 as a potential upstream modulator of STAT1. | |
| A Ser/Thr kinase that negatively regulates the TGF- | |
| A phosphatase functioning upstream of major kinases such as AKT/PKB | |
| An early activation antigen functioning downstream of IFN- | |
| Modulates cellular growth through up-regulation of p21 [15753078], which in turn is regulated by the STAT1 homolog STAT5A [12393707]. | |
| Belongs to a family of adhesion/homing receptors which play important roles in leukocyte-endothelial cell interaction [12370391], while STAT1 also plays a crucial role in leukocyte-infiltration into the liver in T cell hepatitis [15246962]. | |
| Glucocorticoid-induced leucine zipper (GILZ) interacts with NF-kappaB | |
| An interferon regulatory factor 7, belonging to the same TF family as two known STAT1 co-factors, IRF-1 and IRF8 [18929502]. | |
| Co-induced with STAT3 by HIV-1 gp120 [12089333]. | |
| Related to POLR2J. | |
| A zinc finger transcription factor known to be a HNF-4 | |
| A proteasomal ubiquitin receptor whose expression has been shown to be induced by IFN- | |
| A 26S proteasome non-ATPase regulatory subunit involved in the processing of class I MHC peptides [8811196]. | |
| | Interferon-induced transmembrane proteins. These may be involved in STAT1 modulation, or they may be downstream of a pathway, most likely IFN- |
| MHC class I genes. The function of this class of genes is well-characterized as cell-surface antigen presenters, and it is difficult to imagine how these genes might function upstream of STAT1. A more likely explanation is that they are activated downstream of, or in parallel to, STAT1 by another gene which also functions as a STAT1 modulator or co-factor. It is particularly striking that all of these genes belong to MHC class I, and none in MHC class II, which are known to be regulated by STAT1 [18929502]. | |
Figure 4The figure shows (1) The distribution of Log-Likelihood ratios for randomly generated (normal, i.i.d.) expression data for 400, and 1200 samples, permuted 20,000 times, (2) . The "null" distribution is defined by f = 0, implying an absence of a mixture.