| Literature DB >> 16584535 |
Martino Barenco1, Daniela Tomescu, Daniel Brewer, Robin Callard, Jaroslav Stark, Michael Hubank.
Abstract
Full exploitation of microarray data requires hidden information that cannot be extracted using current analysis methodologies. We present a new approach, hidden variable dynamic modeling (HVDM), which derives the hidden profile of a transcription factor from time series microarray data, and generates a ranked list of predicted targets. We applied HVDM to the p53 network, validating predictions experimentally using small interfering RNA. HVDM can be applied in many systems biology contexts to predict regulation of gene activity quantitatively.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16584535 PMCID: PMC1557743 DOI: 10.1186/gb-2006-7-3-r25
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Model based estimation of activity profile of p53. (a) Markov Chain Monte Carlo output for potential transcription factor activity profile values for first time series replicate at 4 hours (x axis) and 6 hours (y axis). (b) Concentration of p21transcript determined by real-time polymerase chain reaction after addition of actinomycin D (10 μg/ml) to irradiated (5 Gy, 4 hours) MOLT4 cells cultured in RPMI. Expressed as percentage of initial concentration. (c) Using the degradation rate of p21dramatically restricted the range of solutions to the Markov Chain Monte Carlo.
Figure 2Parameter estimation for a training set of five known p53 targets. (a) The model equation was solved to estimate values for the parameters basal transcription Bj sensitivity Sj, and degradation Dj for the five p53 targets DDB2, p21, SESN1/hPA26, BIK, and TNFRSF10b/TRAILreceptor 2. (b) Simultaneously, the activity profile f(t) of p53 was derived from three separate microarray time courses.
Figure 3Experimentally determined p53 activity profile. The activity profile of p53 was measured by Western blot to determine the levels of ser-15 phosphorylated p53 (ser15P-p53). ser-15 phosphorylation is a measure of p53 activity. IR, ionizing radiation. IR, ionizing irradiation.
Figure 4Choice and number of training set genes does not significantly affect the predicted activity profile. (a) Predicted activity profile of p53 derived using different numbers of known targets in the training set, from three to ten genes. (b) Predicted activity profile of p53 derived using 100 combinations of three randomly selected training set genes from a pool of 10 known targets.
Figure 5Hidden variable dynamic modeling screening of upregulated genes. Model predicted profile (red) and experimental expression profile (black) of typical genes representing two classes of model prediction (class 1 and class 2). (a) Class 1 genes with good model score (M < 100) and high sensitivity P value (sensitivity Z score > 2; for example LRMP). (b) Class 1 genes with atypical expression profiles (for example, p53TG1); this profile occurs because of a low predicted degradation rate. (c,d) Two class 2 genes with low model score (M > 100) but high sensitivity P value (sensitivity Z score > 2; for example, TNFSF10 and IER3).
Top 50 genes predicted by hidden variable dynamic modeling to be p53 regulated, ranked by sensitivity Z score
| Gene title | Gene symbol | Affymetrix identifier | Model score (M) | Sensitivity (Z score) | RNAi validation score |
| 203409_at | 18.74 | 18.24 | 10.74 | ||
| CD38 antigen (p45) | 205692_s_at | 36.69 | 14.77 | 9.02 | |
| 207813_s_at | 79.82 | 13.19 | 7.72 | ||
| Hypothetical protein FLJ22457 | 221081_s_at | 60.45 | 11.01 | 6.33 | |
| 213293_s_at | 41.36 | 10.99 | 6.07 | ||
| Carnitine O-octanoyltransferase | 204573_at | 84.40 | 10.98 | 3.80 | |
| Glutaminase 2 (liver, mitochondrial) | 205531_s_at | 42.83 | 10.28 | 2.52 | |
| 219019_at | 78.80 | 9.90 | 3.09 | ||
| Hect domain and RLD 5 | 219863_at | 37.65 | 9.55 | 1.91 | |
| 208796_s_at | 17.04 | 9.37 | 5.18 | ||
| 205780_at | 19.43 | 9.35 | 6.57 | ||
| Activating signal cointegrator 1 complex subunit 3 | 212815_at | 60.34 | 9.26 | 5.93 | |
| 218346_s_at | 8.37 | 9.25 | 3.90 | ||
| 219628_at | 41.33 | 9.19 | 3.70 | ||
| 209295_at | 27.34 | 9.05 | 6.52 | ||
| Chromosome 6 open reading frame 4 | 215411_s_at | 86.45 | 8.81 | 6.64 | |
| 202284_s_at | 24.98 | 8.40 | 8.07 | ||
| 216396_s_at | 88.04 | 8.20 | 4.09 | ||
| 206571_s_at | 62.88 | 7.54 | 1.88 | ||
| Lymphoid-restricted membrane protein | 204674_at | 26.92 | 7.36 | 3.40 | |
| 209375_at | 43.09 | 7.36 | 5.80 | ||
| TNF (ligand) superfamily, member 4 (Ox40L) | 207426_s_at | 34.73 | 7.15 | 5.26 | |
| 33132_at | 77.75 | 7.09 | -1.44 | ||
| AMP-activated protein kinase, beta 1 subunit | 201834_at | 25.72 | 7.01 | 6.30 | |
| Transducer of ERBB2, 1 | 202704_at | 92.69 | 6.79 | 5.78 | |
| 218403_at | 48.33 | 6.50 | 7.75 | ||
| Sortilin-related receptor, L(DLR class) | 203509_at | 15.66 | 6.34 | 1.70 | |
| 216252_x_at | 44.31 | 6.23 | 4.54 | ||
| 201477_s_at | 46.58 | 6.19 | 0.41 | ||
| Archaemetzincins-2 | 218167_at | 37.48 | 6.16 | 1.22 | |
| Galactose-3-O-sulfotransferase 4 | 219815_at | 38.62 | 5.97 | 3.12 | |
| 203725_at | 84.23 | 5.89 | 11.05 | ||
| 218627_at | 7.23 | 5.87 | 3.56 | ||
| Major histocompatibility complex, class I, B | 209140_x_at | 89.77 | 5.79 | 0.63 | |
| Testis specific, 10 | 220623_s_at | 20.85 | 5.67 | 0.47 | |
| Hypothetical protein MDS025 | 218288_s_at | 31.35 | 5.66 | 2.38 | |
| 209917_s_at | 22.22 | 5.65 | 4.05 | ||
| Leukemia inhibitory factor | 205266_at | 14.86 | 5.62 | 3.42 | |
| Interferon stimulated exonuclease gene 20 kDa-like 1 | 219361_s_at | 48.55 | 5.56 | 5.43 | |
| Lymphoid-restricted membrane protein | 35974_at | 42.06 | 5.56 | 3.69 | |
| Integral membrane protein 2B | 217732_s_at | 20.25 | 5.52 | -0.19 | |
| 210405_x_at | 46.05 | 5.52 | 1.69 | ||
| 208070_s_at | 65.17 | 5.45 | 6.73 | ||
| 210886_x_at | 30.15 | 5.42 | 2.88 | ||
| 221640_s_at | 55.27 | 5.31 | 1.54 | ||
| AMP-activated protein kinase, beta 1 | 201835_s_at | 25.45 | 5.27 | 5.92 | |
| 201577_at | 83.39 | 5.15 | 3.38 | ||
| Tubulin, gamma 1 | 201714_at | 41.74 | 5.09 | 0.02 | |
| Solute carrier family 7, member 6 | 203579_s_at | 18.59 | 4.98 | 2.56 | |
| 209849_s_at | 21.02 | 4.92 | 1.11 |
Low model scores and higher Z score constitute better model fits. The data are compared with validation scores for gene sensitivity to small interfering (si)RNAp53 (higher = better). Plain text indicates genes not previously recorded as p53 targets. Bold text indicates experimentally demonstrated p53 targets.
Figure 6Small interfering (si)RNAp53 reduces p53 protein levels and transcription of p53 target genes. (a) Transfection of siRNAp53 reduces p53 protein levels below control values. (b) Real-time quantitative polymerase chain reaction measurement of three p53 target genes (GADD45α, p21, and HDM2) and a control gene (GAPDH) after transfection of siRNAp53 and irradiation. IR, ionizing irradiation.
Figure 7Model validation. (a) Effect of small interfering (si)RNAp53 on irradiation (5 Gy) induced change in transcript levels at 4 hours of the 74 class 1 genes. (b) Effect of altering Sj Z score threshold for class 1 on proportion of true targets identified (% of p53 upregulated genes at 4 hours predicted; black line) and accuracy of class 1 predictions (percentage of predictions made that were verified by siRNAp53; red line). Accuracy and proportion of the data explained reveal an inverse relationship. (c) Individual comparison of the effect of siRNAp53 on 74 class 1 genes with the best M and p53 sensitivity Sj score, ranked by sensitivity. Bars represent the validation score, a Z score measuring the effectiveness of siRNAp53 on reducing post-irradiation upregulation of transcript. Higher scores indicate effective blocking of the response.
Figure 8Model performance. Distribution of 459 upregulated genes that pass degradation filter based on model score and predicted sensitivity to p53. Sj Z score = 3 and model = 100 thresholds are shown. A total of 115 Genes verified as p53 targets at 4 hours are shown in red.
Figure 9K means clustering of upregulated genes based on expression values. A total of 754 upregulated genes were optimally grouped into eight K means clusters (C1 to C8). The 50 best hidden variable dynamic modeling predictions (Table 1) are split among six clusters (highlighted in yellow). Accurate prediction of p53 targets is therefore not possible using K means at this level.