| Literature DB >> 15598612 |
Takeharu Yamanaka1, Hiroyoshi Toyoshiba, Hideko Sone, Frederick M Parham, Christopher J Portier.
Abstract
One major unresolved issue in the analysis of gene expression data is the identification and quantification of gene regulatory networks. Several methods have been proposed for identifying gene regulatory networks, but these methods predominantly focus on the use of multiple pairwise comparisons to identify the network structure. In this article, we describe a method for analyzing gene expression data to determine a regulatory structure consistent with an observed set of expression profiles. Unlike other methods this method goes beyond pairwise evaluations by using likelihood-based statistical methods to obtain the network that is most consistent with the complete data set. The proposed algorithm performs accurately for moderate-sized networks with most errors being minor additions of linkages. However, the analysis also indicates that sample sizes may need to be increased to uniquely identify even moderate-sized networks. The method is used to evaluate interactions between genes in the SOS signaling pathway in Escherichia coli using gene expression data where each gene in the network is over-expressed using plasmids inserts.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15598612 PMCID: PMC1247658 DOI: 10.1289/txg.7105
Source DB: PubMed Journal: Environ Health Perspect ISSN: 0091-6765 Impact factor: 9.031
Figure 1A simple gene interaction network consisting of four genes.
Figure 2Network linkages of key genes in the SOS response in E. coli as identified by the TAO-Gen algorithm.
Figure 3A literature-based linkage map between genes in the SOS response in E. coli. The map represents inducible genes/proteins in the SOS response for repair from DNA damage. Black lines indicate pathways in the normal repair process and red lines with arrows activation/induction due to an exposure to damaging agents. Recombination and repair, DNA damage–inducible protein, nucleotide excision repair, error-prone repair, and stationary-phase regulator have family molecules in each box. Green circles are genes used for the analysis.
Estimated means, standard deviations and percentage above 0 for all interactions in SOS response genes for E. coli identified as linked by the TAO-Gen algorithm (see Figure 2).
| From | To | Type | Mean | SD | % < 0 |
|---|---|---|---|---|---|
| Activate | 0.435 | 0.065 | 0.00 | ||
| Activate | 0.137 | 0.056 | 0.99 | ||
| Activate | 0.393 | 0.161 | 0.93 | ||
| Activate | 0.365 | 0.129 | 0.42 | ||
| Repress | −0.356 | 0.091 | 99.97 | ||
| Activate | 0.193 | 0.093 | 2.06 | ||
| Repress | −0.158 | 0.065 | 98.86 | ||
| Repress | −0.287 | 0.156 | 96.61 | ||
| Repress | −0.550 | 0.169 | 99.85 | ||
| Repress | −0.077 | 0.029 | 99.46 | ||
| Activate | 0.512 | 0.204 | 0.81 | ||
| Activate | 0.031 | 0.012 | 0.55 | ||
| Activate | 0.496 | 0.108 | 0.02 | ||
| Plasmid insert | Activate | 0.458 | 0.080 | 0.00 | |
| Activate | 0.396 | 0.041 | 0.00 | ||
| Activate | 2.443 | 0.039 | 0.00 | ||
| Activate | 0.062 | 0.130 | 30.95 | ||
| Activate | 1.188 | 0.110 | 0.00 | ||
| Activate | 1.007 | 0.093 | 0.00 | ||
| Activate | 1.409 | 0.069 | 0.00 | ||
| Activate | 3.319 | 0.074 | 0.00 | ||
| Activate | 0.513 | 0.100 | 0.00 | ||
| MMC | Activate | 0.979 | 0.282 | 0.06 | |
| Activate | 0.479 | 0.108 | 0.05 | ||
| Activate | 0.637 | 0.345 | 3.28 | ||
| Activate | 0.896 | 0.282 | 0.07 | ||
| Activate | 0.969 | 0.252 | 0.05 | ||
| Activate | 0.460 | 0.221 | 2.12 | ||
| Activate | 1.233 | 0.204 | 0.00 | ||
| Activate | 1.255 | 0.248 | 0.00 |
Results from 100,000 Monte Carlo simulations of four hypothetical four-gene networks (A, B, C, D) describing the ability of the TAO-Gen algorithm to specify the correct network.
| Frequency (%) of resulting optimal network structure
| Rank (%) of the posterior likelihood for the true network over all possible 543 acyclic networks
| |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sample size | True model | 1 | 2 | 3 | 4–10 | |||||||
| 100 arrays | A | 922 (92) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 68 (7) | 0 (0) | 922 (92) | 52 (5) | 10 (1) | 16 (2) |
| 1,000 sims | B | 977 (98) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 6 (1) | 0 (0) | 977 (98) | 17 (2) | 4 (0.4) | 2 (0.2) |
| C | 929 (93) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 71 (7) | 0 (0) | 929 (93) | 50 (5) | 8 (1) | 13 (1) | |
| D | 980 (98) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 6 (1) | 0 (0) | 980 (98) | 13 (1) | 5 (0.5) | 2 (0.2) | |
| 50 arrays | A | 1,716 (86) | 4 (0.2) | 3 (0.2) | 6 (0.3) | 4 (0.2) | 165 (8) | 0 (0) | 1,716 (87) | 157 (8) | 34 (2) | 70 (4) |
| 2,000 sims | B | 1,841 (92) | 8 (0.4) | 0 (0) | 4 (0.2) | 8 (0.4) | 41 (2) | 0 (0) | 1,841 (92) | 82 (4) | 20 (1) | 55 (3) |
| C | 1,745 (87) | 6 (0.3) | 4 (0.2) | 3 (0.2) | 6 (0.3) | 175 (9) | 0 (0) | 1,745 (88) | 128 (6) | 41 (2) | 62 (3) | |
| D | 1,860 (93) | 4 (0.2) | 0 (0) | 2 (0.1) | 0 (0) | 46 (2) | 0 (0) | 1,860 (93) | 68 (3) | 30 (2) | 42 (2) | |
| 25 arrays | A | 2,920 (73) | 76 (2) | 72 (2) | 56 (1) | 77 (2) | 328 (8) | 3 (0.1) | 2,920 (73) | 423 (10) | 112 (3) | 387 (10) |
| 4,000 sims | B | 3,179 (80) | 92 (2) | 55 (1) | 48 (1) | 47 (1) | 192 (5) | 8 (0.2) | 3,179 (79) | 348 (9) | 133 (3) | 249 (6) |
| C | 2,891 (72) | 60 (1) | 100 (2) | 56 (1) | 76 (2) | 296 (7) | 4 (0.1) | 2,891 (72) | 404 (10) | 114 (3) | 444 (11) | |
| D | 3,086 (77) | 76 (2) | 96 (2) | 48 (1) | 48 (1) | 164 (4) | 8 (0.2) | 3,086 (77) | 328 (8) | 149 (4) | 365 (9) | |
| 10 arrays | A | 3,198 (32) | 909 (9) | 741 (7) | 230 (2) | 149 (2) | 328 (3) | 497 (5) | 3,198 (32) | 1,027 (10) | 781 (8) | 2,389 (24) |
| 10,000 sims | B | 3,768 (38) | 1,002 (10) | 1,051 (10) | 220 (2) | 309 (3) | 378 (4) | 567 (6) | 3,768 (38) | 966 (10) | 821 (8) | 2,519 (25) |
| C | 3,177 (32) | 892 (9) | 691 (7) | 230 (2) | 151 (2) | 398 (4) | 457 (5) | 3,177 (32) | 1,232 (12) | 769 (8) | 2,347 (23) | |
| D | 3,768 (38) | 1,052 (10) | 1,031 (10) | 280 (3) | 259 (3) | 538 (5) | 477 (5) | 3,768 (38) | 1,146 (11) | 871 (9) | 2,371 (24) | |
(A) β14 = 2.0, β13 = 0.8, β23 = 0.8, β34 = −1.3, σ1 = σ2 = σ3 = σ4 = 1.0
(B) β14 = 2.0, β13 = 0.8, β23 = 0.8, β34 = −5.0, σ1 = σ2 = σ3 = σ4 = 1.0
(C) β14 = 2.0, β13 = 0.8, β23 = 0.8, β34 = −1.3, σ1 = σ2 = σ3 = σ4 = 1/3
(D) β14 = 2.0, β13 = 0.8, β23 = 0.8, β34 = −5.0, σ1 = σ2 = σ3 = σ4 = 1/3
Figure 4A hypothetical eight gene network used for the Monte-Carlo simulations in Table 3. The numbers attached to the arrows show linear parameters, where positive numbers correspond to up-regulations and negative numbers down-regulations.
Number (percent) of linkages between two genes identified by the TAO-Gen algorithm in 1,000 Monte Carlo simulations of the hypothetical eight-gene network shown in Figure 3.
| From gene number | To cell number
| ||||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | ||
| 100 Chips | 1 | —— | 3 (0.3) | 1,000 (100) | 1,000 (100) | 4 (0.4) | 1 (0.1) | 4 (0.4) | 5 (0.5) |
| 2 | 0 (0) | —— | 999 (99.9) | 9 (0.9) | 1,000 (100) | 1 (0.1) | 3 (0.3) | 7 (0.7) | |
| 3 | 0 (0) | 1 (0.1) | —— | 1,000 (100) | 0 (0) | 0 (0) | 0 (0) | 1,000 (100) | |
| 4 | 0 (0) | 0 (0) | 0 (0) | —— | 0 (0) | 0 (0) | 0 (0) | 0 (0) | |
| 5 | 0 (0) | 0 (0) | 0 (0) | 3 (0.3) | —— | 0 (0) | 1,000 (100) | 999 (99.9) | |
| 6 | 2 (0) | 0 (0) | 2 (0.2) | 2 (0.2) | 2 (0.2) | —— | 1,000 (100) | 8 (0.8) | |
| 7 | 0 (0) | 0 (0) | 0 (0) | 1 (0.1) | 0 (0) | 0 (0) | —— | 1,000 (100) | |
| 8 | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | —— | |
| 50 Chips | 1 | —— | 4 (0.4) | 980 (98) | 1,000 (100) | 23 (2.3) | 11 (1.1) | 23 (2.3) | 8 (0.8) |
| 2 | 8 (0.8) | —— | 977 (97.7) | 19 (1.9) | 989 (98.9) | 6 (0.6) | 13 (1.3) | 24 (2.4) | |
| 3 | 14 (1.4) | 2 (0.2) | —— | 995 (99.5) | 3 (0.3) | 3 (0.3) | 9 (0.9) | 1,000 (100) | |
| 4 | 0 (0) | 0 (0) | 5 (0.5) | —— | 0 (0) | 0 (0) | 1 (0.1) | 0 (0) | |
| 5 | 2 (0.2) | 9 (0.9) | 14 (1.4) | 7 (0.7) | —— | 4 (0.4) | 991 (99.1) | 973 (97.3) | |
| 6 | 10 (1) | 4 (0.4) | 15 (1.5) | 13 (1.3) | 15 (1.5) | —— | 989 (98.9) | 11 (1.1) | |
| 7 | 1 (0.1) | 0 (0) | 0 (0) | 7 (0.7) | 7 (0.7) | 2 (0.2) | —— | 998 (99.8) | |
| 8 | 0 (0) | 0 (0) | 0 (0) | 5 (0.5) | 0 (0) | 0 (0) | 2 (0.2) | —— | |
| 25 Chips | 1 | —— | 33 (3.3) | 832 (83.2) | 960 (96) | 26 (2.6) | 18 (1.8) | 26 (2.6) | 50 (5) |
| 2 | 20 (2) | —— | 751 (75.1) | 63 (6.3) | 912 (91.2) | 14 (1.4) | 57 (5.7) | 94 (9.4) | |
| 3 | 37 (3.7) | 46 (4.6) | —— | 933 (93.3) | 10 (1) | 5 (0.5) | 46 (4.6) | 962 (96.2) | |
| 4 | 1 (0.1) | 0 (0) | 63 (6.3) | —— | 2 (0.2) | 0 (0) | 2 (0.2) | 11 (1.1) | |
| 5 | 5 (0.5) | 50 (5) | 59 (5.9) | 34 (3.4) | —— | 9 (0.9) | 905 (90.5) | 811 (81.1) | |
| 6 | 9 (0.9) | 10 (1) | 19 (1.9) | 38 (3.8) | 64 (6.4) | —— | 857 (85.7) | 69 (6.9) | |
| 7 | 2 (0.2) | 0 (0) | 21 (2.1) | 24 (2.4) | 60 (6) | 19 (1.9) | —— | 964 (96.4) | |
| 8 | 2 (0.2) | 0 (0) | 13 (1.3) | 9 (0.9) | 0 (0) | 0 (0) | 33 (3.3) | —— | |
| 10 Chips | 1 | —— | 51 (5.1) | 516 (51.6) | 702 (70.2) | 63 (6.3) | 30 (3) | 73 (7.3) | 141 (14.1) |
| 2 | 49 (4.9) | —— | 335 (33.5) | 155 (15.5) | 590 (59) | 35 (3.5) | 171 (17.1) | 166 (16.6) | |
| 3 | 73 (7.3) | 84 (8.4) | —— | 596 (59.6) | 67 (6.7) | 16 (1.6) | 126 (12.6) | 641 (64.1) | |
| 4 | 23 (2.3) | 15 (1.5) | 227 (22.7) | —— | 11 (1.1) | 8 (0.8) | 22 (2.2) | 71 (7.1) | |
| 5 | 16 (1.6) | 106 (10.6) | 79 (7.9) | 87 (8.7) | —— | 33 (3.3) | 519 (51.9) | 375 (37.5) | |
| 6 | 35 (3.5) | 30 (3) | 73 (7.3) | 93 (9.3) | 95 (9.5) | —— | 408 (40.8) | 187 (18.7) | |
| 7 | 9 (0.9) | 18 (1.8) | 74 (7.4) | 79 (7.9) | 168 (16.8) | 51 (5.1) | —— | 693 (69.3) | |
| 8 | 3 (0.3) | 2 (0.2) | 68 (6.8) | 51 (5.1) | 24 (2.4) | 8 (0.8) | 135 (13.5) | —— | |
Linkage that exists in the original simulated model.