| Literature DB >> 20126643 |
Kevin Y Yip1, Roger P Alexander, Koon-Kiu Yan, Mark Gerstein.
Abstract
We performed computational reconstruction of the in silico gene regulatory networks in the DREAM3 Challenges. Our task was to learn the networks from two types of data, namely gene expression profiles in deletion strains (the 'deletion data') and time series trajectories of gene expression after some initial perturbation (the 'perturbation data'). In the course of developing the prediction method, we observed that the two types of data contained different and complementary information about the underlying network. In particular, deletion data allow for the detection of direct regulatory activities with strong responses upon the deletion of the regulator while perturbation data provide richer information for the identification of weaker and more complex types of regulation. We applied different techniques to learn the regulation from the two types of data. For deletion data, we learned a noise model to distinguish real signals from random fluctuations using an iterative method. For perturbation data, we used differential equations to model the change of expression levels of a gene along the trajectories due to the regulation of other genes. We tried different models, and combined their predictions. The final predictions were obtained by merging the results from the two types of data. A comparison with the actual regulatory networks suggests that our approach is effective for networks with a range of different sizes. The success of the approach demonstrates the importance of integrating heterogeneous data in network reconstruction.Entities:
Mesh:
Year: 2010 PMID: 20126643 PMCID: PMC2811182 DOI: 10.1371/journal.pone.0008121
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
AUROC of our predictions.
| Ecoli1 | Ecoli2 | Yeast1 | Yeast2 | Yeast3 | |
| Size 10 | 0.928 | 0.912 | 0.949 | 0.747 | 0.714 |
| Size 50 | 0.930 | 0.924 | 0.917 | 0.792 | 0.805 |
| Size 100 | 0.948 | 0.960 | 0.915 | 0.856 | 0.783 |
pAUROC of our predictions.
| Ecoli1 | Ecoli2 | Yeast1 | Yeast2 | Yeast3 | Overall AUROC | |
| Size 10 | 9.771e-07 | 2.629e-07 | 9.941e-07 | 2.931e-04 | 1.046e-03 | 9.523e-06 |
| Size 50 | 2.396e-27 | 4.328e-31 | 1.477e-25 | 1.808e-21 | 1.386e-29 | 5.210e-27 |
| Size 100 | 1.226e-52 | 5.876e-42 | 4.087e-70 | 5.755e-99 | 1.722e-92 | 3.112e-71 |
Figure 1The Yeast1-size10 network.
(a) The actual network. (b) Our top-10 predictions.
Prediction accuracy per batch on the size 10 networks.
| Ecoli1 | Ecoli2 | Yeast1 | Yeast2 | Yeast3 | ||||||
| Batch | Predicted | Correct | Predicted | Correct | Predicted | Correct | Predicted | Correct | Predicted | Correct |
| 1 | 11 | 7 | 16 | 12 | 11 | 9 | 13 | 9 | 12 | 8 |
| 2 | 6 | 1 | 4 | 0 | 5 | 0 | 5 | 1 | 5 | 4 |
| 3 | 0 | 0 | 1 | 1 | 3 | 0 | 1 | 0 | 1 | 0 |
| 4 | 5 | 1 | 8 | 0 | 7 | 0 | 4 | 2 | 4 | 0 |
| 5 | 4 | 0 | 8 | 1 | 6 | 0 | 10 | 3 | 5 | 1 |
| 6 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 7 | 63 | 1 | 53 | 1 | 58 | 1 | 57 | 10 | 63 | 9 |
| Total | 90 | 11 | 90 | 15 | 90 | 10 | 90 | 25 | 90 | 22 |
Prediction accuracy per batch on the size 50 networks.
| Ecoli1 | Ecoli2 | Yeast1 | Yeast2 | Yeast3 | ||||||
| Batch | Predicted | Correct | Predicted | Correct | Predicted | Correct | Predicted | Correct | Predicted | Correct |
| 1 | 96 | 52 | 133 | 69 | 145 | 57 | 176 | 83 | 201 | 100 |
| 2 | 76 | 2 | 85 | 1 | 80 | 8 | 87 | 12 | 102 | 16 |
| 3 | 77 | 0 | 78 | 1 | 69 | 1 | 56 | 1 | 64 | 2 |
| 4 | 196 | 0 | 153 | 1 | 185 | 1 | 156 | 5 | 113 | 3 |
| 5 | 178 | 1 | 169 | 1 | 167 | 2 | 177 | 6 | 149 | 2 |
| 6 | 5 | 0 | 16 | 0 | 9 | 0 | 11 | 0 | 6 | 0 |
| 7 | 1822 | 7 | 1816 | 9 | 1795 | 8 | 1787 | 53 | 1815 | 50 |
| Total | 2450 | 62 | 2450 | 82 | 2450 | 77 | 2450 | 160 | 2450 | 173 |
Prediction accuracy per batch on the size 100 networks.
| Ecoli1 | Ecoli2 | Yeast1 | Yeast2 | Yeast3 | ||||||
| Batch | Predicted | Correct | Predicted | Correct | Predicted | Correct | Predicted | Correct | Predicted | Correct |
| 1 | 410 | 101 | 377 | 108 | 483 | 118 | 656 | 257 | 710 | 302 |
| 2 | 387 | 11 | 319 | 1 | 317 | 20 | 282 | 22 | 311 | 31 |
| 3 | 162 | 0 | 198 | 0 | 129 | 0 | 145 | 3 | 135 | 3 |
| 4 | 650 | 0 | 685 | 1 | 575 | 2 | 604 | 12 | 638 | 13 |
| 5 | 683 | 1 | 656 | 2 | 746 | 3 | 739 | 10 | 667 | 24 |
| 6 | 53 | 0 | 72 | 0 | 82 | 2 | 67 | 0 | 59 | 2 |
| 7 | 7555 | 12 | 7593 | 7 | 7568 | 21 | 7407 | 85 | 7380 | 176 |
| Total | 9900 | 125 | 9900 | 119 | 9900 | 166 | 9900 | 389 | 9900 | 551 |
Prediction of the first two batches on the size 10 networks when their orders are swapped.
| Ecoli1 | Ecoli2 | Yeast1 | Yeast2 | Yeast3 | ||||||
| Batch | Predicted | Correct | Predicted | Correct | Predicted | Correct | Predicted | Correct | Predicted | Correct |
| 1 | 6 | 1 | 5 | 1 | 5 | 0 | 5 | 1 | 5 | 4 |
| 2 | 11 | 7 | 15 | 12 | 11 | 9 | 13 | 9 | 12 | 8 |
Probability of having at least the observed number of correctly predicted regulation events in batches 2–6 by chance, given the total number of predictions in these batches.
| Ecoli1 | Ecoli2 | Yeast1 | Yeast2 | Yeast3 | |
| Size-10 | 0.0247 | 0.1922 | 1 | 0.1925 | 0.0923 |
| Size-50 | 0.4015 | 0.3036 | 0.0003 | 0.0273 | 0.0078 |
| Size-100 | 0.0012 | 0.1670 | 0.0000 | 0.0000 | 0.0001 |
Figure 2Two regulation events that were missed by the noise models but detected by the differential equation models.
(a) The actual Ecoli1-size10 netowrk. (b) The homozygous deletion profile of G7 in the Ecoli1-size10 network. (c) A perturbation time series of G7 in the Ecoli1-size10 network. (d) The actual Ecoli2-size10 network. (e) A perturbation time series of G6 in the Ecoli2-size10 network. (f) A perturbation time series of G6 in the Ecoli2-size10 network.