| Literature DB >> 24728051 |
Rosa Aghdam1, Mojtaba Ganjali1, Changiz Eslahchi2.
Abstract
Inferring gene regulatory networks (GRNs) is a major issue in systems biology, which explicitly characterizes regulatory processes in the cell. The Path Consistency Algorithm based on Conditional Mutual Information (PCA-CMI) is a well-known method in this field. In this study, we introduce a new algorithm (IPCA-CMI) and apply it to a number of gene expression data sets in order to evaluate the accuracy of the algorithm to infer GRNs. The IPCA-CMI can be categorized as a hybrid method, using the PCA-CMI and Hill-Climbing algorithm (based on MIT score). The conditional dependence between variables is determined by the conditional mutual information test which can take into account both linear and nonlinear genes relations. IPCA-CMI uses a score and search method and defines a selected set of variables which is adjacent to one of X or Y. This set is used to determine the dependency between X and Y. This method is compared with the method of evaluating dependency by PCA-CMI in which the set of variables adjacent to both X and Y, is selected. The merits of the IPCA-CMI are evaluated by applying this algorithm to the DREAM3 Challenge data sets with n variables and n samples (n = 10, 50, 100) and to experimental data from Escherichia coil containing 9 variables and 9 samples. Results indicate that applying the IPCA-CMI improves the precision of learning the structure of the GRNs in comparison with that of the PCA-CMI.Entities:
Mesh:
Year: 2014 PMID: 24728051 PMCID: PMC3984085 DOI: 10.1371/journal.pone.0092600
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The PC Algorithm based on CMI test (PCA-CMI) [5].
| 1: | Start with a complete undirected graph |
| 2: |
|
| 3: | Repeat |
| 4: | For each |
| 5: | For each |
| 6: | Determine if there is |
| 7: | If this set exists |
| 8: | Remove the edge between |
| 9: |
|
| 10: | Until |
Zero order of the Improvement of PC Algorithm based on CMI test.
| 1: | Start with a complete undirected graph |
| 2: | Repeat |
| 3: | For each |
| 4: | For each |
| 5: | If |
| 6: | Remove the edge between |
| 7: | The MIT score was utilized in the HC algorithm to construct |
i order () of the Improvement of PC Algorithm based on CMI test.
| 1: | Start with |
| 2: |
|
| 3: | Repeat |
| 4: | For each |
| 5: | For each |
| 6: | Test whether |
| 7: | If this set exists |
| 8: | Remove the edge between |
| 9: | The MIT score was utilized in the HC algorithm to direct the structure. |
| 10: | For each |
| 11: | The weight value for variable |
| 12: | A selected set |
| 13: | i = i+1 |
| 14: | Until |
The result of gene expression data set DREAM3 Challenge with 10 genes and sample number 10.
| Algorithm | TP | FP | ACC | FPR | FDR | PPV | F | MCC | TPR |
| PCA1 | 7 | 1 | 0.91 | 0.03 | 0.13 | 0.87 | 0.78 | 0.73 | 0.7 |
| IPCA1 |
|
|
|
|
|
|
|
|
|
Result of DREAM3 in size of 10 with first-order CMI test with threshold 0.05. The second row of the table indicates the result of first-order PCA-CMI (PCA1) the third row of the table shows the result of first-order IPCA-CMI (IPCA1).
Figure 1Comparing the result of the PCA-CMI and the IPCA-CMI for inferring the structure of DREAM3 contains 10 variables and 10 edges.
(A) The true network with 10 variables and 10 edges. (B) Firs-order network inferred by the PCA-CMI. The edge with red line G2–G4 is false positives, while the edges G1–G2, G3–G5 and G4–G9 are false negative. (C) First-order network obtained by the IPCA-CMI. The false positive edge G2–G4 in (B) is successfully removed by the IPCA-CMI, in addition edges G1–G2 and G3–G5 are successfully found by this algorithm.
The result of Simulated and Real data sets in order 0.
| Network | TP | FP | ACC | FPR | FDR | PPV | F | MCC | TPR |
| DREAM10 | 9 | 1 | 0.95 | 0.02 | 0.10 | 0.9 | 0.90 | 0.87 | 0.90 |
| DREAM50 | 36 | 54 | 0.92 | 0.05 | 0.6 | 0.4 | 0.43 | 0.39 | 0.46 |
| DREAM100 | 70 | 58 | 0.96 | 0.01 | 0.45 | 0.55 | 0.47 | 0.46 | 0.42 |
| SOS | 18 | 4 | 0.72 | 0.33 | 0.18 | 0.82 | 0.78 | 0.40 | 0.75 |
The second row of the table shows the result of DREAM3 in size of 10 with threshold 0.05. The third row denotes the result of DREAM3 in size of 50 with threshold 0.1. The forth row of the table indicates the result of DREAM3 in size of 100 with threshold 0.1. Finally the last row shows the result of SOS DNA repair network with threshold 0.01.
The result of gene expression data set DREAM3 Challenge with 50 genes and sample number 50.
| Algorithm | TP | FP | ACC | FPR | FDR | PPV | F | MCC | TPR |
| PCA1 | 24 | 23 | 0.93 | 0.02 | 0.49 | 0.51 | 0.39 | 0.37 | 0.31 |
| PCA2 | 22 | 21 | 0.93 | 0.02 | 0.49 | 0.51 | 0.37 | 0.35 | 0.29 |
| IPCA1 |
| 26.5 |
| 0.02 |
| 0.51 |
|
|
|
| IPCA2 |
|
|
|
|
|
|
|
|
|
Result of DREAM3 in size of 50 with different CMI orders with threshold 0.1. The second and third rows of the table indicate the result of first-order PCA-CMI (PCA1) and second-order PCA-CMI (PCA2), respectively. The forth and fifth rows of the table show the result of IPCA-CMI of first-order(IPCA1) and second-order(IPCA2), respectively.
The result of gene expression data set DREAM3 Challenge with 100 genes and sample number 100.
| Algorithm | TP | FP | ACC | FPR | FDR | PPV | F | MCC | TPR |
| PCA1 | 49 | 25 | 0.971 | 0.005 | 0.34 | 0.66 | 0.41 | 0.43 | 0.28 |
| PCA2 | 46 | 25 | 0.971 | 0.005 | 0.35 | 0.64 | 0.38 | 0.41 | 0.27 |
| IPCA1 |
| 29.77 |
| 0.006 | 0.35 | 0.65 |
|
|
|
| IPCA2 |
|
|
|
|
|
|
|
|
|
Result of DREAM3 in size of 100 with different CMI orders with threshold 0.1. The second and third rows of the table indicate the result of first-order PCA-CMI (PCA1) and second-order PCA-CMI (PCA2), respectively. The forth and fifth rows of the table show the result of IPCA-CMI of first-order (IPCA1) and second-order (IPCA2), respectively.
The result of experimental data from Escherichia coil containing 9 genes and sample number 9.
| Algorithm | TP | FP | ACC | FPR | FDR | PPV | F | MCC | TPR |
| PCA1 | 18 | 4 | 0.72 | 0.33 | 0.18 | 0.82 | 0.78 | 0.40 | 0.75 |
| IPCA1 |
|
|
|
|
| 0.82 |
|
| 0.75 |
The result of SOS DNA repair network in size of 9 with 24 edges. Results are related to the order 1 of CMI with threshold 0.01. The second row of the table indicates the result of first-order PCA-CMI (PCA1). The third row of the table show the result of IPCA-CMI of first-order (IPCA1).
The probability of occurrence of GRNs.
| Algorithm | DREAM10 | DREAM50 | DREAM100 | SOS |
| PCA | 1.948475e-05 | 6.211307e-17 | 9.751598e-53 | 0.01755 |
| IPCA |
|
|
|
|
A determination of the probability of selecting a subgraph using the PCA-CMI and the IPCA-CMI. The second row of the table indicates the result of last order PCA-CMI (PCA). The third row of the table show the result of last order IPCA-CMI (IPCA).