| Literature DB >> 29113310 |
Bin Yu1,2,3, Jia-Meng Xu1,3, Shan Li1,3, Cheng Chen1,3, Rui-Xin Chen1,3, Lei Wang4, Yan Zhang3,5, Ming-Hui Wang1,3.
Abstract
Gene regulatory networks (GRNs) research reveals complex life phenomena from the perspective of gene interaction, which is an important research field in systems biology. Traditional Bayesian networks have a high computational complexity, and the network structure scoring model has a single feature. Information-based approaches cannot identify the direction of regulation. In order to make up for the shortcomings of the above methods, this paper presents a novel hybrid learning method (DBNCS) based on dynamic Bayesian network (DBN) to construct the multiple time-delayed GRNs for the first time, combining the comprehensive score (CS) with the DBN model. DBNCS algorithm first uses CMI2NI (conditional mutual inclusive information-based network inference) algorithm for network structure profiles learning, namely the construction of search space. Then the redundant regulations are removed by using the recursive optimization algorithm (RO), thereby reduce the false positive rate. Secondly, the network structure profiles are decomposed into a set of cliques without loss, which can significantly reduce the computational complexity. Finally, DBN model is used to identify the direction of gene regulation within the cliques and search for the optimal network structure. The performance of DBNCS algorithm is evaluated by the benchmark GRN datasets from DREAM challenge as well as the SOS DNA repair network in Escherichia coli, and compared with other state-of-the-art methods. The experimental results show the rationality of the algorithm design and the outstanding performance of the GRNs.Entities:
Keywords: comprehensive score model; dynamic Bayesian network; gene regulatory networks; multiple time-delayed; network structure profiles
Year: 2017 PMID: 29113310 PMCID: PMC5655205 DOI: 10.18632/oncotarget.21268
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Comparison of DREAM3 10 nodes real GRNs and inferred GRNs
DBNCS algorithm is used to infer the time-delayed of DREAM3 10-node GRNs
| Regulation relation | Time unit | Regulation relation | Time unit |
|---|---|---|---|
| G7→G8 | 2 | G1→G5 | 4 |
| G10→G7 | 4 | G1→G2 | 1 |
| G5→G7 | 4 | G1→G4 | 4 |
| G3→G5 | 1 | G6→G4 | 1 |
| G3→G1 | 2 |
Comparison of the different methods’ performances on the DREAM3 10 gene dataset
| Method | TPR | FPR | PPV | ACC | MCC | AUC |
|---|---|---|---|---|---|---|
| GENIE3 | 0.700 | 0.112 | 0.437 | 0.867 | 0.483 | 0.919 |
| ARACNE | 0.900 | 0.112 | 0.500 | 0.888 | 0.618 | 0.930 |
| NARROMI | 0.700 | 0.050 | 0.636 | 0.922 | 0.623 | 0.938 |
| DBNCS | 0.800 |
Figure 2Comparison of the results of constructing DREAM3 10-node GRNs with DBNCS algorithm
Figure 3Comparison of DREAM3 50 nodes real GRNs and inferred GRNs
DBNCS algorithm is used to infer the time-delayed of DREAM3 50-node GRNs
| Regulation relation | Time unit | Regulation relation | Time unit | Regulation relation | Time unit |
|---|---|---|---|---|---|
| G2→G1 | 3 | G25→G32 | 2 | G38→G41 | 1 |
| G2→G3 | 3 | G25→G35 | 3 | G38→G44 | 1 |
| G4→G7 | 5 | G25→G27 | 3 | G38→G45 | 5 |
| G4→G8 | 2 | G26→G22 | 1 | G38→G48 | 2 |
| G11→G9 | 1 | G26→G30 | 4 | G38→G50 | 2 |
| G13→G15 | 1 | G26→G46 | 3 | G39→G36 | 5 |
| G18→G28 | 4 | G26→G47 | 1 | G39→G49 | 2 |
| G21→G23 | 5 | G28→G20 | 1 | G41→G44 | 1 |
| G22→G18 | 3 | G31→G34 | 5 | ||
| G24→G40 | 1 | G33→G37 | 3 | ||
| G25→G31 | 3 | G34→G23 | 3 |
Comparison of the different methods’ performances on the DREAM3 50 gene dataset
| Method | TPR | FPR | PPV | ACC | MCC | AUC |
|---|---|---|---|---|---|---|
| GENIE3 | 0.481 | 0.078 | 0.167 | 0.908 | 0.245 | 0.843 |
| ARACNE | 0.597 | 0.082 | 0.192 | 0.908 | 0.303 | 0.832 |
| NARROMI | 0.532 | 0.062 | 0.217 | 0.925 | 0.307 | 0.839 |
| DBNCS | 0.416 |
Figure 4Comparison of the results of constructing DREAM3 50-node GRNs with DBNCS algorithm
Figure 5SOS DNA repair system of Escherichia coli
Comparison of the different methods’ performances on the SOS DNA repair network
| Method | TPR | FPR | PPV | ACC | MCC | AUC |
|---|---|---|---|---|---|---|
| GENIE3 | 0.500 | 0.208 | 0.546 | 0.694 | 0.299 | 0.684 |
| ARACNE | 0.708 | 0.625 | 0.362 | 0.486 | 0.083 | 0.739 |
| NARROMI | 0.667 | 0.458 | 0.421 | 0.583 | 0.197 | 0.791 |
| Grow-shring | 0.458 | 0.271 | 0.458 | 0.639 | 0.188 | 0.758 |
| IAMB | 0.583 | 0.229 | 0.560 | 0.708 | 0.351 | 0.809 |
| DBNCS | 0.667 | 0.229 |
Figure 6Comparison of Escherichia coli SOS DNA repair network and inferred gene regulatory network
Figure 7Comparison of the results of constructing SOS DNA repair network in E. coli 9-node GRNs with DBNCS algorithm
Figure 8The diagram of DBNCS method
(1) Using the CMI2NI algorithm, and through the gene microarray data to construct from the five genes and the interaction between the composition of the network structure profiles. (2) Redundant regulations are removed by RO algorithm: 2–3, 1–4. (3) The network structure profiles after the redundancy are decomposed into a series of cliques consisting of two co-expressing genes without loss: 2–1, 5–4, 1–3, 4–2, 3–5, the number of cliques is the same as the regulation relationship. (4) For each clique structure, the regulatory gene, target gene and the transcriptional delay are determined by the DBN based on the CS model to generate a series of dynamic sub-networks: 2→1,5→4,1→3,4→2,3→5. (5) Integration of a series of sub-networks, and get the final multiple time-delayed GRNs. The solid lines in Figure represent the true regulation between genes, and the dotted lines represent the redundancy correlation between the two genes.