| Literature DB >> 36237217 |
Guangyi Chen1, Zhi-Ping Liu1,2.
Abstract
Gene regulatory network (GRN) provides abundant information on gene interactions, which contributes to demonstrating pathology, predicting clinical outcomes, and identifying drug targets. Existing high-throughput experiments provide rich time-series gene expression data to reconstruct the GRN to further gain insights into the mechanism of organisms responding to external stimuli. Numerous machine-learning methods have been proposed to infer gene regulatory networks. Nevertheless, machine learning, especially deep learning, is generally a "black box," which lacks interpretability. The causality has not been well recognized in GRN inference procedures. In this article, we introduce grey theory integrated with the adaptive sliding window technique to flexibly capture instant gene-gene interactions in the uncertain regulatory system. Then, we incorporate generalized multivariate Granger causality regression methods to transform the dynamic grey association into causation to generate directional regulatory links. We evaluate our model on the DREAM4 in silico benchmark dataset and real-world hepatocellular carcinoma (HCC) time-series data. We achieved competitive results on the DREAM4 compared with other state-of-the-art algorithms and gained meaningful GRN structure on HCC data respectively.Entities:
Keywords: adaptive sliding window; causation; dynamic grey association; gene regulatory network inference; machine learning
Year: 2022 PMID: 36237217 PMCID: PMC9551017 DOI: 10.3389/fbioe.2022.954610
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
FIGURE 1The overview of the GreyNet framework. (A) is the expression matrix of genes. (B,C) is the procedure of dynamic grey association. The window length is automatically adjusted by information entropy (IE). We firstly sample the time points by the sliding window. Then, we input the sampled data into grey relational analysis to get the dynamic grey association coefficient as (D). (E) is the weight matrix generated by regression methods that transform the dynamic grey association to causal directional regulatory link as (F).
The dscription of the datasets used in the experiments.
| Network | #TF | #Gene | #Timepoints | #Samples |
|---|---|---|---|---|
| DREAM4 | 10 | 10 | 21 | 5 |
| DREAM4 | 100 | 100 | 21 | 10 |
| HCC | 21 | 258 | 10 | 105 |
The development stages of HCC.
| Development | Notation | #Samples |
|---|---|---|
| Normal | N | 13 |
| Choronic Hepatitis with low grade | FL | 8 |
| Choronic Hepatitis with high grade | FH | 12 |
| Cirrhosis | CS | 12 |
| Dysplastic nodules with low garde | DL | 11 |
| Dysplastic nodules with high garde | DH | 11 |
| Early hepatocellular carcinoma | eHCC | 5 |
| Hepatecellular carcinoma (TG1) | TG1 | 9 |
| Hepatecellular carcinoma (TG2) | TG2 | 12 |
| Hepatecellular carcinoma (TG3) | TG3 | 12 |
FIGURE 2The comparison of RF, GreyNet-RF, Xgboost, and GreyNet-Xgboost on DREAM4 insilico datasets. (A) is the results of AUROC on DREAM4 size10 networks. (B) is the results of AUPRC on DREAM4 size10 networks. (C) is the results of AUROC on DREAM4 size100 networks. (D) is the results of AUPRC on DREAM4 size100 networks.
FIGURE 3The comparison of LASSO, GreyNet-LASSO, Ridge, and GreyNet-Ridge on DREAM4 in in silico datasets. (A) is the results of AUROC on size10 networks. (B) is the results of AUPRC on size10 networks. (C) is the results of AUROC on size100 networks. (D) is the results of AUPRC on size100 networks.
The comparative results of models on DREAM4 data.
| DREAM4 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Method | Network1 | Network2 | Network3 | Network4 | Network5 | |||||
| AUROC | AUPRC | AUROC | AUPRC | AUROC | AUPRC | AUROC | AUPRC | AUROC | AUPRC | |
| GreyNet | 0.839 | 0.479 |
|
|
|
|
|
| 0.917 | 0.812 |
| BTNET(GB) | 0.834 | 0.516 | 0.698 | 0.362 | 0.682 | 0.473 | 0.822 | 0.560 |
| 0.774 |
| BTNET(AB) |
| 0.552 | 0.719 | 0.370 | 0.719 | 0.465 | 0.791 | 0.506 | 0.903 | 0.701 |
| SWING-RF | 0.832 | 0.508 | 0.779 | 0.525 | 0.815 | 0.546 | 0.728 | 0.441 | 0.925 | 0.753 |
| SWING-Dionesus | 0.743 | 0.469 | 0.786 | 0.484 | 0.789 | 0.421 | 0.772 | 0.540 | 0.807 | 0.625 |
| BiXGBoost | 0.816 |
| 0.784 | 0.422 | 0.771 | 0.376 | 0.787 | 0.533 | 0.888 | 0.741 |
| GENIE3-lag | 0.834 | 0.476 | 0.741 | 0.391 | 0.750 | 0.478 | 0.797 | 0.520 | 0.869 | 0.734 |
| Jump3 | 0.700 | 0.442 | 0.698 | 0.308 | 0.717 | 0.401 | 0.784 | 0.486 | 0.841 | 0.619 |
| TIGRESS | 0.758 | 0.375 | 0.602 | 0.222 | 0.618 | 0.200 | 0.764 | 0.324 | 0.804 | 0.411 |
| DREAM4 | ||||||||||
| GreyNet |
|
|
|
|
|
| 0.731 |
|
|
|
| BTNET(GB) | 0.776 | 0.186 | 0.694 | 0.113 | 0.759 | 0.235 | 0.723 | 0.143 | 0.758 | 0.165 |
| BTNET(AB) | 0.776 | 0.207 | 0.699 | 0.116 | 0.770 | 0.224 | 0.740 | 0.158 | 0.780 | 0.169 |
| SWING-RF | 0.793 | 0.192 | 0.723 | 0.116 | 0.759 | 0.214 | 0.742 | 0.193 | 0.775 | 0.160 |
| SWING-Dionesus | 0.772 | 0.124 | 0.700 | 0.095 | 0.709 | 0.194 | 0.727 | 0.187 | 0.771 | 0.143 |
| BiXGBoost | 0.744 | 0.138 | 0.682 | 0.075 | 0.716 | 0.119 | 0.702 | 0.106 | 0.728 | 0.090 |
| GENIE3-lag | 0.790 | 0.167 | 0.711 | 0.103 | 0.767 | 0.215 |
| 0.152 | 0.786 | 0.146 |
| Jump3 | 0.724 | 0.099 | 0.623 | 0.057 | 0.696 | 0.077 | 0.662 | 0.072 | 0.696 | 0.074 |
| TIGRESS | 0.715 | 0.054 | 0.532 | 0.037 | 0.483 | 0.018 | 0.467 | 0.018 | 0.521 | 0.022 |
The highest AUROC and AUPR are shown in bold for each network.
FIGURE 4The HCC GRN reconstructed by GreyNet. In the network, the larger blue hexagon nodes represent TFs. The circle orange nodes represent the target genes. The diamond green nodes represent some elite disease genes in HCC.
The enrichment of GO biological process and KEGG pathway in HCC GRN.
| GO:Term | Term Name | Representative Gene | Corr. |
|---|---|---|---|
| GO:0016055 | Wnt receptor signaling pathway | DVL1; WNT7B; DVL3; WNT8A; CCND1; WNT11; TCF7L2; FZD3; TCF7; LRP5; WNT9B; FZD1; WNT3; TCF7L1; FZD6; APC2; GSK3B; FRAT1; WNT4; FZD2; FZD4; APC; WNT5B; WNT7A; AXIN2; DVL2; CTNNB1; WNT1; CSNK1A1; FZD9; WNT10A; LEF1; FZD8; FRAT2; FZD5; WNT10B; AXIN1; WNT16; WNT5A | 1.1E-54 |
| GO:0008283 | Cell proliferation | WNT7B; CCND1; BAD; FZD3; SMAD4; FZD6; GSK3B; WNT4; BAK1; BCL2L1; CTNNB1; TGFB1; WNT1; FZD9; BAX; TERC; MAP2K1; HGF; WNT10B; MET; WNT5A | 7.0E-17 |
| GO:0007169 | Transmembrane receptor protein tyrosine kinase signaling pathway | AKT1; GRB2; IGF1R; PIK3R1; SOS1; SMARCC1; EGFR; PIK3R3; PTEN; FGFR4; GAB1; RAF1; HGF; MET | 5.3E-10 |
| GO:0051252 | Regulation of RNA metabolic process | ARID2; HNF1A; AKT2; TCF7L2; TCF7; SMAD3; ELK1; TCF7L1; SMAD4; SMARCC1; GSK3B; E2F3; WNT4; TGFB3; RB1; CDKN2A; WNT7A; CTNNB1; TGFB1; WNT1; E2F2; EPO; LEF1; MAP2K1; SMAD2; WNT10B; AXIN1; MET | 7.6E-9 |
| GO:0043408 | Regulation of MAPKKK cascade | AKT1; WNT7B; GRB2; AKT2; IGF1R; BRAF; APC; WNT7A; CTNNB1; GAB1; AXIN1; WNT5A | 5.7E-9 |
| GO:0006357 | Regulation of transcription from RNA polymerase II promoter | ARID2; HNF1A; AKT2; TCF7L2; SMAD3; TCF7L1; SMAD4; SMARCC1; TGFB3; RB1; CTNNB1; TGFB1; WNT1; EPO; LEF1; MAP2K1; SMAD2; WNT10B; AXIN1; MET | 1.7E-7 |
| GO:0007265 | Ras protein signal transduction | SHC3; GRB2; SOS1; CDKN1A; RB1; TP53; NRAS; MAPK3; RAF1; MAP2K1; CDKN2A; MAP2K2; KRAS; SHC2 | 3.3E-7 |
| GO:0043405 | Regulation of MAP kinase activity | WNT7B; GSK3B; TGFB3; GAB1; MAP2K1; PRKCA; HGF; AXIN1; MET; WNT5A | 1.0E-7 |
| GO:0006338 | Chromatin remodeling | HNF1A; RB1; ACTL6A; ARID1A; SMARCC1; SMARCD1; ARID1B; SMARCC2; SMARCA2; SMARCB1 | 5.6E-6 |
| GO:0046328 | Regulation of JNK cascade | AKT1; WNT7B; AKT2; WNT7A; GAB1; AXIN1; WNT5A | 2.3E-5 |
| GO:0007179 | Transforming growth factor beta receptor signaling pathway | SMAD4; SMAD3; TGFB1; TGFBR1; TGFB3; TGFBR2; TGFB2; SMAD2 | 9.1E-4 |
| GO:0006139 | Nucleobase, nucleoside, nucleotide and nucleic acid metabolic process | DPF1; HNF1A; NRAS; TCF7L2; DDB2; TCF7; SMARCD3; SMAD3; ELK1; TCF7L1; SMAD4; ABCC3; SMARCC1; KEAP1; E2F3; ACTL6A; ACTL6B; ARID1A; RB1; CDKN2A; CTNNB1; E2F2; TERT; PRKCB; POLK; UGP2; LEF1; TERC; SMAD2; SMARCC2; SMARCB1; AXIN1; DPF3 | 8.2E-4 |
| GO:0007264 | Small GTPase mediated signal transduction | SHC3; GRB2; SOS1; CDKN1A; RB1; TP53; BRAF; NRAS; MAPK3; RAF1; MAP2K1; SOS2; CDKN2A; MAP2K2; HMOX1; KRAS; SHC2 | 4.6E-4 |
| GO:0043491 | Protein kinase B signaling cascade | AKT1; RPS6KB1; AKT2; RPS6KB2 | 3.2E-4 |
| GO:0001889 | Liver development | CCND1; HGF; SMAD3; AFP; CTNNB1; HNF1A; ALDH2; ALDOB; ASS1 | 2.0E-4 |