| Literature DB >> 33054712 |
Ming Shi1,2, Sheng Tan3, Xin-Ping Xie4, Ao Li5, Wulin Yang6, Tao Zhu7, Hong-Qiang Wang8,9.
Abstract
BACKGROUND: Genes are regulated by various types of regulators and most of them are still unknown or unobserved. Current gene regulatory networks (GRNs) reverse engineering methods often neglect the unknown regulators and infer regulatory relationships in a local and sub-optimal manner.Entities:
Mesh:
Year: 2020 PMID: 33054712 PMCID: PMC7559338 DOI: 10.1186/s12864-020-07079-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1The global inference framework of GRNs (dlGRN). a Pipeline of dlGRN. b A global gene regulatory model based on ARs (GGRM)
Fig. 2Evaluation of the signal recovery power of dlGRN on Simulation data I. a Error barplots of average RRs and average PPVs (l = 50). b, c Changing curves of average RRs and average PPVs with l at SNR = 10 (b) and 15 (c). d, e Changing curves of average RRs(d) and average PPVs(e) over sample sizes with SNR
Performances (mean% ± std.% of AUROCs, mean% ± std.% of AUPRs) of different inference methods on Simulation data I (n = 20, SNR = 10, 20 and 30). Best results for each SNR case are in bold
| METHOD | SNR = 10 | SNR = 20 | SNR = 30 |
|---|---|---|---|
| GENIE3 | 81.13 ± 0.49, 53.83 ± 0.77 | 82.05 ± 0.51, 56.11 ± 0.61 | 81.89 ± 0.51, 56.04 ± 0.78 |
| CLR | 81.04 ± 0.29, 57.66 ± 0.42 | 81.95 ± 0.36, 59.80 ± 0.53 | 81.74 ± 0.44, 59.50 ± 0.69 |
| ARACNe-AP | 62.56 ± 1.45, 13.68 ± 5.37 | 64.17 ± 0.61, 18.63 ± 3.77 | 64.13 ± 1.32, 19.17 ± 7.03 |
| ARACNE | 81.84 ± 0.43, 55.20 ± 0.53 | 82.65 ± 0.41, 57.18 ± 0.73 | 82.55 ± 0.62, 56.37 ± 1.10 |
| dlGRN ( | 88.21 ± 0.51, 65.49 ± 1.10 | 90.05 ± 0.67, 70.10 ± 1.49 | 90.62 ± 0.41, 71.05 ± 0.89 |
| dlGRN ( | 96.10 ± 0.42, | 97.45 ± 0.29, | |
| dlGRN ( | 90.48 ± 0.45, 75.16 ± 1.06 | ||
| dlGRN ( | 90.40 ± 0.45, 75.86 ± 0.98 | 95.96 ± 0.32, 87.73 ± 0.41 | 97.54 ± 0.46, 91.44 ± 1.13 |
Results (AUROCs%, AUPRs%) of different methods on simulation data II, two real-world model organism data sets and three lung cancer data sets GSE32863, GSE10072, GSE7670. Best values for each data set are in bold
| Data sets | GENIE3 | CLR | ARACNe-AP | ARACNE | dlGRN |
|---|---|---|---|---|---|
| Simulation data II | 81.50, | 74.34, 22.63 | 68.19, 15.59 | 75.72, 19.12 | |
| 58.72, 1.12 | 56.55, 0.61 | 61.66, 0.80 | 68.77, 1.69 | ||
| 52.94, 0.31 | 52.43, 0.22 | 51.64, 0.02 | 53.01, 0.22 | ||
| GSE32863 | 52.67, 5.81 | 51.77, 5.52 | 51.09, 5.60 | 52.36, 5.73 | |
| GSE10072 | 51.84, 5.54 | 51.51, 5.43 | 51.17, 5.42 | 51.28, 5.40 | |
| GSE7670 | 53.17, 5.90 | 51.74, 5.58 | 51.64, 5.62 | 51.04, 5.58 |
Fig. 3True positives comparison on three lung cancer datasets. Numbers of true positives (TPs) in most highly scored num = 10, 50, 100, 150 and 200 regulations by dlGRNs and previous methods (GENIE3, CLR, ARACNe-AP and ARCNE) on three lung cancer data sets GSE32863 (a), GSE10072 (b), GSE7670 (c) and across these data sets (d)
Fig. 4Topological analysis of the reconstructed 2677-link GRNs by dlGRN and the four previous methods. a-c Topology of GRNs inferred by dlGRN on data sets, GSE32863 (a), GSE10072 (b) and GSE7670 (c). Node sizes are proportional to the connectivity. γ : slope of the fitted power-law curve; AN: Average number of correctly called TFs per target gene; ACC: average clustering coefficient. d-f Distributions of the log-transformed degrees of nodes in the GRNs on data sets, GSE32863 (d), GSE10072 (e) and GSE7670 (f). g-i Counts of each CRS pattern (P1-P5) predicted on data sets GSE32863 (g), GSE10072 (h) and GSE7670 (i)
Fig. 5Results of TF regulation inference for target gene ID2 on the three lung cancer datasets (GSE32863, GSE10072 and GSE7670) by different methods. a-e Venn diagrams of TFs between the background network and the three inferred GRNs by dlGRN (a) and four previous methods, GENIE3 (b), CLR (c), ARACNe-AP (d) and ARACNE (e). f-h Known and called TFs of ID2 by dlGRN on data sets, GSE32863 (f), GSE10072 (g) and GSE7670 (h)
Fig. 6Experimental verification of TFAP2C regulating EGFR in A549 cancer cells. a Relative expression levels of EGFR with or without knockdown of TFAP2C. “**” mean p-values< 0.01. b Potential transcriptional regulation mechanism of TFAP2C on EGFR. c Comparison of expression of genes along onco-signalling pathway activated by EGFR between tumor and normal tissues in the three data sets
Fig. 7DNA methylation regulation frequently occurs in cellular activity. a Comparison of probabilistic density between permutated and observed methylation regulation scores. b, c Boxplots of expression and DNA methylation of gene RAB25 in LUAD and adjacent normal tissues on data sets GSE32861 and GSE32863