| Literature DB >> 33172489 |
Anyou Wang1, Rong Hai2,3.
Abstract
OBJECTIVES: Numerous software has been developed to infer the gene regulatory network, a long-standing key topic in biology and computational biology. Yet the slowness and inaccuracy inherited in current software hamper their applications to the increasing massive data. Here, we develop a software, FINET (Fast Inferring NETwork), to infer a network with high accuracy and rapidity from big data.Entities:
Keywords: Accuracy; Elastic-net; FINET; Inference; Julia; LASSO; Network; Stability selection
Mesh:
Year: 2020 PMID: 33172489 PMCID: PMC7653809 DOI: 10.1186/s13104-020-05371-0
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Test dataset features
| Source | Dream5 network1 |
| Data type | In silico |
| Structure of interactions | RegulonDB |
| Size (observation*genes) | 805*1643 |
| Total interactions | 278,392 |
| True interactions | 4012 |
| False interactions | 274,380 |
Fig. 1FINET parameter optimization and performance. a Frequency cutoff optimization. Frequency cutoff from 0.1 to 1.0 vs AUC, precision and normalized true positive calling (true positive callings at each cutoff/max(true positive callings at each cutoff)). This data resulted from FINET running on network1 at dream5 with following settings, m = 4, n = 500, alpha = 0.5 (see github software website for details). b Comparisons of precision of resampling m subgroups (frequency cutoff > 0.95). c, d The overall performance of FINET when m = 8 (c) and 12 (d). e Performance comparison between FINET, ARACNe-AP and C3NET. X-axis lab fo ARACNe-AP and C3NET represent p-value and alpha value, respectively, designed for significant threshold in ARACNe-AP and C3NET, while m in FINET as the number of sub-groups as shown above
Comparison of FINET and C3NET
| Metrics | FINET | ARACNe-AP | C3NET |
|---|---|---|---|
| Precision (%) | 94.2 | 0.81 | 72.3 |
| Parameter sensitivity | Very | Moderate | Not |
| Implementation | Julia | Java | R |
| Big data | Fast | Moderate | Not practical |
| Sample size | Unlimited | 65 k | Not practical |
| Gene size | Unlimited | Limited | Not practical |
| Complete net1 (s)a | 108.6920796 | 145.4659786 | 82.7275046 |
| Memory | Share | Thread share | Not |
For comparable speed test of all software, FINET parameters were set to simple settings, m = 1 with 5 cross-validations, and ARACNe was set to 5 bootstraps
aCompleted in a computer node with 40 CPUs. All genes were used to infer their interactions