| Literature DB >> 28155637 |
Guangyong Zheng1, Yaochen Xu2,3, Xiujun Zhang4, Zhi-Ping Liu4, Zhuo Wang5, Luonan Chen6, Xin-Guang Zhu7.
Abstract
BACKGROUND: A gene regulatory network (GRN) represents interactions of genes inside a cell or tissue, in which vertexes and edges stand for genes and their regulatory interactions respectively. Reconstruction of gene regulatory networks, in particular, genome-scale networks, is essential for comparative exploration of different species and mechanistic investigation of biological processes. Currently, most of network inference methods are computationally intensive, which are usually effective for small-scale tasks (e.g., networks with a few hundred genes), but are difficult to construct GRNs at genome-scale.Entities:
Keywords: Gene regulatory network; Genome-wide; Parallel computing; Software
Mesh:
Year: 2016 PMID: 28155637 PMCID: PMC5260056 DOI: 10.1186/s12859-016-1324-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Workflow of the CMIP software package. First, expression datasets are used as input of the CMIP algorithm. Then the CPU or GPU programs are selected to reconstruct networks. Finally, result files recording interaction and relationship of gene pairs are generated as output
Fig. 2Diagram of threshold determination for gene interactions. Relationship between interaction and cutoff is first investigated, and then a fitting curve method based on exponential function is adopted to simulate relationship between them. Finally, the intersection of slope of the start and end sections of the fitting curve was chosen as the threshold
Effectiveness of threshold determination method under different criteria
| Criteria/manners | 0-order | 1-order | 2-order | 3-order | Percentage |
|---|---|---|---|---|---|
| Offset less than 5% | 3 | 7 | 4 | 4 | 45% |
| Offset less than 10% | 5 | 9 | 9 | 7 | 75% |
| Offset less than 20% | 7 | 10 | 10 | 10 | 93% |
Fig. 3Changes of the accuracy measurement for the CMIP algorithm. The CMIP programs were conducted on 10 benchmark datasets with 0-,1-,2- and 3-order manners to test impacts of different order parameters
Scores of various network inference methods on benchmark datasets
| Measurements | ARACNE | CLR | MI | GENIE3 | Inferelator | TIGRESS | CMIP |
|---|---|---|---|---|---|---|---|
| AUROC | 0.6689 | 0.7632 | 0.8004 | 0.7955 | 0.7014 | 0.8048 | 0.7945 |
| AUPR | 0.1465 | 0.2076 | 0.3537 | 0.3033 | 0.2172 | 0.3991 | 0.3637 |
| Average | 0.4077 | 0.4854 | 0.5771 | 0.5494 | 0.4593 | 0.6020 | 0.5791 |
Fig. 4Scores of various network inference methods. Performance of various network inference methods are compared on 10 benchmark datasets using the AUROC and AUPR measurement
Running time of different network inference programs
| Methods | CMIPa | CMIPb | MI | ARACNE | CLR | GENIE3 | Inferelator | TIGRESS |
|---|---|---|---|---|---|---|---|---|
| Multi-threads | Yes | Yes | No | No | No | No | Yes | No |
| Time (seconds) | 7 | 4 | 8 | 353 | 68 | 4715 | 2728 | 13089 |
aGPU version program, b CPU version program
Running time of the CMIP programs in pineapple GRNs reconstruction
| Programs/time (seconds) | Leaf base network | Leaf tip network |
|---|---|---|
| CMIPa | 1828 | 2277 |
| CMIPb | 2026 | 2733 |
aGPU version program, b CPU version program
Fig. 5Data processing pipeline of website for the CMIP algorithm. First, users can input transcriptomics data and submit their computing tasks through the “Network Inference” module. When a task is finished, a notifying letter will be sent to users. Simultaneously, users can check results of their tasks on the “Task query” module. In addition, users can obtain the CMIP software in the “Download” module