| Literature DB >> 26415849 |
Pau Bellot1,2, Catharina Olsen3,4, Philippe Salembier5, Albert Oliveras-Vergés6, Patrick E Meyer7.
Abstract
BACKGROUND: In the last decade, a great number of methods for reconstructing gene regulatory networks from expression data have been proposed. However, very few tools and datasets allow to evaluate accurately and reproducibly those methods. Hence, we propose here a new tool, able to perform a systematic, yet fully reproducible, evaluation of transcriptional network inference methods.Entities:
Mesh:
Year: 2015 PMID: 26415849 PMCID: PMC4587916 DOI: 10.1186/s12859-015-0728-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Reviews of GRN reconstruction methods and their characteristics
| Review | [ | [ | [ | This study |
|---|---|---|---|---|
| Number of variables | 100 | ∈ [1643,5950] | ∈ [10,100] | ∈ [300,2000] |
| Topologies | Yeast | E. coli & S. cerevisiae & S. aureus | Yeast & E. coli | Synthetic & Yeast & E. coli |
| Number of methods compared | 4 | 37 | 29 | 10 |
| Simulators | SynTReN | GNW | GNW | Rogers & GNW & SynTReN |
| Number of experiments | ∈ [20,200] | ∈ [160,805] | ∈ [10,100] | ∼150 |
| Impact of number of experiments | – | – |
|
|
| Impact of noise | – | – | – |
|
| Dataset availability | – |
| – |
|
| Benchmark extension | – |
| – |
|
| Possibility to change parameters | – | – | – |
|
Fig. 1Workflow of the network evaluation process
Datasources used in this study and their characteristics
| Datasource | Name | Topology | Experiments | Genes | Edges |
|---|---|---|---|---|---|
|
| R1 | Power-law | 1000 | 1000 | 1350 |
| tail topology | |||||
|
| S1 | E. coli | 800 | 300 | 468 |
|
| S2 | E. coli | 1000 | 1000 | 4695 |
|
| G1 | E. coli | 1565 | 1565 | 7264 |
|
| G2 | Yeast | 2000 | 2000 | 10392 |
Evaluation of the computational complexity. Mean CPU time in seconds of each reconstruction method on the different datasources in a 2 x Intel Xeon E5 2670 8C (2.6 GHz)
| Datasource | ARACNE | C3NET | CLR | GeneNet | Genie3 | MRNET | MutRank | MRNETB | PCIT | Zscore |
|---|---|---|---|---|---|---|---|---|---|---|
| R1 | 2.483 | 2.367 | 0.409 | 9.377 | 1310.486 | 7.200 | 0.638 | 11.195 | 11.352 | 0.086 |
| S1 | 0.106 | 0.215 | 0.059 | 0.917 | 183.266 | 0.120 | 0.056 | 0.406 | 0.333 | 0.010 |
| S2 | 1.775 | 1.904 | 0.349 | 9.504 | 950.648 | 7.101 | 0.585 | 10.907 | 10.898 | 0.091 |
| G1 | 10.442 | 6.795 | 1.079 | 29.612 | 2839.319 | 31.385 | 1.865 | 46.255 | 47.106 | 0.260 |
| G2 | 25.551 | 12.189 | 1.750 | 53.792 | 4115.408 | 60.143 | 3.431 | 100.375 | 103.085 | 0.418 |
Included GRN algorithms. GRN algorithms included in the current version (1.0) of the netbenchmark Bioconductor package
| GRN Algorithms | Wrapper function |
|---|---|
| ARACNE [ | aracne.wrap |
| C3NET [ | c3net.wrap |
| CLR [ | clr.wrap |
| GeneNet [ | genenet.wrap |
| Genie3 [ | genie3.wrap |
| MutRank [ | mutrank.wrap |
| MRNET/B [ | mrnet.wrap & mrnetb.wrap |
| PCIT [ | pcit.wrap |
| Zscore [ | zscore.wrap |
Performances of the various GRN inference methods on the datasources. AUPR in the top 20 % of the possible connections with a undirected evaluation for each GRN inference method on the different datasources of the benchmark with a 20 % local Gaussian noise and 10 % of global lognormal noise. The best statistically significant results tested with a Wilcoxon test are highlighted for each datasource. Results obtained with current version (1.0) of the package and are updated online
| Datasource | ARACNE | C3NET | CLR | GeneNet | Genie3 | MRNET | MutRank | MRNETB | PCIT | Zscore | Random | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| R1 | mean | 0.004 | 0.002 | 0.005 | 0.140 | 0.024 | 0.005 | 0.042 | 0.005 |
| 0.140 | <0.001 |
|
| 1.1 | 0.789 | 1.22 | 16 | 2.97 | 1.26 | 7.27 | 1.26 | 16.1 | 13.6 | 0.0265 | |
| S1 | mean | 0.039 | 0.032 | 0.139 | 0.062 | 0.134 | 0.109 | 0.063 | 0.118 | 0.060 | 0.028 | 0.001 |
|
| 8.02 | 7.92 | 1.98 | 8.25 | 3.51 | 9.45 | 2.25 | 5.83 | 1.44 | 13.8 | 0.211 | |
| S2 | mean | 0.006 | 0.006 | 0.042 | 0.013 | 0.036 | 0.021 | 0.021 | 0.021 | 0.01 | 0.003 | <0.001 |
|
| 1.19 | 1.63 | 1.55 | 1.56 | 1 | 2.76 | 0.959 | 2.01 | 0.522 | 1.46 | 0.1 | |
| G1 | mean | 0.106 | 0.100 |
| 0.085 | 0.108 |
| 0.034 | 0.084 | 0.063 | 0.001 | <0.001 |
|
| 7.46 | 7.58 | 7.83 | 2.91 | 6.66 | 9.48 | 2.26 | 3.27 | 2.69 | 0.15 | 0.0141 | |
| G2 | mean | 0.101 | 0.095 | 0.106 | 0.037 | 0.069 |
| 0.025 | 0.058 | 0.044 | <0.001 | <0.001 |
|
| 11.4 | 9.95 | 4.49 | 1.62 | 3.44 | 9.49 | 1.43 | 2.23 | 2.16 | 0.0917 | 0.0265 | |
p < 0.05
Fig. 2Boxplots of performance. Each box represents the statistics of a method with the ranking performance across all datasources, the smaller the rank the better. The white dot represents the median of the distribution, the box goes form the first to third quartile, while whiskers are lines drawn from the ends of the box to the maximum and minimum of the data excluding outliers that are represented with a mark outside the whiskers
Results of the study on noise sensitivity. Mean AUPR in the top 20 % of the possible connetions with a undirected evaluation with respect to intensity (κ %) of Gaussian local noise (σ ). The best results are highlighted. Results obtained with current version (1.0) of the package and are updated online
| Datasource |
| ARACNE | C3NET | CLR | GeneNet | Genie3 | MRNET | MutRank | MRNETB | PCIT | Zscore | Random |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| R1 | 0 | 0.008 | 0.004 | 0.010 | 0.140 | 0.025 | 0.010 | 0.040 | 0.010 |
| 0.133 | 0.001 |
| 25 | 0.004 | 0.002 | 0.006 | 0.134 | 0.022 | 0.006 | 0.038 | 0.005 | 0.167 | 0.132 | 0.001 | |
| 50 | 0.002 | 0.001 | 0.003 | 0.121 | 0.020 | 0.003 | 0.031 | 0.003 | 0.150 | 0.130 | 0.001 | |
| 75 | 0.001 | 0.001 | 0.002 | 0.086 | 0.018 | 0.002 | 0.025 | 0.002 | 0.126 |
| 0.001 | |
| 100 | 0.001 | 0.001 | 0.001 | 0.006 | 0.015 | 0.001 | 0.017 | 0.001 | 0.097 |
| 0.001 | |
| S1 | 0 | 0.091 | 0.140 | 0.132 | 0.114 | 0.137 | 0.199 | 0.060 | 0.120 | 0.023 | 0.072 | 0.002 |
| 25 | 0.040 | 0.033 |
| 0.059 | 0.133 | 0.112 | 0.062 | 0.126 | 0.059 | 0.035 | 0.002 | |
| 50 | 0.027 | 0.021 |
| 0.031 | 0.121 | 0.097 | 0.067 | 0.121 | 0.066 | 0.022 | 0.002 | |
| 75 | 0.021 | 0.014 |
| 0.024 | 0.104 | 0.076 | 0.066 | 0.098 | 0.063 | 0.011 | 0.003 | |
| 100 | 0.023 | 0.017 |
| 0.013 | 0.095 | 0.072 | 0.066 | 0.089 | 0.063 | 0.010 | 0.001 | |
| S2 | 0 | 0.014 | 0.039 | 0.046 | 0.025 | 0.044 | 0.060 | 0.021 | 0.039 | 0.008 | 0.001 | 0.001 |
| 25 | 0.006 | 0.006 |
| 0.020 | 0.038 | 0.022 | 0.021 | 0.028 | 0.012 | 0.005 | 0.001 | |
| 50 | 0.004 | 0.003 | 0.044 | 0.016 | 0.033 | 0.016 | 0.020 | 0.026 | 0.012 | 0.002 | 0.001 | |
| 75 | 0.003 | 0.002 |
| 0.012 | 0.028 | 0.013 | 0.019 | 0.021 | 0.011 | 0.002 | 0.001 | |
| 100 | 0.003 | 0.002 |
| 0.008 | 0.022 | 0.011 | 0.017 | 0.015 | 0.010 | 0.001 | 0.001 | |
| G1 | 0 | 0.195 | 0.145 | 0.199 | 0.084 | 0.129 | 0.218 | 0.036 | 0.111 | 0.061 | 0.001 | 0.001 |
| 25 | 0.113 | 0.107 |
| 0.082 | 0.111 | 0.142 | 0.035 | 0.086 | 0.060 | <0.001 | 0.001 | |
| 50 | 0.069 | 0.065 |
| 0.074 | 0.091 | 0.096 | 0.031 | 0.073 | 0.057 | 0.001 | 0.001 | |
| 75 | 0.041 | 0.038 |
| 0.055 | 0.068 | 0.061 | 0.024 | 0.059 | 0.049 | <0.001 | 0.001 | |
| 100 | 0.024 | 0.020 |
| 0.031 | 0.045 | 0.034 | 0.017 | 0.042 | 0.038 | 0.001 | 0.001 | |
| G2 | 0 | 0.163 | 0.147 | 0.131 | 0.038 | 0.077 | 0.177 | 0.025 | 0.062 | 0.045 | 0.002 | 0.001 |
| 25 | 0.097 | 0.092 | 0.103 | 0.036 | 0.067 |
| 0.024 | 0.059 | 0.043 | <0.001 | 0.001 | |
| 50 | 0.042 | 0.040 |
| 0.030 | 0.052 | 0.071 | 0.021 | 0.046 | 0.041 | <0.001 | <0.001 | |
| 75 | 0.019 | 0.018 |
| 0.019 | 0.038 | 0.038 | 0.016 | 0.034 | 0.034 | <0.001 | 0.001 | |
| 100 | 0.011 | 0.009 |
| 0.008 | 0.025 | 0.021 | 0.011 | 0.026 | 0.025 | <0.001 | 0.001 |
p < 0.05
Fig. 3Plots of performance with different noise intensities. Each line represents a method (color coded), the mean performance over the ten runs is presented
Results of the study on the sensitivity with respect to the number of experiments. Mean AUPR in the top 20 % of the possible connetions with a undirected evaluation with respect to number of experiments (# exp). The best results are highlighted. Results obtained with current version (1.0) of the package and are updated online
| Datasource | # exp | ARACNE | C3NET | CLR | GeneNet | Genie3 | MRNET | MutRank | MRNETB | PCIT | Zscore | Random |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| R1 | 20 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.002 | 0.001 |
|
| 0.001 |
| 50 | 0.001 | 0.001 | 0.001 | 0.001 | 0.004 | 0.001 | 0.007 | 0.001 | 0.056 | 0.046 | <0.001 | |
| 200 | 0.008 | 0.004 | 0.010 | 0.181 | 0.035 | 0.010 | 0.060 | 0.010 | 0.226 | 0.179 | 0.001 | |
| 800 | 0.160 | 0.114 | 0.166 | 0.723 | 0.249 | 0.166 | 0.306 | 0.167 | 0.764 | 0.726 | <0.001 | |
| S1 | 20 | 0.021 | 0.016 |
| 0.096 | 0.097 | 0.077 | 0.058 | 0.089 | 0.055 | 0.005 | 0.002 |
| 50 | 0.027 | 0.020 |
| 0.099 | 0.122 | 0.091 | 0.060 | 0.110 | 0.057 | 0.017 | 0.002 | |
| 200 | 0.036 | 0.030 |
| 0.066 | 0.135 | 0.108 | 0.064 | 0.122 | 0.059 | 0.034 | 0.002 | |
| 800 | 0.064 | 0.054 | 0.141 | 0.053 |
| 0.139 | 0.065 | 0.130 | 0.059 | 0.058 | 0.003 | |
| S2 | 20 | 0.003 | 0.003 |
| 0.025 | 0.026 | 0.014 | 0.019 | 0.019 | 0.011 | 0.003 | 0.001 |
| 50 | 0.005 | 0.004 |
| 0.025 | 0.033 | 0.018 | 0.021 | 0.025 | 0.013 | 0.002 | 0.001 | |
| 200 | 0.007 | 0.006 |
| 0.018 | 0.040 | 0.024 | 0.022 | 0.028 | 0.013 | 0.004 | 0.001 | |
| 800 | 0.011 | 0.010 |
| 0.010 |
| 0.028 | 0.022 | 0.027 | 0.012 | 0.013 | 0.001 | |
| G1 | 20 | 0.014 | 0.012 | 0.020 | 0.001 | 0.015 | 0.017 | 0.009 | 0.024 |
| 0.001 | 0.001 |
| 50 | 0.051 | 0.047 |
| 0.056 | 0.065 | 0.064 | 0.020 | 0.063 | 0.048 | <0.001 | 0.001 | |
| 200 | 0.136 | 0.127 |
| 0.083 | 0.122 | 0.164 | 0.038 | 0.090 | 0.061 | 0.001 | 0.001 | |
| 800 |
| 0.215 | 0.222 | 0.091 | 0.156 | 0.238 | 0.049 | 0.105 | 0.071 | 0.001 | 0.001 | |
| G2 | 20 | 0.012 | 0.010 | 0.022 | 0.001 | 0.012 | 0.017 | 0.007 |
|
| 0.001 | 0.001 |
| 50 | 0.040 | 0.038 |
| 0.026 | 0.042 | 0.059 | 0.015 | 0.047 | 0.034 | <0.001 | 0.001 | |
| 200 | 0.137 | 0.127 | 0.120 | 0.037 | 0.079 |
| 0.028 | 0.063 | 0.046 | <0.001 | 0.001 | |
| 800 |
| 0.214 | 0.157 | 0.036 | 0.100 | 0.218 | 0.034 | 0.070 | 0.051 | <0.001 | 0.001 |
p < 0.05
Fig. 4Plots of performance with different number of experiments. Each line represents a method (color coded), the mean performance over the ten runs is presented