| Literature DB >> 24742092 |
William Chad Young, Adrian E Raftery, Ka Yee Yeung1.
Abstract
BACKGROUND: Genome-wide time-series data provide a rich set of information for discovering gene regulatory relationships. As genome-wide data for mammalian systems are being generated, it is critical to develop network inference methods that can handle tens of thousands of genes efficiently, provide a systematic framework for the integration of multiple data sources, and yield robust, accurate and compact gene-to-gene relationships.Entities:
Mesh:
Year: 2014 PMID: 24742092 PMCID: PMC4006459 DOI: 10.1186/1752-0509-8-47
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Performance of different methods on the yeast data
| LASSO | 0.046 | 0.506 | 0.0416 | 996 | 20,469 |
| ARACNE | 0.205 | 0.502 | 0.0399 | 69 | 268 |
| CLR | 0.039 | 0.510 | 0.0435 | 8,879 | 220,942 |
| MRNET | 0.039 | 0.513 | 0.0442 | 8,737 | 214,757 |
| ScanBMA [20] | 0.601 | 0.0747 | 227 | 353 | |
| ScanBMA [3556] | 0.274 | 0.0740 | 127 | 336 | |
| iBMA [100] | 0.180 | 0.517 | 593 | 2,702 |
AUROC is the area under the ROC curve, AUPRC is the area under the precision-recall curve, and TP and FP are the numbers of true positive and false positive edges inferred, respectively. Thus TP +FP is the number of edges in the inferred network and Precision = TP/(TP +FP). ScanBMA was applied to the transformed data using the informative edge prior and Zellner’s g-prior for the model parameters. The superscript indicates the value of nvar. Expected precision and AUPRC from random guessing is 0.0380.
Performance of different versions of ScanBMA on the yeast data, either on the original scale (Orig) or transformed (Trans)
| Trans | 20 | 0.601 | 227 | 353 | |||
| Trans | 3556 | 0.274 | 0.0740 | 127 | 336 | ||
| Trans | 20 | BIC, inform | 0.244 | 0.590 | 0.0616 | 200 | 619 |
| Orig | 20 | 0.274 | 0.586 | 0.0680 | 552 | 1460 | |
| Trans | 20 | 0.175 | 0.499 | 0.0395 | 34 | 160 |
For the priors, inform refers to the informative prior, while Guelzim refers to the prior probability of 2.76/6000 for all possible relationships.
Figure 1Yeast precision-recall curves. Precision-Recall curves for different methods on the yeast data. ScanBMA was run using the g-prior, transformed data, and informative prior with nvar = 20 and 3556.
Average CPU time per target gene of different methods for the yeast data
| LASSO | 4.1 |
| ARACNE | 70.4 |
| CLR | 7.9 |
| MRNET | >500 |
| ScanBMA [20] | 0.04 |
| ScanBMA [3556] | 11.2 |
| iBMA [20] | 0.08 |
| iBMA [3556] | 85 |
ScanBMA was run with g-prior, transformed data, informative prior. Superscript indicates value of nvar.
Average performance of different methods on the DREAM4 10-gene networks
| LASSO | 0.190 | 0.731 | 0.487 | 62 | 265 |
| ebdbnet | 0.704 | 0.438 | 28 | 27 | |
| ARACNE | 0.304 | 0.668 | 0.388 | 35 | 80 |
| CLR | 0.215 | 0.681 | 0.397 | 50 | 183 |
| MRNET | 0.215 | 0.709 | 0.409 | 53 | 193 |
| ScanBMA | 0.432 | 35 | 46 |
ScanBMA was run with the original data. The true positive (TP) and false positive (FP) columns are totaled across all 5 networks. There are 71 true edges across the 5 networks.
Figure 210-gene Precision-Recall curves. Precision-Recall curves for various methods on network 1 of the 10-gene networks from the DREAM4 competition. This network has 15 true edges.
Figure 3DREAM4 10-gene network visual comparison.
Average performance of different methods on the DREAM4 100-gene networks
| LASSO | 0.035 | 0.643 | 0.073 | 571 | 15757 |
| ebdbnet | 0.054 | 0.643 | 0.043 | 182 | 3201 |
| ARACNE | 0.114 | 0.589 | 0.106 | 208 | 1621 |
| CLR | 0.035 | 0.699 | 0.123 | 678 | 18669 |
| MRNET | 0.035 | 689 | 18784 | ||
| ScanBMA | 0.657 | 0.101 | 193 | 1062 |
ScanBMA run with original data. The true positive (TP) and false positive (FP) columns are totaled across all 5 networks. There are 1,024 true edges across the 5 networks
Figure 4100-gene Precision-Recall curves. Precision-Recall curves for various methods on network 1 of the 100-gene networks from the DREAM4 competition.