| Literature DB >> 22172045 |
Junliang Shang1, Junying Zhang, Yan Sun, Dan Liu, Daojun Ye, Yaling Yin.
Abstract
BACKGROUND: Epistasis is recognized fundamentally important for understanding the mechanism of disease-causing genetic variation. Though many novel methods for detecting epistasis have been proposed, few studies focus on their comparison. Undertaking a comprehensive comparison study is an urgent task and a pathway of the methods to real applications.Entities:
Mesh:
Year: 2011 PMID: 22172045 PMCID: PMC3259123 DOI: 10.1186/1471-2105-12-475
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Classification of the methods that detect epistasis. All methods can be classified into three categories according to their search strategies, i.e., exhaustive search, stochastic search, and heuristic search. Methods with bold names are described and evaluated in detail. Detailed information of these methods is provided in supplementary table S1 in additional file 1.
Figure 2Detection power of the methods on 100-SNP non-noise datasets.
Figure 3Detection power of the methods on 10000-SNP non-noise datasets.
Figure 4Detection power of the methods on 100-SNP noise datasets with 5% missing data.
Figure 5Detection power of the methods on 100-SNP noise datasets with 5% genotyping error.
Figure 6Detection power of the methods on 100-SNP noise datasets with 20% phenocopy.
Degree of Robustness (DOR) values of the methods to the noise of missing data, genotyping error and phenocopy.
| Noise Types | Models | Power | TEAM | BOOST | SNPRuler | AntEpiSeeker | epiMODE |
|---|---|---|---|---|---|---|---|
| Missing Data | Model 1 | -- | -- | 0.0000 | 1.0000 | 0.0000 | |
| -- | -- | 0.0000 | 1.0000 | 0.0000 | |||
| -- | -- | 0.0000 | 1.0000 | 0.0000 | |||
| Model 2 | -- | -- | 0.0000 | 0.0000 | 0.0000 | ||
| -- | -- | 0.0000 | 0.0000 | 0.0000 | |||
| -- | -- | 0.0000 | 0.0433 | 0.0000 | |||
| Model 3 | -- | -- | 0.0000 | 1.0000 | 0.0000 | ||
| -- | -- | 0.2925 | 1.0000 | 0.0000 | |||
| -- | -- | 0.5188 | 1.0000 | 0.0000 | |||
| Model 4 | -- | -- | 1.0000 | 1.0000 | 0.0000 | ||
| -- | -- | 1.0000 | 1.0000 | 0.0000 | |||
| -- | -- | 1.0000 | 0.7607 | 0.0000 | |||
| Model 5 | -- | -- | 1.0000 | 0.0000 | |||
| -- | -- | 1.0000 | 0.0000 | ||||
| -- | -- | 1.0000 | 0.0000 | ||||
| Model 6 | -- | -- | 1.0000 | 0.0164 | 0.0000 | ||
| -- | -- | 1.0000 | 0.0214 | 0.0000 | |||
| -- | -- | 1.0000 | 0.0214 | 0.0000 | |||
| Model 7 | -- | -- | 1.0000 | 1.0000 | 0.0000 | ||
| -- | -- | 1.0000 | 1.0000 | 0.0000 | |||
| -- | -- | 1.0000 | 1.0000 | 0.0000 | |||
| Model 8 | -- | -- | 0.8399 | 1.0000 | 0.0000 | ||
| -- | -- | 0.8399 | 1.0000 | 0.0000 | |||
| -- | -- | 0.8407 | 1.0000 | 0.0000 | |||
| Model 9 | -- | -- | 1.0000 | 1.0000 | 0.0000 | ||
| -- | -- | 1.0000 | 1.0000 | 0.0000 | |||
| -- | -- | 1.0000 | 1.0000 | 0.0000 | |||
| Genotyping Error | Model 1 | 0.0000 | 0.0000 | 1.0000 | 0.0000 | ||
| 0.9153 | 0.0000 | 0.0000 | 1.0000 | 0.0000 | |||
| 0.9170 | 0.0000 | 0.0000 | 1.0000 | 0.0000 | |||
| Model 2 | 0.1531 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | ||
| 0.0000 | 0.0000 | 0.0000 | 0.0000 | ||||
| 0.0000 | 1.0000 | 0.0014 | 0.0000 | ||||
| Model 3 | 1.0000 | 0.0007 | 0.0000 | 1.0000 | 0.0000 | ||
| 1.0000 | 0.0007 | 0.0000 | 1.0000 | 0.0000 | |||
| 1.0000 | 0.0000 | 1.0000 | 0.0000 | ||||
| Model 4 | 0.0031 | 1.0000 | 1.0000 | 0.0000 | |||
| 0.0031 | 1.0000 | 1.0000 | 0.0000 | ||||
| 0.0031 | 1.0000 | 1.0000 | 0.9191 | 0.0000 | |||
| Model 5 | 0.0051 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | ||
| 0.0051 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | |||
| 0.0051 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | |||
| Model 6 | 0.8026 | 1.0000 | 1.0000 | 0.8415 | 0.0000 | ||
| 0.8026 | 1.0000 | 1.0000 | 0.8415 | 0.0000 | |||
| 0.8026 | 1.0000 | 1.0000 | 0.8415 | 0.0000 | |||
| Model 7 | 0.0058 | 1.0000 | 0.8399 | 1.0000 | 0.0000 | ||
| 0.0058 | 1.0000 | 0.8399 | 1.0000 | 0.0000 | |||
| 0.0058 | 1.0000 | 0.8399 | 0.0000 | ||||
| Model 8 | 0.0000 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | ||
| 0.0000 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | |||
| 0.0000 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | |||
| Model 9 | 0.5050 | 1.0000 | 0.6862 | 1.0000 | 0.0000 | ||
| 0.5050 | 1.0000 | 0.6862 | 1.0000 | 0.0000 | |||
| 0.5050 | 1.0000 | 0.6862 | 1.0000 | 0.0000 | |||
| Phenocopy | Model 1 | 0.0000 | 0.0000 | 0.0005 | 0.0000 | ||
| 0.0000 | 0.0000 | 0.0027 | 0.0000 | ||||
| 0.0001 | 0.0000 | 0.0643 | 0.0000 | ||||
| Model 2 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | ||
| 0.0000 | 0.1531 | 0.0000 | 0.0000 | ||||
| 0.0000 | 0.5785 | 0.0003 | 0.0000 | ||||
| Model 3 | 0.0014 | 0.0000 | 0.0000 | 0.0023 | 0.0000 | ||
| 0.0153 | 0.0000 | 0.0001 | 0.0261 | 0.0000 | |||
| 0.1096 | 0.0000 | 0.0013 | 0.1542 | 0.0000 | |||
| Model 4 | 0.7763 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | ||
| 0.7763 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | |||
| 0.7763 | 1.0000 | 1.0000 | 0.9393 | 0.0000 | |||
| Model 5 | 0.5485 | 0.8808 | 0.1096 | 0.8006 | 0.0000 | ||
| 0.5485 | 0.8808 | 0.1096 | 0.9195 | 0.0000 | |||
| 0.6171 | 1.0000 | 0.1153 | 0.9195 | 0.0000 | |||
| Model 6 | 0.8513 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | ||
| 0.8513 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | |||
| 0.8513 | 1.0000 | 1.0000 | 1.0000 | 0.0000 | |||
| Model 7 | 0.1207 | 0.9203 | 0.3906 | 0.8315 | 0.0000 | ||
| 0.1207 | 0.9203 | 0.3906 | 0.8315 | 0.0000 | |||
| 0.2275 | 0.9601 | 0.5445 | 0.0000 | ||||
| Model 8 | 0.0000 | 1.0000 | 0.9195 | 1.0000 | 0.0000 | ||
| 0.0000 | 1.0000 | 0.9195 | 1.0000 | 0.0000 | |||
| 0.0124 | 1.0000 | 0.9000 | 1.0000 | 0.0000 | |||
| Model 9 | 0.5050 | 1.0000 | 1.0000 | 0.0000 | |||
| 0.5050 | 1.0000 | 1.0000 | 0.0000 | ||||
| 0.5050 | 1.0000 | 1.0000 | 0.0000 | ||||
There are three types of noise added into datasets respectively. For each model with certain type of noise, each method has three DORs, since three forms of detection power are introduced. Theoretically, DORs are ranged from 0 to 1. However, realistically, there are some DORs (only a few) in table larger than 1. Most DORs with italic fonts are caused by detection power computation precision and their detection power differences are not greater than 0.01. DORs with bold fonts, which are only occurred on robustness analysis of SNPRuler to phenocopy on Model 1 and Model 2, are described and evaluated in detail.
Figure 7Sensitivity of the methods at 0.01 FPR.
Figure 8Left-side ROC curves of the methods on datasets.
Running time (minutes) of the methods on Sim1 ~ Sim6.
| Methods | Sim1 | Sim2 | Sim3 | Sim4 | Sim5 | Sim6 |
|---|---|---|---|---|---|---|
| TEAM | 0.099 | 0.219 | 3.955 | 7.885 | 350.1 | 695.7 |
| BOOST | 0.003 | 0.006 | 0.053 | 0.086 | 3.098 | 4.142 |
| SNPRuler | 0.019 | 0.026 | 0.348 | 0.667 | 30.88 | 58.26 |
| AntEpiSeeker | 9.857 | 19.11 | 12.96 | 27.11 | 51.36 | 104.2 |
| epiMODE | 0.604 | 0.841 | 1607 | 3175 | >20d* | >20d* |
* represents running time is presented in days.