| Literature DB >> 23113980 |
Verena Zuber1, A Pedro Duarte Silva, Korbinian Strimmer.
Abstract
BACKGROUND: Identification of causal SNPs in most genome wide association studies relies on approaches that consider each SNP individually. However, there is a strong correlation structure among SNPs that needs to be taken into account. Hence, increasingly modern computationally expensive regression methods are employed for SNP selection that consider all markers simultaneously and thus incorporate dependencies among SNPs.Entities:
Mesh:
Year: 2012 PMID: 23113980 PMCID: PMC3558454 DOI: 10.1186/1471-2105-13-284
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Software used in the comparison study
| CAR | R package | [ |
| COR | R package | [ |
| NEG | [ | |
| MCP | R package | [ |
| BOOST | R package | [ |
| LASSO | R package | [ |
The R packages are available from the R software archive CRAN at http://cran.r-project.org/.
Figure 1Average true positives resulting from SNP rankings of the investigated approaches for phenotype Q1 (top row) and Q2 (bottom row). For Q1 there are 38 true SNPs and for Q2 71 true SNPs.
Median model sizes and the corresponding interquartile ranges (IQR) as well as the average true positives for phenotypes Q1 and Q2 for all investigated methods summarized across the 200 repetitions (first three columns)
| | ||||||
|---|---|---|---|---|---|---|
| | ||||||
| Q1 | | | | | | |
| | CAR | 51 (53) | 0.23 | |||
| | COR | 176 (108) | 0.88 | |||
| | NEG | 1390 (118) | 14.38 | 6.60 | ||
| | MCP | 20 (5) | 3.95 | 0.12 | ||
| | BOOST | 53 (5) | 5.50 | 0.25 | ||
| | LASSO | 37 (31) | 4.89 | 0.18 | ||
| Q2 | | | | | | |
| | CAR | 31 (38) | 0.29 | |||
| | COR | 1 (7) | 0.00 | |||
| | NEG | 1632 (755) | 20.21 | 14.50 | ||
| | MCP | 29 (5) | 2.75 | 0.28 | ||
| | BOOST | 59 (6) | 3.82 | 0.59 | ||
| LASSO | 15 (36) | 1.50 | 0.14 | |||
For comparison, the last three columns show the average true positives at the specified model size for CAR, COR and RND. The best performing method is shown in bold, the second best in italic.
Median model sizes and the corresponding interquartile ranges (IQR) for phenotype Q4
| Q4 | | ||||||
| | Model Size | CAR | COR | NEG | MCP | BOOST | LASSO |
| | Median | 34 | 0 | 1900 | 27 | 59 | 1 |
| IQR | 40 | 1 | 2713 | 4 | 6 | 6 | |
Figure 2Frequency of occurrence of each true SNP among the top 100 SNPs selected by each approach for phenotype Q1 (top row) and for Q2 (lower row) for the 200 repetitions. Note that the SNPs are ordered according to the first column.
True SNPs found among the top 100 SNPs identified by CAR scores in at least 50 of the 200 repetitions for Q1 and Q2
| Q1 | | | | | 0.014 |
| | ARNT | C1S6533 | 88 | 0.011478 | 0.56190 | |
| | FLT1 | C13S431 | 110 | 0.017217 | 0.74136 | 0.147 |
| | FLT1 | C13S522 | 200 | 0.027977 | 0.61830 | 0.147 |
| | FLT1 | C13S523 | 200 | 0.066714 | 0.64997 | 0.147 |
| | FLT1 | C13S524 | 164 | 0.004304 | 0.62223 | 0.147 |
| | KDR | C4S1877 | 145 | 0.000717 | 1.07706 | 0.111 |
| | KDR | C4S1878 | 101 | 0.164993 | 0.13573 | 0.111 |
| | KDR | C4S1884 | 95 | 0.020803 | 0.29558 | 0.111 |
| | VEGFA | C6S2981 | 69 | 0.002152 | 1.20645 | |
| | VEGFC | C4S4935 | 91 | 0.000717 | 1.35726 | |
| Q2 | | | | | 0.008 |
| | BCHE | C3S4869 | 54 | 0.000717 | 1.01569 | 0.001 |
| | BCHE | C3S4875 | 59 | 0.000717 | 1.09484 | 0.001 |
| | LPL | C8S442 | 69 | 0.015782 | 0.49459 | |
| | SIRT1 | C10S3048 | 54 | 0.002152 | 0.83224 | 0.330 |
| | SIRT1 | C10S3050 | 72 | 0.002152 | 0.97060 | 0.330 |
| | VNN1 | C6S5380 | 138 | 0.170732 | 0.24437 | |
| | VNN3 | C6S5441 | 59 | 0.098278 | 0.27053 | 0.066 |
| VNN3 | C6S5449 | 57 | 0.010043 | 0.66909 | 0.066 |
The last column shows the average absolute correlation among all SNPs for Q1 and Q2 as well as the average absolute correlation for the SNPs belonging to one gene.
Proportion of common and rare variants of the true SNPs found among the top 100 SNPs
| Q1 | | ||||||
| | Proportion (%) | CAR | COR | NEG | MCP | BOOST | LASSO |
| | Common | 0.56 | 0.71 | 0.63 | 0.74 | 0.71 | 0.73 |
| | Rare | 0.44 | 0.29 | 0.37 | 0.26 | 0.29 | 0.27 |
| Q2 | | ||||||
| | Proportion (%) | CAR | COR | NEG | MCP | BOOST | LASSO |
| | Common | 0.28 | 0.41 | 0.36 | 0.44 | 0.42 | 0.43 |
| Rare | 0.72 | 0.59 | 0.64 | 0.56 | 0.58 | 0.57 | |