| Literature DB >> 22373491 |
Jin Liu1, Kai Wang, Shuangge Ma, Jian Huang.
Abstract
We use a novel penalized approach for genome-wide association study that accounts for the linkage disequilibrium between adjacent markers. This method uses a penalty on the difference of the genetic effect at adjacent single-nucleotide polymorphisms and combines it with the minimax concave penalty, which has been shown to be superior to the least absolute shrinkage and selection operator (LASSO) in terms of estimator bias and selection consistency. Our method is implemented using a coordinate descent algorithm. The value of the tuning parameters is determined by extended Bayesian information criteria. The leave-one-out method is used to compute p-values of selected single-nucleotide polymorphisms. Its applicability to a simulated data from Genetic Analysis Workshop 17 replication one is illustrated. Our method selects three SNPs (C13S522, C13S523, and C13S524), whereas the LASSO method selects two SNPs (C13S522 and C13S523).Entities:
Year: 2011 PMID: 22373491 PMCID: PMC3287906 DOI: 10.1186/1753-6561-5-S9-S67
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Absolute lag-one autocorrelation of SNPs over the genome
Figure 2Proportion of absolute lag-one autocorrelation coefficients greater than 0.5 for 100 SNPs per segment over the genome
Figure 3|β| estimates from single-SNP linear regression over the genome
SNPs selected by the SMCP and LASSO models for trait Q1 in replicate 1
| SNP | Position | Gene | Univariate estimate | Univariate | SMCP estimate | LOO | LASSO estimate | LOO |
|---|---|---|---|---|---|---|---|---|
| C12S707 | 11065657 | 0.53 | 2.0 × 10−7 | 0.002 | 7.9 × 10−1 | |||
| C12S711 | 11065733 | 0.61 | 7.6 × 10−8 | 0.007 | 3.0 × 10−1 | |||
| C13S522 | 27899910 | 1.22 | 2.1 × 10−17 | 0.169 | 5.7 × 10−7 | 0.096 | 1.6 × 10−8 | |
| C13S523 | 27899912 | 0.94 | 2.6 × 10−22 | 0.173 | 6.2 × 10−10 | 0.134 | 1.6 × 10−13 | |
| C13S524 | 27899915 | 1.88 | 2.2 × 10−7 | 0.058 | 1.1 × 10−2 |
Figure 4Boxplots for SNPs selected using the SMCP method
Mean and standard error (in parentheses) of true positives and false positives for selected SNPs over 200 replicates for trait Q1
| SMCP model | LASSO model | Regular regression | |
|---|---|---|---|
| True positive | 3.35 (1.52) | 2.48 (1.19) | 7.03 (1.81) |
| False positive | 18.42 (36.20) | 8.64 (21.53) | 174.35 (87.87) |