| Literature DB >> 22373266 |
Allan J Motyer1, Chris McKendry, Sally Galbraith, Susan R Wilson.
Abstract
Model selection procedures for simultaneous analysis of all single-nucleotide polymorphisms in genome-wide association studies are most suitable for making full use of the data for a complex disease study. In this paper we consider a penalized regression using the LASSO procedure and show that post-processing of the penalized-regression results with subsequent stepwise selection may lead to improved identification of causal single-nucleotide polymorphisms.Entities:
Year: 2011 PMID: 22373266 PMCID: PMC3287859 DOI: 10.1186/1753-6561-5-S9-S24
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Common SNPs in model
| SNPs with MAF > 0.01 in the simulating model for Q1. Gene | SNP | MAF | SNP effect size, |
|---|---|---|---|
| C1S6533 | 0.011478 | 0.56190 | |
| C13S431 | 0.017217 | 0.74136 | |
| C13S522 | 0.027977 | 0.61830 | |
| C13S523 | 0.066714 | 0.64997 | |
| C14S1734 | 0.012195 | 0.21203 | |
| C4S1878 | 0.164993 | 0.13573 | |
| C4S1884 | 0.020803 | 0.29558 |
Mean number of times SNPs in simulating model were selected over 200 replicates
| SNP | Lasso with | + AIC | + BIC | Lasso with | + AIC | + BIC |
|---|---|---|---|---|---|---|
| C1S6533 | 0.445 | 0.440 | 0.365 | 0.040 | 0.040 | 0.040 |
| C13S431 | 0.775 | 0.665 | 0.555 | 0.185 | 0.185 | 0.180 |
| C13S522 | 0.985 | 0.970 | 0.880 | 0.850 | 0.850 | 0.850 |
| C13S523 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| C14S1734 | 0.085 | 0.065 | 0.035 | 0.000 | 0.000 | 0.000 |
| C4S1878 | 0.375 | 0.265 | 0.200 | 0.045 | 0.045 | 0.045 |
| C4S1884 | 0.425 | 0.385 | 0.285 | 0.055 | 0.055 | 0.055 |
| Total correct | 4.09 | 3.79 | 3.32 | 2.175 | 2.175 | 2.170 |
| Total selected | 28.05 | 21.18 | 14.30 | 3.29 | 3.22 | 3.10 |
The mean number of total SNPs selected for each of the variations of the model selection procedure are also shown.
Figure 1Distribution of RMIP values for each SNP from LASSO bagging (resample model averaging). SNPs were ordered on the x-axis by RMIP value. Of the 6,321 SNPs, 1,296 had nonzero RMIP (zero values are not plotted). The SNPs used in the simulating model are plotted in gray.