| Literature DB >> 22373158 |
Kristin L Ayers1, Chrysovalanto Mamasoula, Heather J Cordell.
Abstract
Testing for association between multiple markers and a phenotype can not only capture untyped causal variants in weak linkage disequilibrium with nearby typed markers but also identify the effect of a combination of markers. We propose a sliding window approach that uses multimarker genotypes as variables in a penalized regression. We investigate a penalty with three separate components: (1) a group least absolute shrinkage and selection operator (LASSO) that selects multimarker genotypes in a gene to be included in or excluded from the model, (2) an allele-sharing penalty that encourages multimarker genotypes with similar alleles to have similar coefficients, and (3) a penalty that shrinks the size of coefficients while performing model selection. The penalized likelihood is minimized with a cyclic coordinate descent algorithm, allowing quick coefficient estimation for a large number of markers. We compare our method to single-marker analysis and a gene-based sparse group LASSO on the Genetic Analysis Workshop 17 data for quantitative trait Q2. We found that all of the methods were underpowered to detect the simulated rare causal variants at the low false-positive rates desired in association studies. However, the sparse group LASSO on multi-marker genotypes seems to provide some advantage over the sparse group LASSO applied to single SNPs within genes, giving further evidence that there may be an advantage to modeling combinations of rare variant alleles over modeling them individually.Entities:
Year: 2011 PMID: 22373158 PMCID: PMC3287934 DOI: 10.1186/1753-6561-5-S9-S92
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Power versus false-positive rates for the five methods. OURS, our method presented in this paper; SS, single-marker test; GLS, sparse group LASSO on single SNPs; GLG, sparse group LASSO on multimarker genotypes; GLA, sparse group LASSO with an allele-sharing penalty on multimarker genotypes.
Figure 2Detection signals for three different methods. The three plots present the average detection rates for the single-marker analysis (top), the sparse group LASSO on multimarker genotypes middle), and our method (bottom) for each SNP or multimarker genotype. The p-value threshold (5 × 10−4) and penalty strengths (16.0 and 0.90, respectively) were chosen to give false-positive rates of about 0.005.