| Literature DB >> 28346824 |
Tamar Sofer1, Elizabeth D Schifano2, David C Christiani3, Xihong Lin4.
Abstract
We propose a weighted pseudolikelihood method for analyzing the association of a SNP set, example, SNPs in a gene or a genetic pathway or network, with multiple secondary phenotypes in case-control genetic association studies. To boost analysis power, we assume that the SNP-specific effects are shared across all secondary phenotypes using a scaled mean model. We estimate regression parameters using Inverse Probability Weighted (IPW) estimating equations obtained from the weighted pseudolikelihood, which accounts for case-control sampling to prevent potential ascertainment bias. To test the effect of a SNP set, we propose a weighted variance component pseudo-score test. We also propose a penalized IPW pseudolikelihood method for selecting a subset of SNPs that are associated with the multiple secondary phenotypes. We show that the proposed variable selection procedure has the oracle properties and is robust to misspecification of the correlation structure among secondary phenotypes. We select the tuning parameter using a weighted Bayesian Information-like Criterion (wBIC). We evaluate the finite sample performance of the proposed methods via simulations, and illustrate the methods by the analysis of the multiple secondary smoking behavior outcomes in a lung cancer case-control genetic association study.Entities:
Keywords: Biased sampling; High-dimensional data; SNP set analysis; Sparsity; Variable selection; Variance component test; Weighted BIC
Mesh:
Year: 2017 PMID: 28346824 PMCID: PMC5617769 DOI: 10.1111/biom.12680
Source DB: PubMed Journal: Biometrics ISSN: 0006-341X Impact factor: 2.571