| Literature DB >> 22962455 |
Hongjie Zhu1, Lexin Li, Hua Zhou.
Abstract
MOTIVATION: Association tests based on next-generation sequencing data are often under-powered due to the presence of rare variants and large amount of neutral or protective variants. A successful strategy is to aggregate genetic information within meaningful single-nucleotide polymorphism (SNP) sets, e.g. genes or pathways, and test association on SNP sets. Many existing methods for group-wise tests require specific assumptions about the direction of individual SNP effects and/or perform poorly in the presence of interactions.Entities:
Mesh:
Year: 2012 PMID: 22962455 PMCID: PMC3436833 DOI: 10.1093/bioinformatics/bts406
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.LD structures of the 146 SNPs in TG (top left) and 187 SNPs in COL6A3 (bottom right)
Fig. 2.Histograms of MAFs for SNPs in TG (left), COL6A3 (middle) and the entire GAW17 dataset (right)
Results of simulation studies based on sequence data of genes TG and COL6A3
Each study focuses on one genetic model mimicking a specific type of true genetic effect. Under ‘Genetic Effect’ are the true genetic effects that generate the quantitative trait, where X{j}'s are SNPs in descending order according to their MAFs. The common variants (j ≤ 10) are selected to have low pairwise LD. The binary trait is determined from the quantitative trait and serves as the response variable in the simulation studies. The numbers under the names of different methods are their empirical power in different studies or Type-I error.
Fig. 3.Empirical power of kSIR with WF kernel varies with SNR. Gene TG is used for simulation. The nine scenarios in Table 1 are examined. CV, common variant; RV, rare variant