Literature DB >> 17873152

Probability of detecting disease-associated single nucleotide polymorphisms in case-control genome-wide association studies.

Mitchell H Gail1, Ruth M Pfeiffer, William Wheeler, David Pee.   

Abstract

Some case-control genome-wide association studies (CCGWASs) select promising single nucleotide polymorphisms (SNPs) by ranking corresponding p-values, rather than by applying the same p-value threshold to each SNP. For such a study, we define the detection probability (DP) for a specific disease-associated SNP as the probability that the SNP will be "T-selected," namely have one of the top T largest chi-square values (or smallest p-values) for trend tests of association. The corresponding proportion positive (PP) is the fraction of selected SNPs that are true disease-associated SNPs. We study DP and PP analytically and via simulations, both for fixed and for random effects models of genetic risk, that allow for heterogeneity in genetic risk. DP increases with genetic effect size and case-control sample size and decreases with the number of nondisease-associated SNPs, mainly through the ratio of T to N, the total number of SNPs. We show that DP increases very slowly with T, and the increment in DP per unit increase in T declines rapidly with T. DP is also diminished if the number of true disease SNPs exceeds T. For a genetic odds ratio per minor disease allele of 1.2 or less, even a CCGWAS with 1000 cases and 1000 controls requires T to be impractically large to achieve an acceptable DP, leading to PP values so low as to make the study futile and misleading. We further calculate the sample size of the initial CCGWAS that is required to minimize the total cost of a research program that also includes follow-up studies to examine the T-selected SNPs. A large initial CCGWAS is desirable if genetic effects are small or if the cost of a follow-up study is large.

Entities:  

Mesh:

Year:  2007        PMID: 17873152     DOI: 10.1093/biostatistics/kxm032

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  17 in total

1.  Using Ascertainment for Targeted Resequencing to Increase Power to Identify Causal Variants.

Authors:  M D Swartz; B Peng; C Reyes-Gibby; S Shete
Journal:  Stat Interface       Date:  2011       Impact factor: 0.582

2.  Probability that a two-stage genome-wide association study will detect a disease-associated snp and implications for multistage designs.

Authors:  M H Gail; R M Pfeiffer; W Wheeler; D Pee
Journal:  Ann Hum Genet       Date:  2008-07-24       Impact factor: 1.670

3.  MAX-rank: a simple and robust genome-wide scan for case-control association studies.

Authors:  Qizhai Li; Kai Yu; Zhaohai Li; Gang Zheng
Journal:  Hum Genet       Date:  2008-05-20       Impact factor: 4.132

4.  Novel rank-based approaches for discovery and replication in genome-wide association studies.

Authors:  Chia-Ling Kuo; Dmitri V Zaykin
Journal:  Genetics       Date:  2011-07-29       Impact factor: 4.562

5.  Statistical inference on the penetrances of rare genetic mutations based on a case-family design.

Authors:  Hong Zhang; Sylviane Olschwang; Kai Yu
Journal:  Biostatistics       Date:  2010-02-23       Impact factor: 5.899

6.  Sample size and power analysis for sparse signal recovery in genome-wide association studies.

Authors:  Jichun Xie; T Tony Cai; Hongzhe Li
Journal:  Biometrika       Date:  2011-06       Impact factor: 2.445

7.  ON MODEL SELECTION STRATEGIES TO IDENTIFY GENES UNDERLYING BINARY TRAITS USING GENOME-WIDE ASSOCIATION DATA.

Authors:  Zheyang Wu; Hongyu Zhao
Journal:  Stat Sin       Date:  2012       Impact factor: 1.261

8.  Using cases to strengthen inference on the association between single nucleotide polymorphisms and a secondary phenotype in genome-wide association studies.

Authors:  Huilin Li; Mitchell H Gail; Sonja Berndt; Nilanjan Chatterjee
Journal:  Genet Epidemiol       Date:  2010-07       Impact factor: 2.135

9.  Comparison of approaches for incorporating new information into existing risk prediction models.

Authors:  Sonja Grill; Donna P Ankerst; Mitchell H Gail; Nilanjan Chatterjee; Ruth M Pfeiffer
Journal:  Stat Med       Date:  2016-12-11       Impact factor: 2.373

10.  Statistical power of model selection strategies for genome-wide association studies.

Authors:  Zheyang Wu; Hongyu Zhao
Journal:  PLoS Genet       Date:  2009-07-31       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.