| Literature DB >> 24265830 |
Donghyung Lee1, Silviu-Alin Bacanu.
Abstract
Genome wide association studies have been usually analyzed in a univariate manner. The commonly used univariate tests have one degree of freedom and assume an additive mode of inheritance. The experiment-wise significance of these univariate statistics is obtained by adjusting for multiple testing. Next generation sequencing studies, which assay 10-20 million variants, are beginning to come online. For these studies, the strategy of additive univariate testing and multiple testing adjustment is likely to result in a loss of power due to (1) the substantial multiple testing burden and (2) the possibility of a non-additive causal mode of inheritance. To reduce the power loss we propose: a new method (1) to summarize in a single statistic the strength of the association signals coming from all not-very-rare variants in a linkage disequilibrium block and (2) to incorporate, in any linkage disequilibrium block statistic, the strength of the association signals under multiple modes of inheritance. The proposed linkage disequilibrium block test consists of the sum of squares of nominally significant univariate statistics. We compare the performance of this method to the performance of existing linkage disequilibrium block/gene-based methods. Simulations show that (1) extending methods to combine testing for multiple modes of inheritance leads to substantial power gains, especially for a recessive mode of inheritance, and (2) the proposed method has a good overall performance. Based on simulation results, we provide practical advice on choosing suitable methods for applied analyses.Entities:
Mesh:
Year: 2013 PMID: 24265830 PMCID: PMC3827222 DOI: 10.1371/journal.pone.0080540
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Number of cases/controls (n), relative risk of heterozygote to non-risk allele homozygote (R 1) and relative risk of risk allele homozygote to non-risk allele homozygote (R 2) used at each simulation setting under the single causal variant scenario of Experiment II.
| Genetic Model | ||||
|---|---|---|---|---|
| Number of causal variants ( | Additive | Dominant | Recessive | |
| 1 | 1,000 (1.3, 1.6) | 1,000 (1.5, 1.5) | 1,000 (1, 3) | |
Within each cell, the settings are presented as n (R 1, R 2).
Number of cases/controls (n) and effect size (δ) of any causal allele used at each simulation setting under the two and five non-interacting causal variant scenario of Experiment II.
| Genetic Model | ||||
|---|---|---|---|---|
| Number of causal variants ( | Additive | Dominant | Recessive | |
| 2 | 1,000 (0.005) | 1,000 (0.008) | 1,000 (0.03) | |
| 5 | 1,000 (0.002) | 1,000 (0.003) | 1,000 (0.01) | |
Within each cell, the settings are presented as n (δ).
Figure 1Empirical size of the test under Experiment II as a function of the method type and the ADR adjustment status.
The nominal type I error rate is α=0.05. The bars represent the 95% confidence interval for the size of test. Abbreviations for methods are as follows: B - Bonferroni, S - Simes, G - GATES, V-SS - VEGAS-SS, SS-x - SS-T with x=1,...,9, V-minP - VEGAS-minP, PC - principal component method, S-PC - Simes adjustment of Simes and PC methods, SKAT - sequence kernel association test, aSUM - data-adaptive sum test.
Figure 2Empirical power of SS-T methods under Experiment II as a function of the mode of inheritance, the number of causal variants in a LD block (k) and the ADR adjustment status.
The nominal type I error rate is α=0.05. The bars represent the 95% confidence interval for the power of test. See Figure 1 for background and abbreviations.
Figure 3Empirical power of the main methods under Experiment II as a function of the mode of inheritance, the number of causal variants in a LD block (k) and the ADR adjustment status.
The nominal type I error rate is α=0.05. The bars represent the 95% confidence interval for the power of test. See Figure 1 for background and abbreviations.