| Literature DB >> 25519366 |
Michael D Swartz1, Taebeom Kim2, Jiangong Niu3, Robert K Yu4, Sanjay Shete4, Iuliana Ionita-Laza5.
Abstract
We are now well into the sequencing era of genetic analysis, and methods to investigate rare variants associated with disease remain in high demand. Currently, the more common rare variant analysis methods are burden tests and variance component tests. This report introduces a burden test known as the modified replication based sum statistic and evaluates its performance, and the performance of other common burden and variance component tests under the setting of a small sample size (103 total cases and controls) using the Genetic Analysis Workshop 18 simulated data with complete knowledge of the simulation model. Specifically we look at the variable threshold sum statistic, replication-based sum statistics, the C-alpha, and sequence kernel association test. Using minor allele frequency thresholds of less than 0.05, we find that the modified replication based sum statistic is competitive with all methods and that using 103 individuals leads to all methods being vastly underpowered. Much larger sample sizes are needed to confidently find truly associated genes.Entities:
Year: 2014 PMID: 25519366 PMCID: PMC4143716 DOI: 10.1186/1753-6561-8-S1-S13
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Power for analysis with minor allele frequency (MAF) less than 0.05. This figure shows the power of each method to detect each gene as causal for the 200 replicates of the simulation data when including only rarer variants with empirical MAF less than 0.05. The methods perform similarly within each gene. Especially note the performance of the modified replication based sum statistic S(solid black bar) relative to its predecessors Sand S. For these data, all 3 methods are very competitive. Below the horizontal axis is the percent signal for each gene (percent of causal rare variants divided by the total number of rare variants multiplied by 100).
Mean power for each method across all genes
| Method | Hypertension | Nongenetic |
|---|---|---|
| Stau | 0.056 | 0.048 |
| Scomb | 0.083 | 0.049 |
| Smax | 0.055 | 0.046 |
| C-alpha | 0.043 | 0.050 |
| VT | 0.069 | 0.050 |
| SKAT | 0.051 | 0.055 |
This table reports the mean power for each of the 6 methods investigated for those variants with minor allele frequencies less than 0.05. We report the power for the simulated hypertension phenotype as well as for a nongenetic phenotype for type 1 error.
SKAT, sequence kernel association test; VT, variable threshold.