| Literature DB >> 27654840 |
Takanori Hasegawa1, Kaname Kojima2, Yosuke Kawai2, Kazuharu Misawa2, Takahiro Mimori2, Masao Nagasaki3.
Abstract
BACKGROUND: Genome-wide association studies have revealed associations between single-nucleotide polymorphisms (SNPs) and phenotypes such as disease symptoms and drug tolerance. To address the small sample size for rare variants, association studies tend to group gene or pathway level variants and evaluate the effect on the set of variants. One of such strategies, known as the sequential kernel association test (SKAT), is a widely used collapsing method. However, the reported p-values from SKAT tend to be biased because the asymptotic property of the statistic is used to calculate the p-value. Although this bias can be corrected by applying permutation procedures for the test statistics, the computational cost of obtaining p-values with high resolution is prohibitive.Entities:
Keywords: Genome wide association study; Multiple test; Rare variants
Year: 2016 PMID: 27654840 PMCID: PMC5031335 DOI: 10.1186/s12864-016-3094-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1A sample figure exemplifying the distribution of the expectation of the estimated p-value and the stop criteria in the proposed procedure. The α confidence interval d of the distribution of the estimated p-value is colored gray. B and r are the number of permutations completed and the number of permutation statistics that are greater than the original statistic s using the observed data, respectively. The stop criterion is evaluated using p±d /2 and α , which is the predefined significance level
The power comparison of SKAT, SKAT-O, and AP-SKAT aimed at testing the association between randomly selected 5 kb regions and continuous traits under the effect size = 0.4
| SKAT | SKAT-O | AP-SKAT | |||||||
|---|---|---|---|---|---|---|---|---|---|
| 10−2 | 10−3 | 10−4 | 10−2 | 10−3 | 10−4 | 10−2 | 10−3 | 10−4 | |
| 250 | 1.47E-2 | 1.67E-3 | 1.89E-4 | 1.48E-2 | 1.69E-3 | 1.95E-4 | 1.51E-2 | 1.80E-3 | 2.40E-4 |
| 500 | 2.13E-2 | 2.90E-3 | 4.03E-4 | 2.15E-2 | 2.92E-3 | 4.03E-4 | 2.17E-2 | 3.07E-3 | 4.91E-4 |
| 750 | 2.92E-2 | 4.67E-3 | 7.35E-4 | 2.94E-2 | 4.69E-3 | 7.38E-4 | 2.97E-2 | 4.74E-3 | 8.37E-4 |
| 1000 | 3.84E-2 | 6.86E-3 | 1.24E-3 | 3.86E-2 | 6.95E-3 | 1.25E-3 | 3.91E-2 | 7.08E-3 | 1.33E-3 |
| 1250 | 4.92E-2 | 9.60E-3 | 1.92E-3 | 4.93E-2 | 9.72E-3 | 1.92E-3 | 4.95E-2 | 9.71E-3 | 2.05E-3 |
| 1500 | 6.05E-2 | 1.31E-2 | 2.81E-3 | 6.07E-2 | 1.31E-2 | 2.82E-3 | 6.14E-2 | 1.32E-2 | 3.05E-3 |
The power comparison of SKAT, SKAT-O, and AP-SKAT aimed at testing the association between randomly selected 5 kb regions and continuous traits under the effect size = 0.8
| SKAT | SKAT-O | AP-SKAT | |||||||
|---|---|---|---|---|---|---|---|---|---|
| 10−2 | 10−3 | 10−4 | 10−2 | 10−3 | 10−4 | 10−2 | 10−3 | 10−4 | |
| 250 | 3.83E-2 | 6.96E-3 | 1.23E-3 | 3.85E-2 | 6.96E-3 | 1.25E-3 | 3.91E-2 | 7.02E-3 | 1.36E-3 |
| 500 | 8.60E-2 | 2.22E-2 | 5.41E-3 | 8.72E-2 | 2.23E-2 | 5.47E-3 | 8.86E-2 | 2.23E-2 | 5.66E-3 |
| 750 | 1.52E-1 | 4.83E-2 | 1.45E-2 | 1.52E-1 | 4.84E-2 | 1.46E-2 | 1.55E-1 | 4.88E-2 | 1.52E-2 |
| 1000 | 2.21E-1 | 8.19E-2 | 2.96E-2 | 2.24E-1 | 8.50E-2 | 2.99E-2 | 2.26E-1 | 8.53E-2 | 3.02E-2 |
| 1250 | 2.98E-1 | 1.30E-1 | 5.10E-2 | 2.99E-1 | 1.31E-1 | 5.29E-2 | 3.01E-1 | 1.32E-1 | 5.37E-2 |
| 1500 | 3.70E-1 | 1.91E-1 | 8.11E-2 | 3.74E-1 | 1.93E-1 | 8.28E-2 | 3.75E-1 | 1.90E-1 | 8.49E-2 |
The power comparison of SKAT, SKAT-O, and AP-SKAT aimed at testing the association between randomly selected 5 kb regions and continuous traits under the effect size = 1.2
| SKAT | SKAT-O | AP-SKAT | |||||||
|---|---|---|---|---|---|---|---|---|---|
| 10−2 | 10−3 | 10−4 | 10−2 | 10−3 | 10−4 | 10−2 | 10−3 | 10−4 | |
| 250 | 1.01E-1 | 2.72E-2 | 7.04E-3 | 1.01E-1 | 2.75E-2 | 7.11E-3 | 1.03E-1 | 2.73E-2 | 7.42E-3 |
| 500 | 2.57E-1 | 1.04E-1 | 3.90E-2 | 2.59E-1 | 1.05E-1 | 3.94E-2 | 2.64E-1 | 1.06E-1 | 4.13E-2 |
| 750 | 4.16E-1 | 2.32E-1 | 1.07E-1 | 4.19E-1 | 2.34E-1 | 1.09E-1 | 4.21E-1 | 2.33E-1 | 1.12E-1 |
| 1000 | 5.06E-1 | 3.64E-1 | 2.18E-1 | 5.07E-1 | 3.64E-1 | 2.20E-1 | 5.11E-1 | 3.63E-1 | 2.18E-1 |
| 1250 | 5.79E-1 | 4.62E-1 | 3.36E-1 | 5.81E-1 | 4.64E-1 | 3.39E-1 | 5.83E-1 | 4.62E-1 | 3.37E-1 |
| 1500 | 6.68E-1 | 5.01E-1 | 4.29E-1 | 6.66E-1 | 5.01E-1 | 4.32E-1 | 6.72E-1 | 5.02E-1 | 4.28E-1 |
The power comparison of SKAT, SKAT-O, and AP-SKAT aimed at testing the association between randomly selected 5 kb regions and continuous traits under the effect size = 1.6
| SKAT | SKAT-O | AP-SKAT | |||||||
|---|---|---|---|---|---|---|---|---|---|
| 10−2 | 10−3 | 10−4 | 10−2 | 10−3 | 10−4 | 10−2 | 10−3 | 10−4 | |
| 250 | 2.19E-1 | 8.09E-2 | 2.90E-2 | 2.21E-1 | 8.30E-2 | 2.95E-2 | 2.24E-1 | 8.32E-2 | 2.97E-2 |
| 500 | 4.84E-1 | 3.05E-1 | 1.63E-1 | 4.80E-1 | 3.04E-1 | 1.64E-1 | 4.85E-1 | 3.01E-1 | 1.67E-1 |
| 750 | 5.99E-1 | 4.83E-1 | 3.69E-1 | 5.99E-1 | 4.82E-1 | 3.71E-1 | 6.09E-1 | 4.80E-1 | 3.66E-1 |
| 1000 | 7.42E-1 | 5.42E-1 | 4.88E-1 | 7.41E-1 | 5.42E-1 | 4.89E-1 | 7.48E-1 | 5.46E-1 | 4.87E-1 |
| 1250 | 8.50E-1 | 6.52E-1 | 5.14E-1 | 8.50E-1 | 6.54E-1 | 5.13E-1 | 8.50E-1 | 6.54E-1 | 5.16E-1 |
| 1500 | 9.19E-1 | 7.50E-1 | 5.93E-1 | 9.20E-1 | 7.48E-1 | 5.90E-1 | 9.18E-1 | 7.48E-1 | 5.94E-1 |
The power comparison of SKAT, SKAT-O, and AP-SKAT aimed at testing the association between randomly selected 5 kb regions and continuous traits under the effect size = 2.0
| SKAT | SKAT-O | AP-SKAT | |||||||
|---|---|---|---|---|---|---|---|---|---|
| 10−2 | 10−3 | 10−4 | 10−2 | 10−3 | 10−4 | 10−2 | 10−3 | 10−4 | |
| 250 | 3.77E-1 | 2.00E-1 | 8.70E-2 | 3.82E-1 | 2.01E-1 | 8.84E-2 | 3.83E-1 | 1.97E-1 | 8.88E-2 |
| 500 | 6.14E-1 | 4.89E-1 | 3.88E-1 | 6.14E-1 | 4.90E-1 | 3.88E-1 | 6.24E-1 | 4.89E-1 | 3.79E-1 |
| 750 | 8.16E-1 | 6.12E-1 | 5.01E-1 | 8.16E-1 | 6.07E-1 | 5.01E-1 | 8.16E-1 | 6.14E-1 | 5.02E-1 |
| 1000 | 9.30E-1 | 7.68E-1 | 6.10E-1 | 9.30E-1 | 7.66E-1 | 6.11E-1 | 9.27E-1 | 7.69E-1 | 6.12E-1 |
| 1250 | 9.81E-1 | 8.96E-1 | 7.48E-1 | 9.80E-1 | 8.94E-1 | 7.42E-1 | 9.78E-1 | 8.78E-1 | 7.42E-1 |
| 1500 | 9.95E-1 | 9.61E-1 | 8.58E-1 | 9.95E-1 | 9.60E-1 | 8.59E-1 | 9.94E-1 | 9.48E-1 | 8.49E-1 |
Type I errors of SKAT-O and AP-SKAT to evaluate the inflation of p-values using 1000 Genomes Project data under the noises according to the Student’s t -distribution with 5 degrees of freedom
| Sample Size | 500 | 1000 | 1500 | 2000 |
|---|---|---|---|---|
| SKAT-O | 356 | 178 | 202 | 142 |
| AP-SKAT | 348 | 153 | 189 | 130 |
Fig. 2Comparison of computation times between the standard and permutation procedures using 1000 Genomes Project data, WTCCC, and HapMap. Solid and dotted lines indicate the runtimes of the standard and adaptive procedures, respectively
Fig. 3Comparison plot with several confidential intervals using the 1000 Genomes Project data, WTCCC data, and HapMap data. The comparisons of estimated p-values for the 1000 Genomes Project data, WTCCC data, and HapMap data by the standard and the adaptive procedures with a significance interval of 0.05,2.5×10−06 and 2.5×10−11. Solid and dotted lines are the base line and the Bonferroni corrected significance level (p=0.05), respectively. Circles indicate the estimated p-values of SNP sets by the standard and the adaptive procedures, and the numbers of SNP sets is 20,568,13,397,31,002, respectively. Both the vertical and the horizontal axes in these figures are logarithmic scale