| Literature DB >> 24454922 |
Wan-Yu Lin1, Xiang-Yang Lou2, Guimin Gao3, Nianjun Liu2.
Abstract
With the development of next-generation sequencing technology, there is a great demand for powerful statistical methods to detect rare variants (minor allele frequencies (MAFs)<1%) associated with diseases. Testing for each variant site individually is known to be underpowered, and therefore many methods have been proposed to test for the association of a group of variants with phenotypes, by pooling signals of the variants in a chromosomal region. However, this pooling strategy inevitably leads to the inclusion of a large proportion of neutral variants, which may compromise the power of association tests. To address this issue, we extend the [Formula: see text]-MidP method (Cheung et al., 2012, Genet Epidemiol 36: 675-685) and propose an approach (named 'adaptive combination of P-values for rare variant association testing', abbreviated as 'ADA') that adaptively combines per-site P-values with the weights based on MAFs. Before combining P-values, we first imposed a truncation threshold upon the per-site P-values, to guard against the noise caused by the inclusion of neutral variants. This ADA method is shown to outperform popular burden tests and non-burden tests under many scenarios. ADA is recommended for next-generation sequencing data analysis where many neutral variants may be included in a functional region.Entities:
Mesh:
Year: 2014 PMID: 24454922 PMCID: PMC3893264 DOI: 10.1371/journal.pone.0085728
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The workflow diagram of the ADA method.
Type-I error rates.
| nominal significance level | 0.0001 | 0.005 | 0.010 | 0.015 | 0.020 | 0.025 | 0.030 | 0.035 | 0.040 | 0.045 | 0.050 |
|
| 0.0001 | 0.0054 | 0.0102 | 0.0151 | 0.0196 | 0.0246 | 0.0295 | 0.0347 | 0.0396 | 0.0444 | 0.0492 |
|
| 0.0001 | 0.0048 | 0.0096 | 0.0142 | 0.0191 | 0.0237 | 0.0288 | 0.0337 | 0.0384 | 0.0434 | 0.0482 |
|
| 0.0001 | 0.0050 | 0.0101 | 0.0149 | 0.0199 | 0.0248 | 0.0298 | 0.0348 | 0.0398 | 0.0448 | 0.0498 |
|
| 0.0001 | 0.0050 | 0.0100 | 0.0148 | 0.0199 | 0.0247 | 0.0297 | 0.0351 | 0.0400 | 0.0451 | 0.0500 |
|
| 0.0001 | 0.0046 | 0.0096 | 0.0146 | 0.0196 | 0.0245 | 0.0294 | 0.0346 | 0.0399 | 0.0449 | 0.0501 |
|
| 0.0001 | 0.0046 | 0.0098 | 0.0149 | 0.0198 | 0.0247 | 0.0296 | 0.0346 | 0.0398 | 0.0449 | 0.0498 |
|
| 0.0001 | 0.0052 | 0.0103 | 0.0153 | 0.0204 | 0.0254 | 0.0304 | 0.0356 | 0.0402 | 0.0452 | 0.0502 |
|
| 0.0001 | 0.0050 | 0.0100 | 0.0150 | 0.0201 | 0.0250 | 0.0302 | 0.0352 | 0.0404 | 0.0453 | 0.0503 |
Figure 2Comparison of power by r (the percentage of deleterious variants among the d causal variants), PAR, and d (the number of causal variants).
The figure shows the power comparison by r (left column, given PAR = 0.3% and d = 20), PAR (middle column, given d = 20 and r = 80%), and d (right column, given r = 80% and PAR = 0.3%). The nominal significance level was set at 0.05 (top row) and 0.01 (bottom row), respectively.
Analysis of the Dallas Heart Study data.
|
|
|
|
|
|
|
|
| |
|
| 0.024 | 0.012 | 0.028 | 0.011 | 0.584 | 0.070 | 0.184 | 0.486 |
a P-values were estimated based on 104 permutations.
Power (%) of the ADA method with two sets of candidate P-value truncation thresholds.
| candidate | Given PAR = 0.3% and | Given | Given | |||||||||||||
|
| PAR (%) |
| ||||||||||||||
| 5 | 20 | 50 | 80 | 100 | 0.0 | 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 3 | 5 | 10 | 15 | 20 | |
| Nominal significance level = 5% | ||||||||||||||||
| 0.10, 0.11,… 0.20 | 29.97 | 23.17 | 33.28 | 67.41 | 88.24 | 4.84 | 18.45 | 45.06 | 67.41 | 82.03 | 90.47 | 14.00 | 24.16 | 40.58 | 55.80 | 67.41 |
| 0.05, 0.06,…, 0.25 | 29.38 | 23.50 | 35.04 | 68.73 | 89.31 | 5.04 | 18.56 | 46.09 | 68.73 | 83.60 | 91.91 | 14.64 | 25.50 | 42.24 | 57.30 | 68.73 |
| Nominal significance level = 1% | ||||||||||||||||
| 0.10, 0.11,…, 0.20 | 13.00 | 8.17 | 17.99 | 51.10 | 78.32 | 1.00 | 8.39 | 29.50 | 51.10 | 68.09 | 80.03 | 4.68 | 10.99 | 24.01 | 38.65 | 51.10 |
| 0.05, 0.06,…, 0.25 | 12.25 | 8.22 | 18.74 | 51.98 | 79.17 | 0.93 | 8.46 | 30.03 | 51.98 | 69.45 | 81.22 | 4.88 | 11.50 | 24.93 | 39.59 | 51.98 |