| Literature DB >> 28381878 |
Yu-Mei Li1,2, Chao Xu2, Yang Xiang1, Cheng Peng2,3, Hong-Wen Deng2.
Abstract
Advances in DNA sequencing technology have been promoting the development of sequencing studies to identify rare variants associated with complex traits. Adaptive strategy can be effective to reduce the noise provided by non-causal variants. However, the existing adaptive strategies depend on many assumptions. In this paper, we proposed a new adaptive strategy using entropy theory for association analysis. This entropy-based strategy is based on the magnitude of association between variants and disease and does not depend on the detailed association pattern with causal variants. We considered multi-marker test and Sum test with collapsing method to construct the entropy-based adaptive strategy. Using simulation studies, we investigated the performance of our method for rare variant analyses as well as for common variant analyses with multi-marker test and compared it with several existing adaptive strategies. The results showed that our method can improve the power and achieve good performance when there is a large number of non-causal variants and effects of causal variants are in the same direction for rare variant.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28381878 PMCID: PMC5584517 DOI: 10.1038/jhg.2017.39
Source DB: PubMed Journal: J Hum Genet ISSN: 1434-5161 Impact factor: 3.172
The estimated type I error rates when there are 20 rare variants
| 500 | 0.058 (0.002) | 0.051 (0.009) | 0.052 (0.006) | 0.050 (0.007) | 0.053 (0.007) | 0.052 (0.005) |
| 1000 | 0.051 (0.005) | 0.056 (0.004) | 0.054 (0.005) | 0.047 (0.005) | 0.047 (0.005) | 0.049 (0.005) |
| 1500 | 0.052 (0.005) | 0.051 (0.004) | 0.050 (0.005) | 0.049 (0.004) | 0.053 (0.005) | 0.050 (0.004) |
| 2000 | 0.053 (0.004) | 0.051 (0.005) | 0.052 (0.005) | 0.052 (0.005) | 0.051 (0.003) | 0.051 (0.004) |
Note: shown in parentheses is the standard error.
The estimated type I error rates and power for common variant analysis with a number of common variants where the sample size is 1000
| 0.05 (0.004) | 0.05 (0.004) | 0.052 (0.005) | 0.051 (0.005) | 0.053 (0.004) | 0.908 (0.01) | 0.807 (0.004) | 0.766 (0.006) | 0.725 (0.007) | 0.614 (0.008) | |
| 0.049 (0.004) | 0.053 (0.005) | 0.052 (0.005) | 0.053 (0.005) | 0.055 (0.004) | 0.931 (0.011) | 0.841 (0.009) | 0.806 (0.004) | 0.771 (0.008) | 0.635 (0.009) | |
Note: shown in parentheses is the standard error.
Empirical power for RV analysis
| w= | w=w | w= | w=w | w= | w=w | w= | w=w | w= | w=w | |
|---|---|---|---|---|---|---|---|---|---|---|
| Sum | 0.970 (0.005) | 0.972 (0.006) | 0.761 (0.007) | 0.762 (0.008) | 0.549 (0.005) | 0.560 (0.007) | 0.349 (0.010) | 0.340 (0.009) | 0.210 (0.009) | 0.207 (0.010) |
| aSum-Ord | 0.958 (0.009) | 0.960 (0.007) | 0.902 (0.006) | 0.900 (0.006) | 0.811 (0.006) | 0.814 (0.005) | 0.705 (0.009) | 0.710 (0.006) | 0.571 (0.008) | 0.575 (0.007) |
| Price-VT | 0.952 (0.012) | 0.958 (0.010) | 0.864 (0.011) | 0.866 (0.010) | 0.701 (0.006) | 0.700 (0.005) | 0.689 (0.008) | 0.691 (0.007) | 0.563 (0.009) | 0.561 (0.007) |
| aSum-E | 0.951 (0.011) | 0.955 (0.010) | 0.898 (0.011) | 0.899 (0.011) | 0.806 (0.004) | 0.804 (0.004) | 0.717 (0.007) | 0.717 (0.006) | 0.611 (0.008) | 0.616 (0.009) |
| | 0.910 (0.006) | 0.910 (0.006) | 0.811 (0.009) | 0.811 (0.009) | 0.740 (0.010) | 0.740 (0.010) | 0.678 (0.012) | 0.678 (0.012) | 0.506 (0.013) | 0.506 (0.013) |
| | 0.929 (0.011) | 0.929 (0.011) | 0.840 (0.010) | 0.840 (0.010) | 0.758 (0.011) | 0.758 (0.011) | 0.687 (0.011) | 0.687 (0.011) | 0.571 (0.012) | 0.571 (0.012) |
| Sum | 0.935 (0.008) | 0.936 (0.007) | 0.750 (0.008) | 0.768 (0.009) | 0.523 (0.009) | 0.529 (0.008) | 0.345 (0.007) | 0.343 (0.009) | 0.213 (0.013) | 0.212 (0.012) |
| aSum-Ord | 0.942 (0.009) | 0.947 (0.009) | 0.901 (0.010) | 0.919 (0.011) | 0.704 (0.010) | 0.702 (0.009) | 0.701 (0.010) | 0.707 (0.011) | 0.625 (0.012) | 0.630 (0.011) |
| Price-VT | 0.918 (0.004) | 0.911 (0.005) | 0.850 (0.006) | 0.856 (0.006) | 0.669 (0.006) | 0.670 (0.007) | 0.678 (0.011) | 0.686 (0.011) | 0.579 (0.010) | 0.569 (0.011) |
| aSum-E | 0.928 (0.008) | 0.931 (0.007) | 0.893 (0.008) | 0.895 (0.007) | 0.720 (0.009) | 0.722 (0.008) | 0.712 (0.010) | 0.716 (0.011) | 0.623 (0.009) | 0.628 (0.010) |
| | 0.801 (0.009) | 0.801 (0.009) | 0.773 (0.010) | 0.773 (0.010) | 0.686 (0.011) | 0.686 (0.011) | 0.651 (0.011) | 0.651 (0.011) | 0.573 (0.012) | 0.573 (0.012) |
| | 0.818 (0.009) | 0.818 (0.009) | 0.800 (0.010) | 0.800 (0.010) | 0.714 (0.009) | 0.714 (0.009) | 0.702 (0.010) | 0.702 (0.010) | 0.593 (0.011) | 0.593 (0.011) |
| Sum | 0.300 (0.006) | 0.313 (0.005) | 0.267 (0.008) | 0.285 (0.009) | 0.216 (0.006) | 0.227 (0.006) | 0.187 (0.009) | 0.193 (0.009) | 0.168 (0.008) | 0.171 (0.007) |
| aSum-Ord | 0.519 (0.006) | 0.521 (0.006) | 0.449 (0.012) | 0.464 (0.011) | 0.420 (0.008) | 0.419 (0.007) | 0.402 (0.009) | 0.410 (0.010) | 0.300 (0.008) | 0.315 (0.009) |
| Price-VT | 0.473 (0.008) | 0.477 (0.007) | 0.473 (0.009) | 0.480 (0.009) | 0.410 (0.009) | 0.417 (0.010) | 0.416 (0.012) | 0.413 (0.011) | 0.291 (0.009) | 0.287 (0.008) |
| aSum-E | 0.405 (0.003) | 0.419 (0.006) | 0.373 (0.008) | 0.371 (0.008) | 0.333 (0.008) | 0.332 (0.009) | 0.302 (0.009) | 0.304 (0.006) | 0.218 (0.007) | 0.230 (0.009) |
| | 0.406 (0.003) | 0.406 (0.003) | 0.329 (0.008) | 0.329 (0.008) | 0.316 (0.006) | 0.316 (0.006) | 0.308 (0.008) | 0.308 (0.008) | 0.256 (0.010) | 0.256 (0.010) |
| | 0.426 (0.008) | 0.426 (0.008) | 0.353 (0.009) | 0.353 (0.009) | 0.335 (0.007) | 0.335 (0.007) | 0.330 (0.009) | 0.330 (0.009) | 0.311 (0.010) | 0.311 (0.010) |
Note: scenario A, causal variants have the same effect. OR=3; scenario B, causal variants have different effects with the same direction. OR∈[1.2, 3] for causal variants; scenario C, causal variants have different effects. OR∈[1.2, 3] for half of causal variants and OR∈[0.2, 0.8] for the rest causal variants. w=1 means no weighting and w=wMB means weighting. MAF of causal variants∈[0.001, 0.01]. The sample size is 1000. Shown in parentheses is the standard error.