| Literature DB >> 25880329 |
Chung-Feng Kao1, Jia-Rou Liu2, Hung Hung3, Po-Hsiu Kuo3.
Abstract
The rapid advances in sequencing technologies and the resulting next-generation sequencing data provide the opportunity to detect disease-associated variants with a better solution, in particular for low-frequency variants. Although both common and rare variants might exert their independent effects on the risk for the trait of interest, previous methods to detect the association effects rarely consider them simultaneously. We proposed a class of test statistics, the generalized weighted-sum statistic (GWSS), to detect disease associations in the presence of common and rare variants with a case-control study design. Information of rare variants was aggregated using a weighted sum method, while signal directions and strength of the variants were considered at the same time. Permutations were performed to obtain the empirical p-values of the test statistics. Our simulation showed that, compared to the existing methods, the GWSS method had better performance in most of the scenarios. The GWSS (in particular VDWSS-t) method is particularly robust for opposite association directions, association strength, and varying distributions of minor-allele frequencies. It is therefore promising for detecting disease-associated loci. For empirical data application, we also applied our GWSS method to the Genetic Analysis Workshop 17 data, and the results were consistent with the simulation, suggesting good performance of our method. As re-sequencing studies become more popular to identify putative disease loci, we recommend the use of this newly developed GWSS to detect associations with both common and rare variants.Entities:
Mesh:
Year: 2015 PMID: 25880329 PMCID: PMC4399906 DOI: 10.1371/journal.pone.0120873
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Various test methods of the generalized weighted sum statistic (GWSS).
| Method | Weight | Summarized Scheme | Summary Statistic |
|---|---|---|---|
| (Step 1) | (Step 2) | (Step 3) | |
| WSS |
| rank-sum |
|
| ORWSS |
| rank-sum |
|
| WSS- |
|
|
|
| ORWSS- |
|
|
|
| DSS- |
|
|
|
| DWSS- |
|
|
|
| VWSS- |
|
|
|
| VORWSS- |
|
|
|
| VDSS- |
|
|
|
| VDWSS- |
|
|
|
Detection power for identical MAF distributions of signal and noise rare variants (only list methods with better performance).
|
|
| |||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| (a) | (b) | (c) | (d) | |||||||||||||||||
|
| 8 | 8 | 8 | 7 | 7 | 8 | 8 | 8 | 7 | 7 | 8 | 8 | 8 | 7 | 7 | 8 | 8 | 8 | 7 | 7 |
|
| 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 1 |
|
| 0 | 8 | 4 | 8 | 4 | 0 | 8 | 4 | 8 | 4 | 0 | 8 | 4 | 8 | 4 | 0 | 8 | 4 | 8 | 4 |
|
| 0 | 0 | 4 | 0 | 4 | 0 | 0 | 4 | 0 | 4 | 0 | 0 | 4 | 0 | 4 | 0 | 0 | 4 | 0 | 4 |
| SSU | 0.96 | 0.88 | 0.88 | 0.90 | 0.91 | 0.92 | 0.83 | 0.85 | 0.88 | 0.86 | 1.00 | 0.98 | 0.98 | 0.99 | 0.99 | 0.71 | 0.69 | 0.71 | 0.72 | 0.74 |
|
| 0.98 | 0.94 | 0.95 | 0.95 | 0.95 | 0.91 | 0.81 | 0.86 | 0.87 | 0.88 | 1.00 | 0.99 | 1.00 | 0.99 | 0.99 | 0.70 | 0.70 | 0.76 | 0.74 | 0.76 |
| SKATb | 0.96 | 0.92 | 0.91 | 0.93 | 0.95 | 0.95 | 0.89 | 0.89 | 0.94 | 0.94 | 1.00 | 0.99 | 0.99 | 0.99 | 0.99 | 0.80 | 0.78 | 0.78 | 0.82 | 0.79 |
| SKAT-C | 0.96 | 0.92 | 0.84 | 0.93 | 0.93 | 0.94 | 0.89 | 0.81 | 0.92 | 0.89 | 1.00 | 0.99 | 0.99 | 0.99 | 0.98 | 0.80 | 0.78 | 0.65 | 0.79 | 0.73 |
| SKAT-A | 0.96 | 0.92 | 0.89 | 0.91 | 0.93 | 0.94 | 0.89 | 0.86 | 0.93 | 0.91 | 1.00 | 0.99 | 0.98 | 0.98 | 0.99 | 0.80 | 0.78 | 0.70 | 0.81 | 0.75 |
| ORWSS- | 0.97 | 0.91 | 0.89 | 0.92 | 0.92 | 0.93 | 0.86 | 0.87 | 0.90 | 0.89 | 0.99 | 0.94 | 0.94 | 0.97 | 0.96 | 0.80 | 0.75 | 0.73 | 0.81 | 0.78 |
| DWSS- | 0.97 | 0.93 | 0.92 | 0.93 | 0.93 | 0.94 | 0.85 | 0.85 | 0.89 | 0.88 | 0.99 | 0.96 | 0.95 | 0.97 | 0.96 | 0.80 | 0.71 | 0.71 | 0.76 | 0.75 |
| VORWSS- | 0.97 | 0.91 | 0.89 | 0.92 | 0.92 | 0.93 | 0.86 | 0.87 | 0.90 | 0.89 | 0.99 | 0.94 | 0.95 | 0.97 | 0.97 | 0.80 | 0.74 | 0.74 | 0.81 | 0.81 |
| VDWSS- | 0.98 | 0.93 | 0.93 | 0.93 | 0.93 | 0.94 | 0.85 | 0.88 | 0.89 | 0.89 | 0.99 | 0.96 | 0.97 | 0.97 | 0.97 | 0.80 | 0.72 | 0.74 | 0.77 | 0.78 |
| VDSS- | 0.97 | 0.93 | 0.92 | 0.93 | 0.92 | 0.94 | 0.88 | 0.88 | 0.92 | 0.90 | 1.00 | 0.96 | 0.97 | 0.98 | 0.97 | 0.81 | 0.75 | 0.76 | 0.81 | 0.76 |
Fig 1Detection power (ρ = 0) for different MAF distributions of signal and noise rare variants.
The left panel considers ORj = 2 for rare variants (RVs) and 1.5 for common variant (CV), and the right panel considers ORj = 1/2 for RVs and 1/1.5 for CV. Each panel considers four scenarios: scenario1 considers 8 signal RVs and 8 noise RVs; scenario2 considers 8 signal RVs, 4 noise RVs and 4 noise CVs; scenario3 considers 7 signal RVs, 1 signal CV and 8 noise RVs; and scenario4 considers 7 signal RVs, 1 signal CV, 4 noise RVs and 4 noise CVs. The results were based on 1000 subjects (500 cases and 500 controls). All empirical p-values were calculated from 500 permutations. The detection power is defined as the proportion of test statistic attained significant level 0.05 over 1000 simulations. CMC method use 0.01 as a threshold for rare variants. SKAT1 and SKATb represents SKAT using equal weight and using beta(1,25) density function evaluated at q as the weight, respectively. SKAT-C and SKAT-A combined SKAT and sum test and adaptive sum test, respectively.
Robustness of all methods in situations of identical/different MAF distributions of signal and noise rare variants (only list methods with better performance).
| (a) One factor | (a) Two factors | |||||
|---|---|---|---|---|---|---|
| Direction | Noise RVs | Noise CVs | Noise RVs X Direction | Noise CVs X Direction | signal CVs X Direction | |
| SSU | X | Δ | X | X | X | X |
|
| Δ | X | X | Δ | X | X |
| SKATb | X (●) | X (Δ) | X (Δ) | X | X | X |
| SKAT-C | X | X | ● | X | X | X |
| SKAT-A | X | X | X (●) | X | X | X |
| ORWSS- | X | Δ | X | X | X | X |
| DWSS- | X | X | X | X | X | X |
| VORWSS- | X | Δ | X | X | X | X |
| VDWSS- | X | X | X | X | X | X |
| VDSS- | X | X | X | X | X | X |
Effects of disease liability using the Genetic Analysis Workshop 17 data (only list methods with better performance).
| Chromosome | 1 | 1 | 2 | 3 | 3 | 4 | 9 | 10 | 11 | 13 | 14 | 18 | 19 | 22 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene symbol |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| no. of RVs | 22 | 16 | 9 | 6 | 33 | 27 | 4 | 42 | 17 | 25 | 6 | 10 | 4 | 36 |
| no. of CVs | 8 | 4 | 4 | 2 | 6 | 7 | 1 | 8 | 5 | 10 | 2 | 2 | 6 | 9 |
|
| ||||||||||||||
| SSU | 0.002 | 0.005 | 0.015 |
| 0.003 | 0.007 | 0.015 | 0.005 |
|
| 0.009 | 0.009 | 0.009 | 0.003 |
|
| 0.005 | 0.018 | 0.004 | 0.039 |
| 0.007 | 0.002 | 0.009 | 0.003 | 0.003 | 0.020 | 0.011 | 0.003 | 0.019 |
| SKATb | 0.014 | 0.798 | 0.022 | 0.244 | 0.060 |
| 0.237 | 0.008 | 0.001 | 0.172 | 0.014 | 0.016 | 0.007 | 0.023 |
| SKAT-C | 0.008 | 0.002 | 0.069 | 0.121 | 0.058 |
|
|
|
| 0.006 | 0.016 | 0.004 | 0.005 | 0.009 |
| SKAT-A | 0.008 | 0.009 | 0.211 | 0.123 | 0.039 | 0.002 | 0.003 |
| 0.010 | 0.036 | 0.071 | 0.005 | 0.027 | 0.025 |
| GWSS | ||||||||||||||
| WSS- |
| 0.249 |
| 0.625 | 0.004 | 0.003 | 0.006 |
|
|
|
| 0.015 | 0.005 |
|
| ORWSS- | 0.006 | 0.002 | 0.020 |
| 0.052 | 0.059 |
| 0.009 | 0.241 | 0.008 | 0.065 | 0.004 | 0.014 | 0.179 |
| DWSS- | 0.006 | 0.002 | 0.014 | 0.002 | 0.035 | 0.046 | 0.040 | 0.008 | 0.079 | 0.004 | 0.034 | 0.012 | 0.002 | 0.079 |
| DSS- | 0.012 | 0.002 | 0.071 |
| 0.587 | 0.099 | 0.002 | 0.003 | 0.025 | 0.006 | 0.123 | 0.010 | 0.007 | 0.085 |
| VWSS- |
| 0.415 |
| 0.619 |
|
| 0.006 | 0.091 | 0.002 | 0.002 |
|
|
| 0.002 |
| VORWSS- | 0.008 |
| 0.018 | 0.002 | 0.069 | 0.099 | 0.003 | 0.017 | 0.278 | 0.018 | 0.068 | 0.005 | 0.019 | 0.150 |
| VDWSS- | 0.006 | 0.003 | 0.008 | 0.003 | 0.026 | 0.038 | 0.034 | 0.008 | 0.112 | 0.002 | 0.034 | 0.010 | 0.005 | 0.098 |
| VDSS- | 0.090 | 0.003 | 0.013 |
| 0.119 | 0.026 |
| 0.066 | 0.141 | 0.008 | 0.133 | 0.289 | 0.003 | 0.063 |