| Literature DB >> 31831092 |
Lifeng Liu1, Pengfei Wang2, Jingbo Meng2, Lili Chen1, Wensheng Zhu2, Weijun Ma1.
Abstract
In recent years, there has been an increasing interest in detecting disease-related rare variants in sequencing studies. Numerous studies have shown that common variants can only explain a small proportion of the phenotypic variance for complex diseases. More and more evidence suggests that some of this missing heritability can be explained by rare variants. Considering the importance of rare variants, researchers have proposed a considerable number of methods for identifying the rare variants associated with complex diseases. Extensive research has been carried out on testing the association between rare variants and dichotomous, continuous or ordinal traits. So far, however, there has been little discussion about the case in which both genotypes and phenotypes are ordinal variables. This paper introduces a method based on the γ-statistic, called OV-RV, for examining disease-related rare variants when both genotypes and phenotypes are ordinal. At present, little is known about the asymptotic distribution of the γ-statistic when conducting association analyses for rare variants. One advantage of OV-RV is that it provides a robust estimation of the distribution of the γ-statistic by employing the permutation approach proposed by Fisher. We also perform extensive simulations to investigate the numerical performance of OV-RV under various model settings. The simulation results reveal that OV-RV is valid and efficient; namely, it controls the type I error approximately at the pre-specified significance level and achieves greater power at the same significance level. We also apply OV-RV for rare variant association studies of diastolic blood pressure.Entities:
Keywords: contingency tables; ordinal variables; rare variants; γ-statistic
Year: 2019 PMID: 31831092 PMCID: PMC7044977 DOI: 10.1017/S0016672319000120
Source DB: PubMed Journal: Genet Res (Camb) ISSN: 0016-6723 Impact factor: 1.588
Cross-contingency table of genotype at the m loci by phenotype.
| Genotype level at the | ||||
|---|---|---|---|---|
| Phenotype level | 0 | 1 | … | |
| 0 | … | |||
| 1 | … | |||
| 2 | … | |||
| ⋮ | ⋮ | ⋮ | ⋮ | ⋮ |
| … | ||||
Estimated type I errors of the eight methods in Simulation I.
| Test | 20 rare variants | 40 rare variants | |
|---|---|---|---|
| OV-RV | 0.007 | 0.008 | |
| SKAT-O-C | 0.011 | 0.005 | |
| SKAT-O | 0.011 | 0.007 | |
| SKAT-C | 0.009 | 0.007 | |
| SKAT | 0.009 | 0.009 | |
| CAST | 0.007 | 0.007 | |
| SUM | 0.006 | 0.011 | |
| WSS | 0.009 | 0.007 | |
| OV-RV | 0.049 | 0.049 | |
| SKAT-O-C | 0.042 | 0.046 | |
| SKAT-O | 0.038 | 0.049 | |
| SKAT-C | 0.040 | 0.036 | |
| SKAT | 0.043 | 0.049 | |
| CAST | 0.035 | 0.051 | |
| SUM | 0.037 | 0.047 | |
| WSS | 0.033 | 0.056 |
Estimated power results of the eight methods based on the generated genotypes.
| Number of rare variants | Test | |||||
|---|---|---|---|---|---|---|
| 20 (12 causal) | OV-RV | 0.761 | 0.504 | 0.221 | 0.070 | |
| SKAT-O-C | 0.672 | 0.400 | 0.123 | 0.026 | ||
| SKAT-O | 0.276 | 0.151 | 0.054 | 0.006 | ||
| SKAT-C | 0.201 | 0.058 | 0.013 | 0.003 | ||
| SKAT | 0.012 | 0.006 | 0.001 | 0.001 | ||
| CAST | 0.392 | 0.252 | 0.100 | 0.023 | ||
| SUM | 0.403 | 0.251 | 0.096 | 0.023 | ||
| WSS | 0.306 | 0.191 | 0.079 | 0.015 | ||
| 40 (18 causal) | OV-RV | 0.788 | 0.559 | 0.269 | 0.067 | |
| SKAT-O-C | 0.756 | 0.475 | 0.179 | 0.032 | ||
| SKAT-O | 0.331 | 0.199 | 0.072 | 0.016 | ||
| SKAT-C | 0.229 | 0.077 | 0.022 | 0.005 | ||
| SKAT | 0.010 | 0.009 | 0.004 | 0.005 | ||
| CAST | 0.388 | 0.241 | 0.105 | 0.031 | ||
| SUM | 0.434 | 0.276 | 0.123 | 0.028 | ||
| WSS | 0.357 | 0.215 | 0.095 | 0.022 | ||
| 20 (12 causal) | OV-RV | 0.914 | 0.723 | 0.447 | 0.160 | |
| SKAT-O-C | 0.875 | 0.665 | 0.337 | 0.115 | ||
| SKAT-O | 0.564 | 0.393 | 0.186 | 0.071 | ||
| SKAT-C | 0.518 | 0.268 | 0.099 | 0.039 | ||
| SKAT | 0.063 | 0.039 | 0.024 | 0.019 | ||
| CAST | 0.634 | 0.459 | 0.231 | 0.105 | ||
| SUM | 0.684 | 0.498 | 0.259 | 0.125 | ||
| WSS | 0.627 | 0.460 | 0.241 | 0.108 | ||
| 40 (18 causal) | OV-RV | 0.930 | 0.776 | 0.474 | 0.177 | |
| SKAT-O-C | 0.920 | 0.730 | 0.416 | 0.126 | ||
| SKAT-O | 0.617 | 0.423 | 0.238 | 0.088 | ||
| SKAT-C | 0.549 | 0.276 | 0.084 | 0.040 | ||
| SKAT | 0.074 | 0.040 | 0.029 | 0.028 | ||
| CAST | 0.625 | 0.444 | 0.267 | 0.100 | ||
| SUM | 0.710 | 0.548 | 0.326 | 0.127 | ||
| WSS | 0.644 | 0.484 | 0.281 | 0.112 |
Estimated type I errors of the TG gene of the eight methods.
| Test | 20 rare variants | 40 rare variants | |
|---|---|---|---|
| OV-RV | 0.012 | 0.010 | |
| SKAT-O-C | 0.008 | 0.007 | |
| SKAT-O | 0.009 | 0.007 | |
| SKAT-C | 0.006 | 0.011 | |
| SKAT | 0.006 | 0.007 | |
| CAST | 0.010 | 0.005 | |
| SUM | 0.013 | 0.008 | |
| WSS | 0.012 | 0.008 | |
| OV-RV | 0.048 | 0.054 | |
| SKAT-O-C | 0.037 | 0.056 | |
| SKAT-O | 0.041 | 0.051 | |
| SKAT-C | 0.038 | 0.050 | |
| SKAT | 0.038 | 0.041 | |
| CAST | 0.045 | 0.050 | |
| SUM | 0.053 | 0.057 | |
| WSS | 0.050 | 0.054 |
Estimated type I errors of the COL6A3 gene of the eight methods.
| Test | 20 rare variants | 40 rare variants | |
|---|---|---|---|
| OV-RV | 0.012 | 0.011 | |
| SKAT-O-C | 0.011 | 0.010 | |
| SKAT-O | 0.007 | 0.015 | |
| SKAT-C | 0.012 | 0.008 | |
| SKAT | 0.012 | 0.012 | |
| CAST | 0.011 | 0.006 | |
| SUM | 0.013 | 0.016 | |
| WSS | 0.011 | 0.011 | |
| OV-RV | 0.050 | 0.051 | |
| SKAT-O-C | 0.043 | 0.049 | |
| SKAT-O | 0.053 | 0.044 | |
| SKAT-C | 0.044 | 0.051 | |
| SKAT | 0.052 | 0.047 | |
| CAST | 0.037 | 0.028 | |
| SUM | 0.049 | 0.038 | |
| WSS | 0.053 | 0.044 |
Estimated power results of the TG gene of the six methods.
| Number of rare variants | Test | |||||
|---|---|---|---|---|---|---|
| 20 (12 causal) | OV-RV | 0.859 | 0.590 | 0.266 | 0.044 | |
| SKAT-O-C | 0.687 | 0.400 | 0.125 | 0.016 | ||
| SKAT-O | 0.208 | 0.108 | 0.029 | 0.003 | ||
| CAST | 0.554 | 0.340 | 0.135 | 0.025 | ||
| SUM | 0.335 | 0.176 | 0.057 | 0.006 | ||
| WSS | 0.340 | 0.175 | 0.063 | 0.003 | ||
| 40 (18 causal) | OV-RV | 0.933 | 0.742 | 0.345 | 0.096 | |
| SKAT-O-C | 0.878 | 0.577 | 0.215 | 0.040 | ||
| SKAT-O | 0.405 | 0.230 | 0.068 | 0.019 | ||
| CAST | 0.605 | 0.368 | 0.144 | 0.047 | ||
| SUM | 0.653 | 0.406 | 0.171 | 0.048 | ||
| WSS | 0.480 | 0.279 | 0.110 | 0.041 | ||
| 20 (12 causal) | OV-RV | 0.962 | 0.808 | 0.498 | 0.148 | |
| SKAT-O-C | 0.903 | 0.677 | 0.336 | 0.083 | ||
| SKAT-O | 0.545 | 0.333 | 0.139 | 0.038 | ||
| CAST | 0.780 | 0.512 | 0.250 | 0.078 | ||
| SUM | 0.700 | 0.489 | 0.223 | 0.065 | ||
| WSS | 0.719 | 0.505 | 0.246 | 0.080 | ||
| 40 (18 causal) | OV-RV | 0.984 | 0.888 | 0.596 | 0.213 | |
| SKAT-O-C | 0.971 | 0.835 | 0.486 | 0.156 | ||
| SKAT-O | 0.767 | 0.522 | 0.264 | 0.090 | ||
| CAST | 0.832 | 0.622 | 0.340 | 0.125 | ||
| SUM | 0.852 | 0.650 | 0.348 | 0.127 | ||
| WSS | 0.811 | 0.617 | 0.338 | 0.125 |
Estimated power results of the COL6A3 gene of the six methods.
| Number of rare variants | Test | |||||
|---|---|---|---|---|---|---|
| 20 (12 causal) | OV-RV | 0.805 | 0.543 | 0.235 | 0.044 | |
| SKAT-O-C | 0.644 | 0.352 | 0.108 | 0.014 | ||
| SKAT-O | 0.177 | 0.093 | 0.028 | 0.008 | ||
| CAST | 0.429 | 0.270 | 0.107 | 0.035 | ||
| SUM | 0.425 | 0.263 | 0.099 | 0.033 | ||
| WSS | 0.249 | 0.135 | 0.053 | 0.015 | ||
| 40 (18 causal) | OV-RV | 0.906 | 0.670 | 0.320 | 0.061 | |
| SKAT-O-C | 0.819 | 0.519 | 0.180 | 0.024 | ||
| SKAT-O | 0.320 | 0.155 | 0.048 | 0.009 | ||
| CAST | 0.577 | 0.316 | 0.151 | 0.040 | ||
| SUM | 0.515 | 0.273 | 0.117 | 0.025 | ||
| WSS | 0.380 | 0.192 | 0.074 | 0.023 | ||
| 20 (12 causal) | OV-RV | 0.944 | 0.770 | 0.445 | 0.165 | |
| SKAT-O-C | 0.893 | 0.669 | 0.329 | 0.105 | ||
| SKAT-O | 0.525 | 0.348 | 0.186 | 0.066 | ||
| CAST | 0.627 | 0.419 | 0.220 | 0.083 | ||
| SUM | 0.625 | 0.403 | 0.209 | 0.070 | ||
| WSS | 0.628 | 0.426 | 0.229 | 0.087 | ||
| 40 (18 causal) | OV-RV | 0.976 | 0.862 | 0.578 | 0.176 | |
| SKAT-O-C | 0.954 | 0.807 | 0.459 | 0.112 | ||
| SKAT-O | 0.670 | 0.425 | 0.199 | 0.060 | ||
| CAST | 0.715 | 0.488 | 0.245 | 0.070 | ||
| SUM | 0.780 | 0.575 | 0.302 | 0.092 | ||
| WSS | 0.736 | 0.533 | 0.296 | 0.103 |
The Genetic Analysis Workshop 19 (GAW19) data shown as a list of genes associated with diastolic blood pressure.
| Chromosome | Gene | Method | ||||
|---|---|---|---|---|---|---|
| OV-RV | SKAT-O-C | SKAT-O | SKAT-C | SKAT | ||
| 5 | 0.003 | 0.069 | 0.289 | 0.088 | 0.558 | |
| 5 | 0.037 | 0.214 | 0.266 | 0.340 | 0.160 | |
| 11 | 0.044 | 0.048 | 0.401 | 0.048 | 0.359 | |