| Literature DB >> 24358113 |
Yakov A Tsepilov1, Janina S Ried2, Konstantin Strauch3, Harald Grallert4, Cornelia M van Duijn5, Tatiana I Axenovich1, Yurii S Aulchenko6.
Abstract
Genome-wide association studies (GWAS) comprise a powerful tool for mapping genes of complex traits. However, an inflation of the test statistic can occur because of population substructure or cryptic relatedness, which could cause spurious associations. If information on a large number of genetic markers is available, adjusting the analysis results by using the method of genomic control (GC) is possible. GC was originally proposed to correct the Cochran-Armitage additive trend test. For non-additive models, correction has been shown to depend on allele frequencies. Therefore, usage of GC is limited to situations where allele frequencies of null markers and candidate markers are matched. In this work, we extended the capabilities of the GC method for non-additive models, which allows us to use null markers with arbitrary allele frequencies for GC. Analytical expressions for the inflation of a test statistic describing its dependency on allele frequency and several population parameters were obtained for recessive, dominant, and over-dominant models of inheritance. We proposed a method to estimate these required population parameters. Furthermore, we suggested a GC method based on approximation of the correction coefficient by a polynomial of allele frequency and described procedures to correct the genotypic (two degrees of freedom) test for cases when the model of inheritance is unknown. Statistical properties of the described methods were investigated using simulated and real data. We demonstrated that all considered methods were effective in controlling type 1 error in the presence of genetic substructure. The proposed GC methods can be applied to statistical tests for GWAS with various models of inheritance. All methods developed and tested in this work were implemented using R language as a part of the GenABEL package.Entities:
Mesh:
Year: 2013 PMID: 24358113 PMCID: PMC3864791 DOI: 10.1371/journal.pone.0081431
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Dependence of VIF function on allele frequency p and model parameter x (F = 0.05; N = 1,000; K = 11,000).
(A) p: {0,1}, x: {−1,2}; (B) p: {0,1}, x = 0 (R, recessive), x = 1 (D, dominant), x = 1/2 (A, additive), x = 100 (O, over-dominant).
Type 1 error for one degree of freedom tests.
| Not corrected | Constant corrected | VIFGC corrected | PGC corrected | ||||||||||
| Model | Frequency |
|
|
|
|
|
|
|
|
|
|
|
|
|
| all | 1.301 | 1.305 | 0.086 | 1.000 | 1.003 | 0.051 | 1.000 | 1.000 | 0.050 | 0.999 | 0.999 | 0.050 |
| [0.05,0.25) | 1.175 | 1.170 | 0.069 | 0.905 | 0.900 | 0.038 | 0.990 | 0.983 | 0.048 | 1.004 | 0.998 | 0.049 | |
| [0.25,0.4) | 1.245 | 1.245 | 0.079 | 0.957 | 0.957 | 0.045 | 0.995 | 0.995 | 0.049 | 1.000 | 1.000 | 0.050 | |
| [0.4,0.6) | 1.320 | 1.322 | 0.088 | 1.014 | 1.015 | 0.052 | 1.002 | 1.004 | 0.051 | 0.998 | 0.999 | 0.050 | |
| [0.6,0.75) | 1.377 | 1.381 | 0.095 | 1.057 | 1.060 | 0.057 | 1.006 | 1.009 | 0.051 | 0.996 | 0.999 | 0.050 | |
| [0.75,0.95] | 1.412 | 1.416 | 0.100 | 1.084 | 1.087 | 0.060 | 1.007 | 1.010 | 0.051 | 0.997 | 1.000 | 0.050 | |
|
| all | 1.453 | 1.458 | 0.104 | 1.000 | 1.003 | 0.051 | 0.997 | 1.000 | 0.050 | 0.991 | 1.034 | 0.050 |
| [0.05,0.25) | 1.451 | 1.455 | 0.104 | 0.998 | 1.001 | 0.050 | 0.995 | 0.998 | 0.050 | 0.991 | 1.033 | 0.050 | |
| [0.25,0.4) | 1.455 | 1.460 | 0.105 | 1.001 | 1.005 | 0.051 | 0.998 | 1.002 | 0.050 | 0.991 | 1.035 | 0.050 | |
| [0.4,0.6) | 1.456 | 1.461 | 0.105 | 1.002 | 1.006 | 0.051 | 0.999 | 1.002 | 0.051 | 0.990 | 1.035 | 0.050 | |
| [0.6,0.75) | 1.454 | 1.458 | 0.104 | 1.000 | 1.003 | 0.051 | 0.997 | 1.000 | 0.050 | 0.990 | 1.034 | 0.050 | |
| [0.75,0.95] | 1.452 | 1.456 | 0.104 | 0.999 | 1.002 | 0.051 | 0.996 | 0.998 | 0.050 | 0.992 | 1.036 | 0.050 | |
|
| all | 1.302 | 1.306 | 0.086 | 1.000 | 1.003 | 0.051 | 0.999 | 1.000 | 0.050 | 0.999 | 1.000 | 0.050 |
| [0.05,0.25) | 1.413 | 1.416 | 0.099 | 1.084 | 1.086 | 0.060 | 1.007 | 1.009 | 0.051 | 0.997 | 0.999 | 0.050 | |
| [0.25,0.4) | 1.379 | 1.383 | 0.095 | 1.058 | 1.061 | 0.057 | 1.007 | 1.010 | 0.051 | 0.997 | 1.000 | 0.050 | |
| [0.4,0.6) | 1.320 | 1.323 | 0.088 | 1.013 | 1.016 | 0.052 | 1.002 | 1.004 | 0.051 | 0.998 | 1.001 | 0.050 | |
| [0.6,0.75) | 1.244 | 1.245 | 0.079 | 0.956 | 0.956 | 0.045 | 0.993 | 0.993 | 0.049 | 1.000 | 1.000 | 0.050 | |
| [0.75,0.95] | 1.174 | 1.171 | 0.070 | 0.903 | 0.900 | 0.039 | 0.988 | 0.984 | 0.048 | 1.003 | 0.999 | 0.050 | |
|
| all | 1.176 | 1.181 | 0.072 | 1.000 | 1.004 | 0.051 | 0.999 | 1.000 | 0.050 | 0.999 | 1.000 | 0.050 |
| [0.05,0.25) | 1.281 | 1.282 | 0.083 | 1.088 | 1.089 | 0.061 | 1.007 | 1.008 | 0.051 | 0.996 | 0.997 | 0.050 | |
| [0.25,0.4) | 1.143 | 1.146 | 0.067 | 0.972 | 0.974 | 0.047 | 0.998 | 1.000 | 0.050 | 1.006 | 1.008 | 0.051 | |
| [0.4,0.6) | 1.060 | 1.058 | 0.057 | 0.902 | 0.901 | 0.039 | 0.987 | 0.985 | 0.048 | 0.991 | 0.990 | 0.049 | |
| [0.6,0.75) | 1.142 | 1.143 | 0.067 | 0.971 | 0.972 | 0.047 | 0.998 | 0.999 | 0.050 | 1.006 | 1.007 | 0.051 | |
| [0.75,0.95] | 1.279 | 1.282 | 0.083 | 1.086 | 1.089 | 0.060 | 1.007 | 1.008 | 0.051 | 0.996 | 0.997 | 0.050 | |
Type 1 error was estimated in three ways: λ, which is the ratio of observed distribution's median and expected median; λ, which is the regression coefficient between observed statistic's distribution and theoretically expected Chi-square statistic; and E, which is the proportion of the tests with p-value≤0.05. The values are given for all SNPs as well as for stratified frequency groups.
Type 1 error for two degrees of freedom tests.
| Not corrected | Constant corrected | df1 based (VIFGC corrected) | PGC corrected | |||||||||
| Frequency |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1.239 | 1.250 | 0.092 | 1.000 | 1.009 | 0.053 | 0.951 | 0.957 | 0.045 | 0.991 | 1.000 | 0.051 |
|
| 1.239 | 1.248 | 0.091 | 1.000 | 1.007 | 0.052 | 0.959 | 0.962 | 0.045 | 0.992 | 1.000 | 0.051 |
|
| 1.241 | 1.252 | 0.092 | 1.001 | 1.010 | 0.053 | 0.948 | 0.955 | 0.045 | 0.991 | 1.000 | 0.051 |
|
| 1.240 | 1.252 | 0.092 | 1.001 | 1.010 | 0.053 | 0.942 | 0.951 | 0.044 | 0.990 | 1.000 | 0.051 |
|
| 1.239 | 1.251 | 0.092 | 1.000 | 1.009 | 0.053 | 0.946 | 0.955 | 0.045 | 0.990 | 1.000 | 0.052 |
|
| 1.239 | 1.249 | 0.092 | 1.000 | 1.008 | 0.053 | 0.959 | 0.963 | 0.045 | 0.992 | 1.000 | 0.051 |
The abbreviations are as in Table 1. The values are given for all SNPs as well as for stratified frequency groups.
* 2df test based on 1df corrected tests (here, 1df tests were corrected by VIFGC) [3].
Power (% of test with p-value≤0.05) for different tests.
| Simulated model | Recessive | Additive | Dominant | Over-dominant | ||||||||||||||||
| Analised model | r | a | d | o | g | r | a | d | o | g | r | a | d | o | g | r | a | d | o | g |
|
| 0.87 | 0.71 | 0.26 | 0.44 | 0.78 | 0.60 | 0.78 | 0.64 | 0.42 | 0.67 | 0.20 | 0.74 | 0.84 | 0.39 | 0.78 | 0.37 | 0.32 | 0.40 | 0.83 | 0.76 |
|
| 0.79 | 0.59 | 0.15 | 0.43 | 0.64 | 0.48 | 0.67 | 0.58 | 0.38 | 0.62 | 0.15 | 0.63 | 0.80 | 0.35 | 0.72 | 0.31 | 0.26 | 0.33 | 0.78 | 0.60 |
|
| 0.80 | 0.59 | 0.16 | 0.41 | 0.62 | 0.50 | 0.66 | 0.55 | 0.38 | 0.58 | 0.15 | 0.63 | 0.80 | 0.35 | 0.68 | 0.30 | 0.26 | 0.32 | 0.77 | 0.57 |
|
| 0.81 | 0.58 | 0.16 | 0.41 | 0.63 | 0.50 | 0.67 | 0.56 | 0.38 | 0.62 | 0.15 | 0.64 | 0.80 | 0.36 | 0.72 | 0.30 | 0.26 | 0.32 | 0.77 | 0.59 |
* genotypic model for VIFGC corrected tests is a two degrees of freedom test based on recessive and dominant tests corrected by VIFGC [3].
r, a, d, o, and g are recessive, additive, dominant, over-dominant, and genotypic models, respectively.
Type 1 error in ERF data analysis.
| Not corrected | Constant corrected | VIFGC corrected | PGC corrected | ||||||||||
| Model | Frequency |
|
|
|
|
|
|
|
|
|
|
|
|
|
| all | 1.201 | 1.200 | 0.074 | 1.000 | 1.000 | 0.050 | 1.001 | 1.000 | 0.050 | 1.000 | 1.000 | 0.050 |
| [0.05,0.25) | 1.105 | 1.109 | 0.063 | 0.921 | 0.924 | 0.042 | 0.993 | 0.997 | 0.050 | 0.998 | 1.002 | 0.051 | |
| [0.25,0.5) | 1.188 | 1.180 | 0.072 | 0.989 | 0.983 | 0.048 | 1.005 | 0.999 | 0.050 | 0.998 | 0.992 | 0.050 | |
| [0.5,0.75) | 1.240 | 1.252 | 0.079 | 1.033 | 1.043 | 0.055 | 1.004 | 1.013 | 0.051 | 0.996 | 1.006 | 0.051 | |
| [0.75,0.95] | 1.273 | 1.261 | 0.081 | 1.061 | 1.051 | 0.056 | 1.001 | 0.991 | 0.049 | 1.008 | 0.999 | 0.050 | |
|
| all | 1.298 | 1.302 | 0.086 | 1.000 | 1.003 | 0.051 | 0.997 | 1.000 | 0.050 | 0.996 | 1.000 | 0.050 |
| [0.05,0.25) | 1.299 | 1.290 | 0.085 | 1.001 | 0.994 | 0.050 | 0.998 | 0.991 | 0.049 | 1.008 | 1.000 | 0.050 | |
| [0.25,0.5) | 1.298 | 1.313 | 0.088 | 1.001 | 1.012 | 0.052 | 0.997 | 1.008 | 0.052 | 0.986 | 0.997 | 0.050 | |
| [0.5,0.75) | 1.301 | 1.320 | 0.088 | 1.003 | 1.017 | 0.053 | 1.000 | 1.014 | 0.052 | 0.989 | 1.003 | 0.051 | |
| [0.75,0.95] | 1.292 | 1.286 | 0.084 | 0.995 | 0.991 | 0.049 | 0.992 | 0.988 | 0.049 | 1.003 | 0.999 | 0.050 | |
|
| all | 1.202 | 1.203 | 0.074 | 1.000 | 1.001 | 0.050 | 1.000 | 1.000 | 0.050 | 1.000 | 1.000 | 0.050 |
| [0.05,0.25) | 1.277 | 1.265 | 0.081 | 1.063 | 1.053 | 0.056 | 1.001 | 0.991 | 0.049 | 1.008 | 0.999 | 0.050 | |
| [0.25,0.5) | 1.243 | 1.252 | 0.079 | 1.035 | 1.042 | 0.054 | 1.003 | 1.011 | 0.051 | 0.997 | 1.005 | 0.051 | |
| [0.5,0.75) | 1.190 | 1.183 | 0.072 | 0.990 | 0.985 | 0.049 | 1.005 | 1.000 | 0.050 | 0.999 | 0.994 | 0.050 | |
| [0.75,0.95] | 1.104 | 1.112 | 0.064 | 0.919 | 0.925 | 0.042 | 0.991 | 0.998 | 0.050 | 0.995 | 1.002 | 0.051 | |
|
| all | 1.133 | 1.123 | 0.064 | 1.000 | 0.991 | 0.049 | 1.011 | 1.000 | 0.050 | 1.011 | 1.000 | 0.050 |
| [0.05,0.25) | 1.203 | 1.193 | 0.072 | 1.061 | 1.053 | 0.056 | 1.017 | 1.010 | 0.051 | 1.011 | 1.003 | 0.050 | |
| [0.25,0.5) | 1.076 | 1.060 | 0.057 | 0.950 | 0.935 | 0.043 | 1.011 | 0.995 | 0.049 | 1.013 | 0.998 | 0.049 | |
| [0.5,0.75) | 1.064 | 1.053 | 0.056 | 0.939 | 0.929 | 0.042 | 1.000 | 0.989 | 0.049 | 1.004 | 0.993 | 0.049 | |
| [0.75,0.95] | 1.204 | 1.190 | 0.072 | 1.062 | 1.050 | 0.056 | 1.017 | 1.007 | 0.051 | 1.014 | 1.004 | 0.051 | |
|
| all | 1.157 | 1.162 | 0.077 | 1.000 | 1.004 | 0.052 | 0.964 | 0.966 | 0.046 | 0.996 | 1.000 | 0.051 |
| [0.05,0.25) | 1.163 | 1.159 | 0.077 | 1.005 | 1.002 | 0.052 | 0.973 | 0.969 | 0.047 | 1.004 | 1.001 | 0.051 | |
| [0.25,0.5) | 1.152 | 1.165 | 0.077 | 0.996 | 1.006 | 0.052 | 0.954 | 0.964 | 0.045 | 0.988 | 0.999 | 0.051 | |
| [0.5,0.75) | 1.151 | 1.164 | 0.077 | 0.995 | 1.006 | 0.052 | 0.953 | 0.963 | 0.046 | 0.989 | 1.000 | 0.051 | |
| [0.75,0.95] | 1.163 | 1.159 | 0.077 | 1.005 | 1.001 | 0.052 | 0.973 | 0.970 | 0.047 | 1.004 | 1.000 | 0.052 | |
* for VIFGC corrected genotypic (2df) tests, we used the 1df based test by performing VIFGC-corrected tests for recessive and dominant models [3].
The abbreviations are as in Table 1. The values are given for all SNPs as well as for stratified frequency groups.
Type 1 error of KORA data tests.
| Not corrected | Constant corrected | VIFGC corrected | PGC corrected | ||||||||||
| Model | Frequency |
|
|
|
|
|
|
|
|
|
|
|
|
|
| all | 1.016 | 1.020 | 0.053 | 1.000 | 1.004 | 0.051 | 0.996 | 1.000 | 0.050 | 0.996 | 1.000 | 0.050 |
| [0.05,0.25) | 1.015 | 1.016 | 0.052 | 0.998 | 1.000 | 0.050 | 1.004 | 1.005 | 0.051 | 1.000 | 1.001 | 0.050 | |
| [0.25,0.5) | 1.014 | 1.023 | 0.053 | 0.998 | 1.006 | 0.051 | 0.996 | 1.005 | 0.051 | 0.992 | 1.001 | 0.050 | |
| [0.5,0.75) | 1.017 | 1.021 | 0.053 | 1.001 | 1.005 | 0.051 | 0.993 | 0.997 | 0.050 | 0.993 | 0.997 | 0.050 | |
| [0.75,0.95] | 1.020 | 1.020 | 0.053 | 1.004 | 1.004 | 0.051 | 0.993 | 0.993 | 0.050 | 1.000 | 1.001 | 0.051 | |
|
| all | 1.019 | 1.024 | 0.053 | 1.000 | 1.005 | 0.051 | 0.995 | 1.000 | 0.050 | 0.995 | 1.000 | 0.050 |
| [0.05,0.25) | 1.024 | 1.030 | 0.053 | 1.005 | 1.011 | 0.051 | 1.000 | 1.006 | 0.051 | 0.997 | 1.003 | 0.050 | |
| [0.25,0.5) | 1.020 | 1.024 | 0.053 | 1.001 | 1.004 | 0.051 | 0.996 | 0.999 | 0.050 | 0.994 | 0.998 | 0.050 | |
| [0.5,0.75) | 1.017 | 1.019 | 0.052 | 0.998 | 1.000 | 0.050 | 0.993 | 0.995 | 0.050 | 0.994 | 0.996 | 0.050 | |
| [0.75,0.95] | 1.016 | 1.024 | 0.053 | 0.997 | 1.005 | 0.051 | 0.992 | 1.000 | 0.050 | 0.995 | 1.003 | 0.051 | |
|
| all | 1.021 | 1.025 | 0.053 | 1.000 | 1.004 | 0.051 | 0.996 | 1.000 | 0.050 | 0.996 | 1.000 | 0.050 |
| [0.05,0.25) | 1.024 | 1.026 | 0.053 | 1.002 | 1.005 | 0.051 | 0.989 | 0.992 | 0.049 | 0.999 | 1.002 | 0.050 | |
| [0.25,0.5) | 1.025 | 1.028 | 0.054 | 1.004 | 1.007 | 0.051 | 0.995 | 0.999 | 0.050 | 0.994 | 0.998 | 0.050 | |
| [0.5,0.75) | 1.015 | 1.026 | 0.053 | 0.994 | 1.005 | 0.051 | 0.992 | 1.003 | 0.050 | 0.987 | 0.997 | 0.050 | |
| [0.75,0.95] | 1.022 | 1.021 | 0.053 | 1.000 | 1.000 | 0.050 | 1.007 | 1.007 | 0.051 | 1.003 | 1.002 | 0.050 | |
|
| all | 1.027 | 1.027 | 0.053 | 1.000 | 1.000 | 0.050 | 1.000 | 1.000 | 0.050 | 1.000 | 1.000 | 0.050 |
| [0.05,0.25) | 1.027 | 1.027 | 0.054 | 1.000 | 1.001 | 0.051 | 0.987 | 0.988 | 0.049 | 0.999 | 1.000 | 0.050 | |
| [0.25,0.5) | 1.029 | 1.027 | 0.053 | 1.003 | 1.000 | 0.050 | 1.016 | 1.013 | 0.051 | 1.003 | 1.001 | 0.050 | |
| [0.5,0.75) | 1.028 | 1.025 | 0.053 | 1.002 | 0.998 | 0.050 | 1.014 | 1.011 | 0.051 | 1.002 | 0.999 | 0.050 | |
| [0.75,0.95] | 1.021 | 1.028 | 0.054 | 0.995 | 1.001 | 0.051 | 0.982 | 0.988 | 0.049 | 0.994 | 1.000 | 0.051 | |
|
| all | 1.021 | 1.024 | 0.054 | 1.000 | 1.003 | 0.051 | 0.999 | 1.002 | 0.051 | 0.997 | 1.000 | 0.051 |
| [0.05,0.25) | 1.021 | 1.022 | 0.054 | 1.000 | 1.001 | 0.051 | 0.997 | 0.998 | 0.050 | 1.000 | 1.001 | 0.051 | |
| [0.25,0.5) | 1.022 | 1.026 | 0.055 | 1.001 | 1.005 | 0.051 | 0.997 | 1.001 | 0.051 | 0.996 | 1.000 | 0.051 | |
| [0.5,0.75) | 1.020 | 1.024 | 0.054 | 0.999 | 1.003 | 0.051 | 0.996 | 0.999 | 0.050 | 0.994 | 0.998 | 0.050 | |
| [0.75,0.95] | 1.021 | 1.025 | 0.055 | 1.000 | 1.004 | 0.052 | 1.006 | 1.009 | 0.052 | 0.997 | 1.001 | 0.051 | |
* for VIFGC corrected genotypic (2df) tests, we used the 1df based test by performing VIFGC-corrected tests for recessive and dominant models [3].
The abbreviations are as in Table 1. The values are given for all SNPs as well as for stratified frequency groups.