| Literature DB >> 24167573 |
Wei-Gao Mao1, Hai-Qiang He, Yan Xu, Ping-Yan Chen, Ji-Yuan Zhou.
Abstract
Recently, there have been many case-control studies proposed to test for association between haplotypes and disease, which require the Hardy-Weinberg equilibrium (HWE) assumption of haplotype frequencies. As such, haplotype inference of unphased genotypes and development of haplotype-based HWE tests are crucial prior to fine mapping. The goodness-of-fit test is a frequently-used method to test for HWE for multiple tightly-linked loci. However, its degrees of freedom dramatically increase with the increase of the number of loci, which may lack the test power. Therefore, in this paper, to improve the test power for haplotype-based HWE, we first write out two likelihood functions of the observed data based on the Niu's model (NM) and inbreeding model (IM), respectively, which can cause the departure from HWE. Then, we use two expectation-maximization algorithms and one expectation-conditional-maximization algorithm to estimate the model parameters under the HWE, IM and NM models, respectively. Finally, we propose the likelihood ratio tests LRT[Formula: see text] and LRT[Formula: see text] for haplotype-based HWE under the NM and IM models, respectively. We simulate the HWE, Niu's, inbreeding and population stratification models to assess the validity and compare the performance of these two LRT tests. The simulation results show that both of the tests control the type I error rates well in testing for haplotype-based HWE. If the NM model is true, then LRT[Formula: see text] is more powerful. While, if the true model is the IM model, then LRT[Formula: see text] has better performance in power. Under the population stratification model, LRT[Formula: see text] is still more powerful. To this end, LRT[Formula: see text] is generally recommended. Application of the proposed methods to a rheumatoid arthritis data set further illustrates their utility for real data analysis.Entities:
Mesh:
Year: 2013 PMID: 24167573 PMCID: PMC3805574 DOI: 10.1371/journal.pone.0077399
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Haplotype distribution for Niu's model and inbreeding model.
| SNP | Frequency |
| 122 | 0.082 |
| 221 | 0.525 |
| 121 | 0.283 |
| 211 | 0.004 |
| 111 | 0.106 |
Haplotype distribution for population stratification model.
| SNP | Frequency | |
| I | II | |
| 122 | 0.082 | 0.030 |
| 212 | 0.000 | 0.170 |
| 112 | 0.000 | 0.050 |
| 221 | 0.525 | 0.470 |
| 121 | 0.283 | 0.100 |
| 211 | 0.004 | 0.150 |
| 111 | 0.106 | 0.030 |
Mean and standard deviation (SD) of and estimates, mean of sum of absolute differences (SAD) of haplotype frequency estimates for EM, ECM and IEM algorithms, simulated size and powers of two HWE tests for different values of and , under Niu's model.
|
|
|
|
| SAD | Size/Power | |||||
| Mean | SD | Mean | SD | EM | ECM | IEM | LRT | LRT | ||
| 500 | 1.00 | 1.001 | 0.107 | 0.009 | 0.016 | 0.041 | 0.043 | 0.041 | 0.054 | 0.029 |
| 1.05 | 1.055 | 0.115 | 0.016 | 0.020 | 0.041 | 0.043 | 0.041 | 0.072 | 0.066 | |
| 1.10 | 1.099 | 0.118 | 0.022 | 0.022 | 0.043 | 0.043 | 0.043 | 0.135 | 0.107 | |
| 1.15 | 1.148 | 0.123 | 0.030 | 0.024 | 0.048 | 0.045 | 0.048 | 0.240 | 0.174 | |
| 1.20 | 1.198 | 0.124 | 0.039 | 0.027 | 0.050 | 0.043 | 0.050 | 0.368 | 0.274 | |
| 1.25 | 1.241 | 0.132 | 0.048 | 0.028 | 0.053 | 0.043 | 0.053 | 0.528 | 0.397 | |
| 1.30 | 1.299 | 0.138 | 0.060 | 0.030 | 0.058 | 0.044 | 0.058 | 0.672 | 0.543 | |
| 1.35 | 1.346 | 0.136 | 0.070 | 0.030 | 0.061 | 0.043 | 0.060 | 0.799 | 0.678 | |
| 1.40 | 1.396 | 0.144 | 0.079 | 0.031 | 0.068 | 0.043 | 0.067 | 0.880 | 0.776 | |
| 1.45 | 1.454 | 0.150 | 0.090 | 0.031 | 0.071 | 0.043 | 0.070 | 0.933 | 0.860 | |
| 1.50 | 1.497 | 0.158 | 0.099 | 0.033 | 0.075 | 0.042 | 0.073 | 0.966 | 0.899 | |
| 1000 | 1.00 | 1.001 | 0.076 | 0.007 | 0.010 | 0.029 | 0.031 | 0.029 | 0.050 | 0.019 |
| 1.05 | 1.047 | 0.080 | 0.013 | 0.014 | 0.030 | 0.031 | 0.030 | 0.097 | 0.073 | |
| 1.10 | 1.096 | 0.082 | 0.020 | 0.017 | 0.033 | 0.031 | 0.033 | 0.229 | 0.157 | |
| 1.15 | 1.148 | 0.085 | 0.030 | 0.019 | 0.036 | 0.030 | 0.036 | 0.452 | 0.329 | |
| 1.20 | 1.201 | 0.089 | 0.040 | 0.020 | 0.041 | 0.031 | 0.040 | 0.673 | 0.532 | |
| 1.25 | 1.247 | 0.091 | 0.049 | 0.021 | 0.045 | 0.030 | 0.044 | 0.839 | 0.695 | |
| 1.30 | 1.296 | 0.094 | 0.059 | 0.021 | 0.050 | 0.030 | 0.049 | 0.933 | 0.838 | |
| 1.35 | 1.352 | 0.101 | 0.071 | 0.022 | 0.055 | 0.030 | 0.054 | 0.975 | 0.920 | |
| 1.40 | 1.398 | 0.106 | 0.080 | 0.023 | 0.060 | 0.031 | 0.058 | 0.985 | 0.961 | |
| 1.45 | 1.444 | 0.104 | 0.090 | 0.022 | 0.066 | 0.031 | 0.064 | 0.996 | 0.987 | |
| 1.50 | 1.504 | 0.111 | 0.101 | 0.023 | 0.070 | 0.030 | 0.068 | 1.000 | 0.999 | |
| 1500 | 1.00 | 1.001 | 0.060 | 0.005 | 0.008 | 0.024 | 0.026 | 0.024 | 0.039 | 0.023 |
| 1.05 | 1.047 | 0.063 | 0.011 | 0.012 | 0.025 | 0.025 | 0.025 | 0.111 | 0.082 | |
| 1.10 | 1.102 | 0.070 | 0.021 | 0.015 | 0.028 | 0.025 | 0.028 | 0.347 | 0.267 | |
| 1.15 | 1.148 | 0.072 | 0.029 | 0.016 | 0.031 | 0.025 | 0.031 | 0.611 | 0.464 | |
| 1.20 | 1.199 | 0.073 | 0.040 | 0.017 | 0.036 | 0.025 | 0.036 | 0.841 | 0.690 | |
| 1.25 | 1.249 | 0.073 | 0.050 | 0.017 | 0.042 | 0.025 | 0.041 | 0.958 | 0.865 | |
| 1.30 | 1.302 | 0.080 | 0.061 | 0.018 | 0.047 | 0.025 | 0.047 | 0.990 | 0.950 | |
| 1.35 | 1.349 | 0.081 | 0.070 | 0.018 | 0.053 | 0.025 | 0.051 | 0.999 | 0.984 | |
| 1.40 | 1.403 | 0.083 | 0.081 | 0.018 | 0.058 | 0.025 | 0.056 | 1.000 | 0.994 | |
| 1.45 | 1.449 | 0.087 | 0.091 | 0.018 | 0.064 | 0.025 | 0.062 | 1.000 | 0.998 | |
| 1.50 | 1.499 | 0.086 | 0.100 | 0.018 | 0.068 | 0.025 | 0.066 | 1.000 | 1.000 | |
Mean and standard deviation (SD) of and estimates, mean of sum of absolute differences (SAD) of haplotype frequency estimates for EM, ECM and IEM algorithms, simulated size and powers of two HWE tests for different values of and , under inbreeding model.
|
|
|
|
| SAD | Size/Power | |||||
| Mean | SD | Mean | SD | EM | ECM | IEM | LRT | LRT | ||
| 500 | 0.00 | 1.001 | 0.111 | 0.010 | 0.015 | 0.041 | 0.044 | 0.041 | 0.060 | 0.020 |
| 0.01 | 1.035 | 0.108 | 0.015 | 0.019 | 0.041 | 0.044 | 0.041 | 0.048 | 0.060 | |
| 0.02 | 1.077 | 0.120 | 0.024 | 0.023 | 0.041 | 0.044 | 0.041 | 0.114 | 0.137 | |
| 0.03 | 1.109 | 0.123 | 0.030 | 0.025 | 0.042 | 0.047 | 0.042 | 0.170 | 0.218 | |
| 0.04 | 1.157 | 0.124 | 0.041 | 0.027 | 0.041 | 0.048 | 0.041 | 0.268 | 0.345 | |
| 0.05 | 1.185 | 0.124 | 0.049 | 0.028 | 0.042 | 0.049 | 0.042 | 0.357 | 0.468 | |
| 0.06 | 1.241 | 0.131 | 0.061 | 0.028 | 0.041 | 0.051 | 0.041 | 0.528 | 0.629 | |
| 0.07 | 1.279 | 0.141 | 0.070 | 0.030 | 0.042 | 0.054 | 0.042 | 0.638 | 0.735 | |
| 0.08 | 1.316 | 0.144 | 0.079 | 0.031 | 0.043 | 0.057 | 0.043 | 0.718 | 0.811 | |
| 0.09 | 1.366 | 0.140 | 0.090 | 0.029 | 0.044 | 0.061 | 0.043 | 0.851 | 0.906 | |
| 0.10 | 1.409 | 0.156 | 0.099 | 0.032 | 0.043 | 0.064 | 0.043 | 0.889 | 0.940 | |
| 1000 | 0.00 | 1.000 | 0.075 | 0.007 | 0.010 | 0.029 | 0.031 | 0.029 | 0.056 | 0.026 |
| 0.01 | 1.032 | 0.076 | 0.012 | 0.014 | 0.029 | 0.030 | 0.029 | 0.053 | 0.077 | |
| 0.02 | 1.073 | 0.078 | 0.021 | 0.016 | 0.029 | 0.032 | 0.029 | 0.142 | 0.198 | |
| 0.03 | 1.115 | 0.084 | 0.030 | 0.018 | 0.029 | 0.034 | 0.029 | 0.294 | 0.394 | |
| 0.04 | 1.151 | 0.086 | 0.040 | 0.019 | 0.030 | 0.037 | 0.030 | 0.477 | 0.606 | |
| 0.05 | 1.188 | 0.088 | 0.049 | 0.020 | 0.030 | 0.039 | 0.030 | 0.643 | 0.762 | |
| 0.06 | 1.229 | 0.093 | 0.059 | 0.020 | 0.031 | 0.042 | 0.031 | 0.775 | 0.875 | |
| 0.07 | 1.276 | 0.097 | 0.070 | 0.021 | 0.029 | 0.045 | 0.029 | 0.895 | 0.957 | |
| 0.08 | 1.316 | 0.096 | 0.079 | 0.020 | 0.030 | 0.048 | 0.030 | 0.959 | 0.988 | |
| 0.09 | 1.357 | 0.102 | 0.089 | 0.021 | 0.030 | 0.052 | 0.030 | 0.973 | 0.995 | |
| 0.10 | 1.411 | 0.102 | 0.100 | 0.021 | 0.031 | 0.057 | 0.031 | 0.996 | 1.000 | |
| 1500 | 0.00 | 1.001 | 0.064 | 0.006 | 0.009 | 0.024 | 0.025 | 0.024 | 0.053 | 0.028 |
| 0.01 | 1.038 | 0.062 | 0.012 | 0.012 | 0.024 | 0.026 | 0.024 | 0.079 | 0.096 | |
| 0.02 | 1.073 | 0.066 | 0.020 | 0.014 | 0.024 | 0.027 | 0.024 | 0.210 | 0.272 | |
| 0.03 | 1.111 | 0.072 | 0.030 | 0.016 | 0.024 | 0.029 | 0.024 | 0.392 | 0.534 | |
| 0.04 | 1.150 | 0.070 | 0.040 | 0.015 | 0.024 | 0.032 | 0.024 | 0.610 | 0.763 | |
| 0.05 | 1.191 | 0.071 | 0.050 | 0.016 | 0.024 | 0.035 | 0.024 | 0.794 | 0.902 | |
| 0.06 | 1.231 | 0.075 | 0.059 | 0.017 | 0.025 | 0.038 | 0.025 | 0.916 | 0.970 | |
| 0.07 | 1.276 | 0.082 | 0.070 | 0.017 | 0.024 | 0.042 | 0.024 | 0.975 | 0.992 | |
| 0.08 | 1.318 | 0.081 | 0.080 | 0.017 | 0.025 | 0.047 | 0.025 | 0.995 | 0.997 | |
| 0.09 | 1.365 | 0.081 | 0.090 | 0.017 | 0.025 | 0.050 | 0.025 | 0.999 | 0.999 | |
| 0.10 | 1.408 | 0.086 | 0.100 | 0.018 | 0.025 | 0.054 | 0.025 | 1.000 | 1.000 | |
Mean and standard deviation (SD) of and estimates, mean of sum of absolute differences (SAD) of haplotype frequency estimates for EM, ECM and IEM algorithms, power comparison of two HWE tests under population stratification model, with the proportion of subpopulation I being taken as 0.6 and 0.8, and the sample size being fixed at 500, 1000 and 1500.
|
|
|
|
| SAD | Power | |||||
| Mean | SD | Mean | SD | EM | ECM | IEM | LRT | LRT | ||
| 500 | 0.6 | 1.224 | 0.144 | 0.044 | 0.022 | 0.073 | 0.094 | 0.072 | 0.396 | 0.572 |
| 0.8 | 1.138 | 0.131 | 0.041 | 0.023 | 0.061 | 0.072 | 0.060 | 0.197 | 0.541 | |
| 1000 | 0.6 | 1.223 | 0.103 | 0.045 | 0.016 | 0.061 | 0.086 | 0.061 | 0.674 | 0.874 |
| 0.8 | 1.138 | 0.092 | 0.041 | 0.016 | 0.047 | 0.060 | 0.046 | 0.350 | 0.828 | |
| 1500 | 0.6 | 1.226 | 0.083 | 0.045 | 0.013 | 0.057 | 0.084 | 0.056 | 0.850 | 0.962 |
| 0.8 | 1.133 | 0.076 | 0.041 | 0.013 | 0.042 | 0.056 | 0.041 | 0.463 | 0.946 | |
Results of application to North American Rheumatoid Arthritis Consortium data set.
| Haplotype | N. of | ||||||
| Chr. | block | SNPs | SNP names |
|
| P-value | |
| LRT | LRT | ||||||
| 2 | 3 | 2 | rs1686430, rs1734449 | 1.349 | 0.137 | 0.0107 | 0.0089 |
| 2 | 9 | 2 | rs1866209, rs1438048 | 1.563 | 0.150 | 0.0051 | 0.0057 |
| 3 | 1 | 2 | rs1516337, rs1516350 | 1.287 | 0.086 | 0.0204 | 0.0213 |
| 5 | 6 | 2 | rs244903, rs244896 | 1.052 | 0.067 | 0.6341 | 0.0311 |
| 6 | 7 | 2 | rs1565528, rs1491074 | 1.284 | 0.074 | 0.0231 | 0.0505 |
| 7 | 16 | 2 | rs1182378, rs1182414 | 1.137 | 0.071 | 0.2526 | 0.0440 |
| 10 | 3 | 2 | rs1979720, rs1494201 | 1.302 | 0.082 | 0.0189 | 0.0313 |
| 13 | 2 | 2 | rs436227, rs390704 | 1.350 | 0.096 | 0.0085 | 0.0173 |
| 14 | 3 | 2 | rs1381641, rs1020897 | 1.264 | 0.077 | 0.0326 | 0.0704 |
| 15 | 7 | 6 | rs347117, rs383902, | 1.547 | 0.099 | 2.36 | 0.0428 |
| rs395601, rs387812, | |||||||
| rs347115, rs610877 | |||||||
| 16 | 5 | 2 | rs179209, rs179219 | 1.215 | 0.086 | 0.0708 | 0.0388 |
| 18 | 6 | 2 | rs1787190, rs1981 | 1.330 | 0.139 | 0.0070 | 0.0076 |
| 21 | 3 | 2 | rs1892687, rs2051179 | 1.372 | 0.146 | 0.0027 | 0.0048 |
Chr., chromosome; SNP, single nucleotide polymorphism; N. of SNPs, number of SNPs.
Figure 1Haplotype LD display for the seventh haplotype block on chromosome 15.
The red box denotes that the LOD value between any two loci is larger than or equal to 2.0. The numbers in the red boxes are the corresponding values of and the empty box denotes that .