| Literature DB >> 30789927 |
Stefan Wellek1,2, Andreas Ziegler3,4,5.
Abstract
The problem of checking the genotype distribution obtained for some diallelic marker for compatibility with the Hardy-Weinberg equilibrium (HWE) condition arises also for loci on the X chromosome. The possible genotypes depend on the sex of the individual in this case: for females, the genotype distribution is trinomial, as in the case of an autosomal locus, whereas a binomial proportion is observed for males. Like in genetic association studies with autosomal SNPs, interest is typically in establishing approximate compatibility of the observed genotype frequencies with HWE. This requires to replace traditional methods tailored for detecting lack of fit to the model with an equivalence testing procedure to be derived by treating approximate compatibility with the model as the alternative hypothesis. The test constructed here is based on an upper confidence bound and a simple to interpret combined measure of distance between true and HWE conforming genotype distributions in female and male subjects. A particular focus of the paper is on the derivation of the asymptotic distribution of the test statistic under null alternatives which is not of the usual Gaussian form. A closed sample size formula is also provided and shown to behave satisfactorily in terms of the approximation error.Entities:
Mesh:
Year: 2019 PMID: 30789927 PMCID: PMC6383894 DOI: 10.1371/journal.pone.0212344
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1De Finetti diagram of the boundary curves of the equivalence region specified under the alternative hypotheses of (1).
Ragged lines: critical region of the test to be generalized for X-chromosomal SNPs. [δ° = 0.96, α = 0.05, n = 200].
Exact rejection probabilities of the goodness-of-fit test with critical region (19) at the common boundary of the hypotheses (17).
[Nominal significance level α = 0.05; equivalence margin ].
|
|
|
| Rej. Prob. | |||||
|---|---|---|---|---|---|---|---|---|
| 0.25 | 0.57897 | 0.53949 | 0.45557 | 0.33647 | 0.33647 | 100 | 100 | 0.01103 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.03505 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.03474 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.03966 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.03972 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.04164 |
| 0.25 | 0.41402 | 0.45701 | 0.37546 | −0.33647 | 0.33647 | 100 | 100 | 0.01381 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.03944 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.03906 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.04352 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.04316 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.04459 |
| 0.25 | 0.57897 | 0.53949 | 0.62123 | 0.33647 | −0.33647 | 100 | 100 | 0.01167 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.03475 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.03432 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.03963 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.03955 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.04154 |
| 0.25 | 0.41402 | 0.45701 | 0.54093 | −0.33647 | −0.33647 | 100 | 100 | 0.01289 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.03948 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.03944 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.04356 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.04326 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.04467 |
| 0.25 | 0.51212 | 0.50606 | 0.38958 | 0.04879 | 0.47334 | 100 | 100 | 0.01634 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.04023 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.03830 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.04380 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.04323 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.04452 |
| 0.25 | 0.60637 | 0.55319 | 0.53475 | 0.47000 | 0.07432 | 100 | 100 | 0.00912 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.03175 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.03538 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.03145 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.03748 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.03989 |
| 0.09 | 0.52274 | 0.35137 | 0.27899 | 0.33647 | 0.33647 | 100 | 100 | 0.00702 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.03240 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.03282 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.03705 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.03785 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.04012 |
| 0.09 | 0.32718 | 0.25359 | 0.19529 | −0.33647 | 0.33647 | 100 | 100 | 0.00041 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.03403 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.03509 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.03873 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.04017 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.04235 |
†) = π1 + π2/2 [≡ allele frequency among females]
Exact rejection probabilities of the goodness-of-fit test with critical region (19) at additional points on the common boundary of the hypotheses (17).
[Nominal significance level α = 0.05; equivalence margin ].
|
|
|
| Rej. Prob. | |||||
|---|---|---|---|---|---|---|---|---|
| 0.09 | 0.52274 | 0.35137 | 0.43130 | 0.33647 | −0.33647 | 100 | 100 | 0.00941 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.03367 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.03414 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.03765 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.03863 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.04079 |
| 0.09 | 0.32718 | 0.25359 | 0.32233 | −0.33647 | −0.33647 | 100 | 100 | 0.00325 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.03639 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.03746 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.03976 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.04140 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.04326 |
| 0.09 | 0.43445 | 0.30723 | 0.21645 | 0.04879 | 0.47334 | 100 | 100 | 0.00347 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.03657 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.03401 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.04222 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.04137 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.04012 |
| 0.09 | 0.56438 | 0.37219 | 0.35500 | 0.47000 | 0.07432 | 100 | 100 | 0.00762 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.03177 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.03555 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.03152 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.03759 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.04003 |
| 0.00564 | 0.18874 | 0.10001 | 0.07354 | 0.33647 | 0.33647 | 400 | 400 | 0.00726 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.02603 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.03304 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1600 | 1600 | 0.03643 |
†) = π1 + π2/2 [≡ allele frequency among females]
Fig 2Visualization of the parameter configurations covered by Table 1 as points in the -plane.
Exact power of the goodness-of-fit test with critical region (19) under selected null alternatives.
[Nominal significance level α = 0.05; equivalence margin ].
|
|
|
| Rej. Prob. | |||||
|---|---|---|---|---|---|---|---|---|
| 0.25 | 0.5 | 0.5 | 0.5 | 0.00 | 0.00 | 100 | 100 | 0.10239 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.95828 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.98461 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.98224 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.99979 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 1.00000 |
| 0.16 | 0.48 | 0.4 | 0.4 | 0.00 | 0.00 | 100 | 100 | 0.09111 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.94585 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.97755 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.97700 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.99965 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 1.00000 |
| 0.09 | 0.42 | 0.3 | 0.3 | 0.00 | 0.00 | 100 | 100 | 0.03006 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.88172 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.93296 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.94927 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.99818 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.99998 |
| 0.04 | 0.32 | 0.2 | 0.2 | 0.00 | 0.00 | 100 | 100 | 0.00008 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.62410 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.70607 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.79884 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.96535 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.99750 |
| 0.01 | 0.18 | 0.1 | 0.1 | 0.00 | 0.00 | 100 | 100 | 0.00000 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 400 | 0.06914 |
| ″ | ″ | ″ | ″ | ″ | ″ | 400 | 600 | 0.10738 |
| ″ | ″ | ″ | ″ | ″ | ″ | 600 | 400 | 0.18311 |
| ″ | ″ | ″ | ″ | ″ | ″ | 800 | 800 | 0.45744 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.73707 |
†) = π1 + π2/2 [≡ allele frequency among females]
Exact power of the goodness-of-fit test with critical region (19) and 800 observations per subgroup under alternatives specifying that the true deviation from HWE is a non-zero fraction of that considered compatible with equivalence.
[Nominal significance level α = 0.05; equivalence margin ].
|
|
|
| Δ | Rej. Prob. | |||
|---|---|---|---|---|---|---|---|
| 0.04 | 0.32870 | 0.20435 | 0.19893 | 0.03365 | 0.03365 | 0.04758 | 0.94943 |
| ″ | 0.33755 | 0.20878 | 0.19788 | 0.06729 | 0.06729 | 0.09517 | 0.91198 |
| ″ | 0.34656 | 0.21328 | 0.19684 | 0.10094 | 0.10094 | 0.14275 | 0.84553 |
| ″ | 0.35573 | 0.21787 | 0.19580 | 0.13459 | 0.13459 | 0.19034 | 0.74342 |
| ″ | 0.36506 | 0.22253 | 0.19478 | 0.16824 | 0.16824 | 0.23792 | 0.60695 |
| ″ | 0.37453 | 0.22727 | 0.19377 | 0.20188 | 0.20188 | 0.28551 | 0.45004 |
| ″ | 0.38415 | 0.23208 | 0.19276 | 0.23553 | 0.23553 | 0.33309 | 0.29666 |
| ″ | 0.39392 | 0.23696 | 0.19176 | 0.26918 | 0.26918 | 0.38068 | 0.17046 |
| ″ | 0.40382 | 0.24191 | 0.19076 | 0.30283 | 0.30283 | 0.42826 | 0.08390 |
†) = π1 + π2/2 [≡ allele frequency among females]
Testing four X-chromosomal SNPs ascertained in the GENEVA project [25] for goodness of fit to HWE.
[Nominal significance level α = 0.05; equivalence margin ; decision = “+” ⇔ rejection of the null hypothesis of relevant deviations from HWE. The results for rs12010339 were calculated replacing both zero entries by 1 and decreasing x1 by 2, in line with common rules for the analysis of sparse contingency tables].
| SNP# |
|
|
| Decision | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| rs6646338 | 651 | 230 | 314 | 107 | 604 | 399 | 0.2835 | 13.2641 | 0.4526 | + |
| rs12010339 | ″ | 651 | 0 | 0 | 605 | 603 | 3.9475 | 157.4547 | 5.7856 | − |
| rs5935567 | ″ | 231 | 337 | 83 | 605 | 372 | 0.1964 | 8.8719 | 0.3346 | + |
| rs5968922 | ″ | 275 | 296 | 80 | 604 | 392 | 0.0040 | 12.1492 | 0.1658 | + |
Comparisons between the goodness-of-fit test and the inverted χ2-test in data sets generated by simulation from a population satisfying the null hypothesis of relevant disequilibrium [upper lines] and being in perfect HWE [lines 3-4], respectively.
[Nominal significance level α = 0.05; equivalence margin ; 100,000 replications per Monte Carlo experiment].
|
|
|
| Sim. Rej. Prob. | Prop. of concord. dec. | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Gof-Test | invtd. | |||||||||
| 0.09 | 0.52274 | 0.35137 | 0.27899 | 0.33647 | 0.33647 | 100 | 100 | 0.00696 | 0.60024 | 0.40672 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.03983 | 0.00000 | 0.96017 |
| 0.09 | 0.42 | 0.3 | 0.3 | 0.00 | 0.00 | 100 | 100 | 0.03102 | 0.95177 | 0.07925 |
| ″ | ″ | ″ | ″ | ″ | ″ | 1200 | 1200 | 0.99996 | 0.94926 | 0.94930 |
†) = π1 + π2/2 [≡ allele frequency among females]
Sample-sizes approximated by means of formula (22) and exact power effectively attained with them against selected non-null alternatives of the form considered in Table 3.
[Nominal significance level α = 0.05; target power = 80%; equivalence margin ].
|
|
|
| Δ | λ | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 0.25 | 0.54097 | 0.52048 | 0.47846 | 0.16824 | 0.16824 | 0.23792 |
| 485 | 485 | 0.69063 |
| ″ | ″ | ″ | ″ | ″ | ″ | ″ |
| 449 | 898 | 0.75536 |
| ″ | ″ | ″ | ″ | ″ | ″ | ″ |
| 552 | 276 | 0.60427 |
| ″ | ″ | ″ | ″ | ″ | ″ | ″ |
| 436 | 1308 | 0.77938 |
| ″ | ″ | ″ | ″ | ″ | ″ | ″ |
| 612 | 204 | 0.54963 |
| 0.04 | 0.35573 | 0.21787 | 0.19580 | 0.13459 | 0.13459 | 0.19034 |
| 697 | 697 | 0.67720 |
| ″ | ″ | ″ | ″ | ″ | ″ | ″ |
| 656 | 1312 | 0.73626 |
| ″ | ″ | ″ | ″ | ″ | ″ | ″ |
| 774 | 387 | 0.58844 |
| ″ | ″ | ″ | ″ | ″ | ″ | ″ |
| 641 | 1923 | 0.75422 |
| ″ | ″ | ″ | ″ | ″ | ″ | ″ |
| 846 | 282 | 0.53012 |
†) = π1 + π2/2 [≡ allele frequency among females]
Sample-sizes approximated by means of formula (28) and exact power effectively attained with them against selected alternatives exactly satisfying the HWE condition.
[Nominal significance level α = 0.05; equivalence margin ].
|
|
|
|
| λ | 1 − | ||||
|---|---|---|---|---|---|---|---|---|---|
| 0.25 | 0.50 | 0.25 | 0.5 | 1/2 | 1.22475 | 0.60 | 213 | 213 | 0.64299 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.80 | 279 | 279 | 0.82359 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.90 | 338 | 338 | 0.91139 |
| ″ | ″ | ″ | ″ | 1/3 | 1.00000 | 0.60 | 163 | 326 | 0.61533 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.80 | 214 | 428 | 0.80902 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.90 | 260 | 520 | 0.90585 |
| ″ | ″ | ″ | ″ | 1/4 | 0.91287 | 0.60 | 157 | 471 | 0.64940 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.80 | 205 | 615 | 0.82969 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.90 | 247 | 741 | 0.91558 |
| 0.09 | 0.42 | 0.49 | 0.3 | 1/2 | 1.12250 | 0.60 | 264 | 264 | 0.61656 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.80 | 346 | 346 | 0.80669 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.90 | 419 | 419 | 0.90118 |
| ″ | ″ | ″ | ″ | 1/3 | 0.91652 | 0.60 | 210 | 420 | 0.59908 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.80 | 276 | 552 | 0.79392 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.90 | 335 | 670 | 0.89315 |
| ″ | ″ | ″ | ″ | 1/4 | 0.83666 | 0.60 | 202 | 606 | 0.62698 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.80 | 265 | 795 | 0.80995 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.90 | 321 | 963 | 0.90049 |
| 0.01 | 0.18 | 0.81 | 0.1 | 1/2 | 0.73485 | 0.60 | 1054 | 1054 | 0.65511 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.80 | 1375 | 1375 | 0.80985 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.90 | 1668 | 1668 | 0.88892 |
| ″ | ″ | ″ | ″ | 1/3 | 0.60000 | 0.60 | 986 | 1972 | 0.66802 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.80 | 1288 | 2576 | 0.80764 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.90 | 1575 | 3150 | 0.88403 |
| ″ | ″ | ″ | ″ | 1/4 | 0.54772 | 0.60 | 960 | 2880 | 0.66463 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.80 | 1259 | 3777 | 0.80283 |
| ″ | ″ | ″ | ″ | ″ | ″ | 0.90 | 1547 | 4641 | 0.88074 |