| Literature DB >> 29288525 |
Jan Graffelman1,2, Bruce S Weir2.
Abstract
Statistical tests for Hardy-Weinberg equilibrium are important elementary tools in genetic data analysis. X-chromosomal variants have long been tested by applying autosomal test procedures to females only, and gender is usually not considered when testing autosomal variants for equilibrium. Recently, we proposed specific X-chromosomal exact test procedures for bi-allelic variants that include the hemizygous males, as well as autosomal tests that consider gender. In this study, we present the extension of the previous work for variants with multiple alleles. A full enumeration algorithm is used for the exact calculations of tri-allelic variants. For variants with many alternate alleles, we use a permutation test. Some empirical examples with data from the 1,000 genomes project are discussed.Entities:
Keywords: X chromosome; indel; microsatellite; permutation test
Mesh:
Year: 2018 PMID: 29288525 PMCID: PMC5969302 DOI: 10.1111/1755-0998.12748
Source DB: PubMed Journal: Mol Ecol Resour ISSN: 1755-098X Impact factor: 7.090
Lower triangular matrix layout for autosomal genotype counts
Layout of X‐chromosomal genotype counts for males (vector) and females (lower triangular matrix)
Twofold triangular matrix layout of autosomal genotype counts accounting for gender, with male and female genotype counts
Table of all possible outcomes and probabilities of a sample consisting of 10 individuals (six males and four females) with total allele counts A = 3, B = 5 and C = 6. Sample Nr. 15 (in bold) corresponds to the observed sample
| Nr. | Males | Females | Prob. | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| A | B | C | AA | AB | AC | BB | BC | CC | ||
| 1 | 3 | 3 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 0.0029 |
| 2 | 3 | 3 | 0 | 0 | 0 | 0 | 1 | 0 | 3 | 0.0005 |
| 3 | 3 | 2 | 1 | 0 | 0 | 0 | 0 | 3 | 1 | 0.0114 |
| 4 | 3 | 2 | 1 | 0 | 0 | 0 | 1 | 1 | 2 | 0.0086 |
| 5 | 3 | 1 | 2 | 0 | 0 | 0 | 0 | 4 | 0 | 0.0057 |
| 6 | 3 | 1 | 2 | 0 | 0 | 0 | 1 | 2 | 1 | 0.0171 |
| 7 | 3 | 1 | 2 | 0 | 0 | 0 | 2 | 0 | 2 | 0.0021 |
| 8 | 3 | 0 | 3 | 0 | 0 | 0 | 1 | 3 | 0 | 0.0038 |
| 9 | 3 | 0 | 3 | 0 | 0 | 0 | 2 | 1 | 1 | 0.0029 |
| 10 | 2 | 4 | 0 | 0 | 1 | 0 | 0 | 0 | 3 | 0.0007 |
| 11 | 2 | 4 | 0 | 0 | 0 | 1 | 0 | 1 | 2 | 0.0043 |
| 12 | 2 | 3 | 1 | 0 | 1 | 0 | 0 | 1 | 2 | 0.0171 |
| 13 | 2 | 3 | 1 | 0 | 0 | 1 | 0 | 2 | 1 | 0.0343 |
| 14 | 2 | 3 | 1 | 0 | 0 | 1 | 1 | 0 | 2 | 0.0086 |
|
|
|
|
|
|
|
|
|
|
|
|
| 16 | 2 | 2 | 2 | 0 | 1 | 0 | 1 | 0 | 2 | 0.0128 |
| 17 | 2 | 2 | 2 | 0 | 0 | 1 | 0 | 3 | 0 | 0.0343 |
| 18 | 2 | 2 | 2 | 0 | 0 | 1 | 1 | 1 | 1 | 0.0514 |
| 19 | 2 | 1 | 3 | 0 | 0 | 1 | 1 | 2 | 0 | 0.0343 |
| 20 | 2 | 1 | 3 | 0 | 0 | 1 | 2 | 0 | 1 | 0.0086 |
| 21 | 2 | 1 | 3 | 0 | 1 | 0 | 0 | 3 | 0 | 0.0228 |
| 22 | 2 | 1 | 3 | 0 | 1 | 0 | 1 | 1 | 1 | 0.0343 |
| 23 | 2 | 0 | 4 | 0 | 0 | 1 | 2 | 1 | 0 | 0.0043 |
| 24 | 2 | 0 | 4 | 0 | 1 | 0 | 1 | 2 | 0 | 0.0086 |
| 25 | 2 | 0 | 4 | 0 | 1 | 0 | 2 | 0 | 1 | 0.0021 |
| 26 | 1 | 5 | 0 | 0 | 0 | 2 | 0 | 0 | 2 | 0.0009 |
| 27 | 1 | 5 | 0 | 1 | 0 | 0 | 0 | 0 | 3 | 0.0001 |
| 28 | 1 | 4 | 1 | 0 | 1 | 1 | 0 | 0 | 2 | 0.0086 |
| 29 | 1 | 4 | 1 | 0 | 0 | 2 | 0 | 1 | 1 | 0.0171 |
| 30 | 1 | 4 | 1 | 1 | 0 | 0 | 0 | 1 | 2 | 0.0043 |
| 31 | 1 | 3 | 2 | 0 | 2 | 0 | 0 | 0 | 2 | 0.0086 |
| 32 | 1 | 3 | 2 | 0 | 1 | 1 | 0 | 1 | 1 | 0.0685 |
| 33 | 1 | 3 | 2 | 0 | 0 | 2 | 0 | 2 | 0 | 0.0343 |
| 34 | 1 | 3 | 2 | 0 | 0 | 2 | 1 | 0 | 1 | 0.0171 |
| 35 | 1 | 3 | 2 | 1 | 0 | 0 | 0 | 2 | 1 | 0.0171 |
| 36 | 1 | 3 | 2 | 1 | 0 | 0 | 1 | 0 | 2 | 0.0043 |
| 37 | 1 | 2 | 3 | 0 | 2 | 0 | 0 | 1 | 1 | 0.0343 |
| 38 | 1 | 2 | 3 | 0 | 1 | 1 | 0 | 2 | 0 | 0.0685 |
| 39 | 1 | 2 | 3 | 0 | 1 | 1 | 1 | 0 | 1 | 0.0343 |
| 40 | 1 | 2 | 3 | 0 | 0 | 2 | 1 | 1 | 0 | 0.0343 |
| 41 | 1 | 2 | 3 | 1 | 0 | 0 | 0 | 3 | 0 | 0.0114 |
| 42 | 1 | 2 | 3 | 1 | 0 | 0 | 1 | 1 | 1 | 0.0171 |
| 43 | 1 | 1 | 4 | 0 | 0 | 2 | 2 | 0 | 0 | 0.0043 |
| 44 | 1 | 1 | 4 | 0 | 1 | 1 | 1 | 1 | 0 | 0.0343 |
| 45 | 1 | 1 | 4 | 0 | 2 | 0 | 0 | 2 | 0 | 0.0171 |
| 46 | 1 | 1 | 4 | 0 | 2 | 0 | 1 | 0 | 1 | 0.0086 |
| 47 | 1 | 1 | 4 | 1 | 0 | 0 | 1 | 2 | 0 | 0.0086 |
| 48 | 1 | 1 | 4 | 1 | 0 | 0 | 2 | 0 | 1 | 0.0021 |
| 49 | 1 | 0 | 5 | 0 | 1 | 1 | 2 | 0 | 0 | 0.0017 |
| 50 | 1 | 0 | 5 | 0 | 2 | 0 | 1 | 1 | 0 | 0.0034 |
| 51 | 1 | 0 | 5 | 1 | 0 | 0 | 2 | 1 | 0 | 0.0009 |
| 52 | 0 | 5 | 1 | 0 | 0 | 3 | 0 | 0 | 1 | 0.0011 |
| 53 | 0 | 5 | 1 | 1 | 0 | 1 | 0 | 0 | 2 | 0.0009 |
| 54 | 0 | 4 | 2 | 0 | 1 | 2 | 0 | 0 | 1 | 0.0086 |
| 55 | 0 | 4 | 2 | 1 | 1 | 0 | 0 | 0 | 2 | 0.0021 |
| 56 | 0 | 4 | 2 | 0 | 0 | 3 | 0 | 1 | 0 | 0.0057 |
| 57 | 0 | 4 | 2 | 1 | 0 | 1 | 0 | 1 | 1 | 0.0086 |
| 58 | 0 | 3 | 3 | 0 | 2 | 1 | 0 | 0 | 1 | 0.0114 |
| 59 | 0 | 3 | 3 | 0 | 1 | 2 | 0 | 1 | 0 | 0.0228 |
| 60 | 0 | 3 | 3 | 1 | 1 | 0 | 0 | 1 | 1 | 0.0114 |
| 61 | 0 | 3 | 3 | 1 | 0 | 1 | 0 | 2 | 0 | 0.0114 |
| 62 | 0 | 3 | 3 | 0 | 0 | 3 | 1 | 0 | 0 | 0.0038 |
| 63 | 0 | 3 | 3 | 1 | 0 | 1 | 1 | 0 | 1 | 0.0057 |
| 64 | 0 | 2 | 4 | 0 | 1 | 2 | 1 | 0 | 0 | 0.0086 |
| 65 | 0 | 2 | 4 | 0 | 2 | 1 | 0 | 1 | 0 | 0.0171 |
| 66 | 0 | 2 | 4 | 1 | 0 | 1 | 1 | 1 | 0 | 0.0086 |
| 67 | 0 | 2 | 4 | 1 | 1 | 0 | 0 | 2 | 0 | 0.0086 |
| 68 | 0 | 2 | 4 | 0 | 3 | 0 | 0 | 0 | 1 | 0.0029 |
| 69 | 0 | 2 | 4 | 1 | 1 | 0 | 1 | 0 | 1 | 0.0043 |
| 70 | 0 | 1 | 5 | 0 | 2 | 1 | 1 | 0 | 0 | 0.0034 |
| 71 | 0 | 1 | 5 | 1 | 0 | 1 | 2 | 0 | 0 | 0.0009 |
| 72 | 0 | 1 | 5 | 0 | 3 | 0 | 0 | 1 | 0 | 0.0023 |
| 73 | 0 | 1 | 5 | 1 | 1 | 0 | 1 | 1 | 0 | 0.0034 |
| 74 | 0 | 0 | 6 | 0 | 3 | 0 | 1 | 0 | 0 | 0.0002 |
| 75 | 0 | 0 | 6 | 1 | 1 | 0 | 2 | 0 | 0 | 0.0001 |
Identifier, position (in bp), genotype counts and exact p‐values for five tri‐allelic X‐chromosomal variants
| Variant | Position | Genotype counts | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Males | Females | |||||||||
| A | B | C | AA | AB | AC | BB | BC | CC | ||
| NA | 18892613 | 0 | 0 | 56 | 0 | 0 | 2 | 0 | 14 | 32 |
| NA | 44317003 | 2 | 2 | 52 | 0 | 0 | 0 | 0 | 0 | 48 |
| rs111463470 | 3107933 | 1 | 21 | 34 | 0 | 1 | 0 | 8 | 24 | 15 |
| rs79878783 | 67482671 | 0 | 15 | 41 | 0 | 0 | 1 | 3 | 41 | 3 |
| rs373113553 | 83676643 | 15 | 17 | 24 | 4 | 2 | 13 | 6 | 19 | 4 |
HWP & EAF, omnibus exact test for HWP and EAF jointly; Perm., approximation of the omnibus p‐value by a permutation procedure with 20,000 draws; HWP (F), exact test for HWP in females only; EAF, Fisher exact test for equality of allele frequencies in males and females.
Figure 1Hardy–Weinberg equilibrium tests for 2,979 tri‐allelic X‐chromosomal variants. (a): p‐values of the omnibus exact test (HWP & EAF) vs. variant position. The solid horizontal line corresponds to α = 0.05, and the dotted horizontal line corresponds to the Bonferroni‐corrected threshold. Variants colour‐coded according to having excess of heterozygotes (red) or lack of heterozygotes (green) with respect to HWP. (b) QQ‐plot of the omnibus exact p‐values. (c) Observed vs. expected heterozygosity. (d) Omnibus exact p‐values vs. exact p‐values obtained in a test using the females only. Horizontal and vertical lines correspond to α = 0.05. Panels b, c and d show variants colour‐coded according to the p‐value of an exact test for EAF
Male and female genotype counts of a four, five, six and seven allelic indel on chromosome X
p‐values of statistical tests for multi‐allelic indels on chromosome X
| Variant | # alleles | HWP & EAF | HWP (F) | EAF | |
|---|---|---|---|---|---|
| 1 | rs67657605 | 4 | 0.3101 | 0.6553 | 0.4578 |
| 2 | NA | 5 | 0.0001 | 0.0002 | 0.0253 |
| 3 | rs60184331 | 6 | 0.2254 | 0.1714 | 0.2386 |
| 4 | rs59130472 | 7 | 0.1070 | 0.0497 | 0.5045 |
HWP & EAF, joint exact test for Hardy–Weinberg proportions and equality of allele frequencies (obtained by permutation); HWP (F), exact test for Hardy–Weinberg proportions in females only; EAF, Fisher Exact test for equality of allele frequencies in the sexes.
Figure 2Distribution of the probability of permuted samples in an omnibus exact test for HWP and EAF with multi‐allelic variants on the X chromosome [Colour figure can be viewed at http://wileyonlinelibrary.com]
Variant, position (in bp), genotype counts and exact p‐values for six tri‐allelic variants on chromosome 7
| Variant | Position | Males | Females | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AA | AB | AC | BB | BC | CC | AA | AB | AC | BB | BC | CC | ||
| rs36186766 | 1048677 | 12 | 19 | 13 | 7 | 5 | 0 | 8 | 12 | 13 | 8 | 7 | 0 |
| rs111437421 | 5092372 | 38 | 14 | 0 | 2 | 2 | 0 | 32 | 15 | 1 | 0 | 0 | 0 |
| rs217419 | 44563555 | 56 | 0 | 0 | 0 | 0 | 0 | 42 | 4 | 2 | 0 | 0 | 0 |
| rs59542926 | 101495109 | 16 | 39 | 1 | 0 | 0 | 0 | 23 | 22 | 0 | 3 | 0 | 0 |
| rs35141756 | 123671547 | 29 | 19 | 7 | 1 | 0 | 0 | 41 | 5 | 0 | 2 | 0 | 0 |
| rs3863236 | 154655116 | 15 | 17 | 3 | 21 | 0 | 0 | 12 | 24 | 0 | 9 | 3 | 0 |
HWP & EAF, omnibus exact test for HWP and EAF jointly; HWP (M+F), standard exact test with all individuals; HWP (F), standard exact test for HWP in females; HWP (M) standard exact test for HWP in males; EAF, Fisher exact test for equality of allele frequencies in males and females.
Figure 3Hardy–Weinberg equilibrium tests for 2992 tri‐allelic variants on chromosome 7. (a) p‐values of the omnibus exact test vs. variant position. The solid horizontal line corresponds to α = 0.05, and dotted horizontal line corresponds to the Bonferroni‐corrected threshold. Variants colour‐coded according to having excess of heterozygotes (red) or lack of heterozygotes (green) with respect to HWP. (b) QQ‐plot of the omnibus exact p‐values. (c) Observed vs. expected heterozygosity. (d) Omnibus exact p‐values vs. exact p‐values obtained in a test using the females only. Panels b, c and d have variants colour‐coded according to the p‐value of an exact test for EAF (significant to nonsignificant from red to green). Horizontal and vertical lines correspond to α = 0.05
Genotype counts for males (left) and females (right) of four, five and six allelic variants on chromosome 7
Test results for three multi‐allelic variants on chromosome 7 of the JPT sample
| Variant | # alleles | HWP & EAF | HWP (M+F) | HWP (M) | HWP (F) | EAF | |
|---|---|---|---|---|---|---|---|
| 1 | NA | 4 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.6964 |
| 2 | rs145685769 | 5 | 0.9483 | 0.7216 | 0.9285 | 0.9505 | 0.2968 |
| 3 | rs71774128 | 6 | 0.0035 | 0.0005 | 0.0571 | 0.0044 | 0.3520 |