| Literature DB >> 29163635 |
Bowang Chen1, John W Cole2, Caspar Grond-Ginsbach3.
Abstract
Objective: Departure from Hardy Weinberg Equilibrium (HWE) may occur due to a variety of causes, including purifying selection, inbreeding, population substructure, copy number variation or genotyping error. We searched for specific characteristics of HWE-departure due to genotyping error.Entities:
Keywords: Hardy Weinberg Equilibrium (HWE); SNP quality control; association studies in genetics; heterozygosity
Year: 2017 PMID: 29163635 PMCID: PMC5671567 DOI: 10.3389/fgene.2017.00167
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Causes of HWE departure associated with reduced or excess heterozygotes.
| Missing genotypes (med, IQ range) | 0.56 (2.62) | 0.26 (2.77) | 0.486 | 13.58 (8.92) | <0.001 | |
| Insertion/deletion type ( | 133 (4.1) | 0 | 0.415 | 29 (19.7) | <0.001 | |
| MAF (med, IQ range) | 0.327 (0.237) | 0.271 (0.205) | 0.252 | 0.407 (0.134) | <0.001 | |
| MAF-SD between populations | 10.57 (6.76) | 13.26 (7.88) | 0.0077 | 12.34 (6.14) | 0.122 | |
| CNV-loss ( | 9 (0.3) | 4 (9.8) | <0.001 | 0 | 1.00 | |
| CNV-gain ( | 11 (0.3) | 0 | 1.00 | 0 | 1.00 | |
| AFR | 533 | 3 | 7 | |||
| AMR | 509 | 19 | 15 | |||
| Heterogeneity across populations | EAS | 491 | 2 | 10 | ||
| FIN | 528 | 2 | <0.001 | 7 | <0.001 | |
| NFE | 493 | 5 | 45 | |||
| SAS | 516 | 10 | 9 |
HWE-departure (d-HWE) was defined as p-HWE < 10E-5. LoH d-HWE: loss of heterozygosity associated HWE-departure. GoH d-HWE: gain of heterozygosity associated HWE-departure. AFR, African; AMR, Latino; EAS, East Asian; FIN, Finnish; NFE, Non-Finnish European; SAS, South Asian; MAF, minor allele frequency; SD, standard deviation; IQ range, interquartile range; CNV, copy number variation). P-values for comparison of d-HWE groups with variants without HWE-departure by Mann-Whitney test (Missing genotypes, MAF, MAF-SD) or χ.
Examples of HWE-departure in three ExAC variants.
| African | 691 | 2,455 | 2,051 | 708 | 2,421 | 2,068 | 0.37 | 0.59 | 1.01 | |
| East Asian | 2,392 | 1,463 | 429 | 2,277 | 1,692 | 314 | 0.73 | 8.4 E-18 | 0.86 | LoH |
| European (Finnish) | 864 | 1,630 | 812 | 853 | 1,653 | 801 | 0.51 | 0.73 | 0.99 | |
| European (Non-Finnish) | 5,564 | 15,393 | 12,372 | 5,276 | 15,969 | 12,084 | 0.40 | 3.8 E-10 | 0.96 | LoH |
| Latino | 1,658 | 2,593 | 1,523 | 1,512 | 2,885 | 1,377 | 0.51 | 1.3 E-13 | 0.90 | LoH |
| South Asian | 2,149 | 3,760 | 2,333 | 1,970 | 4,119 | 2,154 | 0.49 | 2.6 E-14 | 0.91 | LoH |
| African | 3,552 | 1,468 | 173 | 3,537 | 1,497 | 158 | 0.83 | 0.37 | 0.98 | |
| East Asian | 968 | 1,914 | 1,434 | 859 | 2,133 | 1,325 | 0.45 | 1.4 E−10 | 0.90 | LoH |
| European (Finnish) | 416 | 1,503 | 1,380 | 413 | 1,509 | 1,377 | 0.35 | 0.98 | 1.00 | |
| European (Non-Finnish) | 3,423 | 14,518 | 15,378 | 3,425 | 14,515 | 15,380 | 0.32 | 1.00 | 1.00 | |
| Latino | 195 | 1,396 | 4,190 | 138 | 1,510 | 4,133 | 0.15 | 6.8 E−08 | 0.92 | LoH |
| South Asian | 4,205 | 3,292 | 749 | 4,152 | 3,399 | 696 | 0.71 | 0.02 | 0.97 | |
| African | 110 | 1,620 | 2,198 | 215 | 1,409 | 2,303 | 0.23 | 7,6 E−20 | 1.15 | GoH |
| East Asian | 829 | 2,124 | 622 | 1,000 | 1,782 | 793 | 0.53 | 2,0 E−29 | 1.19 | GoH |
| European (Finnish) | 917 | 1,632 | 416 | 1,013 | 1,440 | 512 | 0.58 | 3,8 E−12 | 1.13 | GoH |
| European (Non-Finnish) | 5,685 | 16,989 | 4,949 | 7,279 | 13,802 | 6,543 | 0.51 | 0 | 1.23 | GoH |
| Latino | 531 | 2,895 | 1,412 | 809 | 2,339 | 1,690 | 0.41 | 3.8 E−60 | 1.24 | GoH |
| South Asian | 1,943 | 4,079 | 941 | 2,278 | 3,409 | 1,276 | 0.57 | 4.8 E−59 | 1.20 | GoH |
Obs, Observed genotypes; Exp, expected genotypes under hypothesis of HWE; Ratio O/E, ratio of observed and expected number of heterozygote carriers; p-HWE, p-value of χ.
Figure 1Analysis of genotyped short variations in genomic region 17:56,000,000–64,000,000 (A,B) and genomic region 3:65,000,000–65,500,000 (C,D) from the 1000 Genomes Project, European Population. Upper figures (A,C) show log-transformed p-values for departures from HWE. Lower figures (B,D) show ratios of observed and expected (i.e., under Hardy-Weinberg Equilibrium) frequencies of heterozygous genotypes. Dots indicate true SNPs, rectangles symbolize variants of indel type. The bar (seen in C,D) indicates common deletion variant esv2657253, (http://www.ensembl.org/Homo_sapiens/StructuralVariation/Explore?r=3:65202694--65229573;sv,esv2657253;svf,3513219;vdb,variation). The arrow (seen in A,B) points to a region with of a degenerate repeat structure within the 5' end of the LRRC37A2 with strong homology with sequences of the LRRC37A3 gene, resulting in genotyping error.