| Literature DB >> 32963319 |
Kenneth K Kidd1, Andrew J Pakstis2, Michael P Donnelly2,3, Ozlem Bulbul4, Lotfi Cherni5,6, Cemal Gurkan7,8, Longli Kang9,10, Hui Li11, Libing Yun12, Peristera Paschou13, Kelly A Meiklejohn14, Eva Haigh2, William C Speed2.
Abstract
Oculocutaneous Albinism type 2 (OCA2) is a gene of great interest because of genetic variation affecting normal pigmentation variation in humans. The diverse geographic patterns for variant frequencies at OCA2 have been evident but have not been systematically investigated, especially outside of Europe. Here we examine population genetic variation in and near the OCA2 gene from a worldwide perspective. The very different patterns of genetic variation found across world regions suggest strong selection effects may have been at work over time. For example, analyses involving the variants that affect pigmentation of the iris argue that the derived allele of the rs1800407 single nucleotide polymorphism, which produces a hypomorphic protein, may have contributed to the previously demonstrated positive selection in Europe for the enhancer variant responsible for light eye color. More study is needed on the relationships of the genetic variation at OCA2 to variation in pigmentation in areas beyond Europe.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32963319 PMCID: PMC7508881 DOI: 10.1038/s41598-020-72262-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Five commonly occurring and one rare functional SNP at OCA2 influencing expression of human pigmentation variation.
| Nucleotide position GRCh38 | Distance to next SNP in basepairs | dbSNP rs-number | Ancestral amino acids‡ Anc-position#-Drv | Alleles Anc, Drv forward strand | Hypomorphic |
|---|---|---|---|---|---|
| 27,951,891 | 31,516 | rs1800414 | His615Arg | T,C | |
| 27,983,407 | 1,694 | rs74653330 | Ala481Thr | C,T | mildly |
| 27,985,101 | 71 | rs121918166 * | Val443Ile | C,T | very |
| 27,985,172 | 29,735 | rs1800407 | Arg419Gln | C,T | possibly |
| 28,014,907 | 66,967 | rs1800401 * | Arg305Trp | G,A | |
| 28,081,874 | 38,598 | – | initiation codon | – | |
| 28,120,472 | rs12913832 | enhancer | A,G |
*SNPs not studied in this report.
‡Anc, ancestral; Drv, derived; amino acids–Ala, alanine; Arg, arginine; Gln, glutamine; His, histidine; Ile, isoleucine; Thr, threonine; Trp, tryptophan; Val, valine.
Figure 1A density plot of the frequencies of the derived allele at rs1800414. The underlying data for Figs. 1, 2, 3 and 4 are in Table S1. Alternative graphic representation with the frequencies of each population sample is in Fig. S1. See text.
Figure 2A density plot of the frequencies of the derived allele at rs74653330. The scale has been adjusted to minimize visual extrapolation to very rare occurrences. An outlier frequency of 0.52 in a small Orogen sample was omitted from the density plot and the omission resulted is a slight shift of the highest frequency region to the West. See Figure S2, caption for Fig. 1, and text.
Figure 3A density plot of the frequencies of the derived allele at rs1800407. The scale has been adjusted to minimize visual extrapolation to very rare occurrences. See Fig. S3, caption for Fig. 1, and text.
Figure 4A density plot of the frequencies of the derived allele at rs12913832. The scale has been adjusted to minimize visual extrapolation to very rare occurrences. See Fig. S4, caption for Fig. 1, and text.
Haplotypes of the ancestral 419Arg and derived 419Gln alleles at rs1800407 and the enhancer normal (E +) and negative (E-) alleles at rs12913832. The doubly-derived (cis) haplotype has frequency estimates of 1% to 3.6% in 14 European populations (see Table S2).
| 2 SNP haplotype | Eye color effect | |
|---|---|---|
| 419Arg-E + | CA | Ancestral; dark iris color |
| 419Gln-E + | TA | ? |
| 419Arg-E- | CG | Light iris color |
| 419Gln-E- | TG | Light iris color; Doubly-derived/cis |
Observed genotype counts for rs1800407 and rs12913832 among individuals with no missing data for these two SNPs. The groups shown are primarily the subset of 105 populations in Fig. 2 from world regions (Europe/SWAsia/NAfrica/SCAsia) where double heterozygotes were observed. The cells with bold underlined values indicate definite evidence of the cis (doubly-derived) haplotype,TG; see text.
| rs1800407 | CC | CT | TT | CC | CT | TT | CC | CT | TT | |
|---|---|---|---|---|---|---|---|---|---|---|
| rs12913832 | AA | AA | AA | AG | AG | AG | GG | GG | GG | |
| Population | N | |||||||||
| KSR | 32 | 3 | 0 | 5 | 1 | 0 | 1 | 0 | 0 | 42 |
| SOU | 32 | 0 | 0 | 14 | 0 | 0 | 1 | 0 | 0 | 47 |
| MHD | 27 | 1 | 0 | 9 | 2 | 0 | 1 | 0 | 0 | 40 |
| PLA | 33 | 5 | 0 | 21 | 1 | 0 | 5 | 0 | 0 | 65 |
| DRU | 39 | 5 | 0 | 38 | 2 | 0 | 17 | 0 | 0 | 101 |
| SRD | 17 | 3 | 1 | 12 | 0 | 0 | 2 | 0 | 0 | 35 |
| TCP | 22 | 5 | 1 | 20 | 2 | 0 | 9 | 0 | 0 | 59 |
| TRK | 38 | 4 | 2 | 24 | 3 | 0 | 9 | 0 | 0 | 80 |
| IRN | 28 | 4 | 0 | 8 | 2 | 0 | 0 | 0 | 0 | 42 |
| ADY | 22 | 4 | 2 | 18 | 3 | 0 | 3 | 0 | 53 | |
| ASH | 27 | 5 | 0 | 45 | 3 | 0 | 51 | 0 | 0 | 131 |
| TSI | 23 | 9 | 2 | 50 | 6 | 0 | 15 | 0 | 107 | |
| GRK | 10 | 6 | 0 | 23 | 2 | 0 | 9 | 0 | 0 | 50 |
| IBS | 38 | 10 | 0 | 40 | 8 | 9 | 0 | 107 | ||
| CHV | 2 | 1 | 0 | 13 | 2 | 0 | 21 | 0 | 42 | |
| HGR | 7 | 0 | 0 | 37 | 9 | 0 | 34 | 0 | 89 | |
| RUA | 0 | 0 | 0 | 6 | 0 | 0 | 25 | 0 | 33 | |
| RUV | 1 | 0 | 0 | 13 | 0 | 0 | 29 | 0 | 44 | |
| EAM | 5 | 2 | 0 | 26 | 3 | 0 | 48 | 0 | 87 | |
| GBR | 3 | 1 | 0 | 18 | 7 | 0 | 58 | 91 | ||
| CEU | 2 | 2 | 0 | 29 | 8 | 53 | 0 | 99 | ||
| IRI | 1 | 5 | 0 | 21 | 13 | 0 | 67 | 0 | 114 | |
| DAN | 1 | 1 | 0 | 8 | 1 | 0 | 39 | 0 | 51 | |
| FIN | 1 | 0 | 0 | 8 | 1 | 0 | 23 | 0 | 34 | |
| FN1 | 0 | 0 | 0 | 16 | 2 | 0 | 78 | 0 | 99 | |
| KMZ | 2 | 0 | 0 | 15 | 0 | 0 | 28 | 0 | 0 | 45 |
| PTH | 27 | 1 | 2 | 6 | 1 | 0 | 2 | 0 | 0 | 39 |
| PJL | 72 | 7 | 0 | 14 | 1 | 0 | 2 | 0 | 0 | 96 |
| STU | 93 | 3 | 0 | 5 | 1 | 0 | 0 | 0 | 0 | 102 |
| BEB | 65 | 4 | 0 | 15 | 2 | 0 | 0 | 0 | 0 | 86 |
| HZR | 65 | 11 | 2 | 12 | 3 | 0 | 0 | 0 | 0 | 93 |
| AKZ | 45 | 2 | 0 | 13 | 1 | 0 | 1 | 0 | 0 | 62 |
| MGL | 87 | 1 | 0 | 10 | 2 | 0 | 0 | 0 | 0 | 100 |
| MAY | 42 | 1 | 0 | 4 | 3 | 0 | 0 | 0 | 0 | 50 |
Figure 5The haplotypes of rs12913832 and rs1800407 showing the high relative frequency of the doubly-derived haplotype especially in Northern Europe.
Distribution of individuals (by direct gene counting and by inference) in 105 populations for the 10 possible genotypes of the 2-SNP haplotype based on rs1800407, rs12913832.
| Ten possible genotypes | Direct counting | Inferred by PHASE from double heterozygotes | Inferred missing genotype | Total individuals | Observed in populations | |
|---|---|---|---|---|---|---|
| CA | CA | 4,643 | 0 | 179 | 4,822 | |
| CA | CG | 885 | 0 | 40 | 925 | |
| CG | CG | 676 | 0 | 18 | 694 | |
| CA | TA | 131 | 0 | 6 | 137 | |
| CG | TA | 0 | 87 | 0 | 87 | |
| CG | TG | 35 | 0 | 0 | 35 | |
| TA | TA | 13 | 0 | 1 | 14 | |
| CA | TG | 0 | 8 | 0 | 8 | |
| TA | TG | 2 | 0 | 0 | 2 | CEU, IBS |
| TG | TG | 1 | 0 | 0 | 1 | GBR |
| Total | 6,386 | 95 | 244 | 6,725 | ||