Literature DB >> 33692100

Genome-wide association study in almost 195,000 individuals identifies 50 previously unidentified genetic loci for eye color.

Mark Simcoe1,2, Ana Valdes1,3, Fan Liu4,5,6, Nicholas A Furlotte7, David M Evans8,9, Gibran Hemani9,10, Susan M Ring9,10, George Davey Smith9,10, David L Duffy11, Gu Zhu11, Scott D Gordon11, Sarah E Medland11, Dragana Vuckovic12,13,14, Giorgia Girotto12,13, Cinzia Sala15, Eulalia Catamo12, Maria Pina Concas13, Marco Brumat12, Paolo Gasparini12,13, Daniela Toniolo15, Massimiliano Cocca13, Antonietta Robino13, Seyhan Yazar16, Alex Hewitt16,17,18, Wenting Wu19, Peter Kraft20, Christopher J Hammond1,2, Yuan Shi21, Yan Chen4,5,6, Changqing Zeng5, Caroline C W Klaver22,23,24, Andre G Uitterlinden23,25, M Arfan Ikram23, Merel A Hamer26, Cornelia M van Duijn23,27, Tamar Nijsten26, Jiali Han19, David A Mackey16, Nicholas G Martin11, Ching-Yu Cheng21,28, David A Hinds7, Timothy D Spector1, Manfred Kayser29, Pirro G Hysi30,2.   

Abstract

Human eye color is highly heritable, but its genetic architecture is not yet fully understood. We report the results of the largest genome-wide association study for eye color to date, involving up to 192,986 European participants from 10 populations. We identify 124 independent associations arising from 61 discrete genomic regions, including 50 previously unidentified. We find evidence for genes involved in melanin pigmentation, but we also find associations with genes involved in iris morphology and structure. Further analyses in 1636 Asian participants from two populations suggest that iris pigmentation variation in Asians is genetically similar to Europeans, albeit with smaller effect sizes. Our findings collectively explain 53.2% (95% confidence interval, 45.4 to 61.0%) of eye color variation using common single-nucleotide polymorphisms. Overall, our study outcomes demonstrate that the genetic complexity of human eye color considerably exceeds previous knowledge and expectations, highlighting eye color as a genetically highly complex human trait.
Copyright © 2021 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC).

Entities:  

Year:  2021        PMID: 33692100      PMCID: PMC7946369          DOI: 10.1126/sciadv.abd1239

Source DB:  PubMed          Journal:  Sci Adv        ISSN: 2375-2548            Impact factor:   14.136


INTRODUCTION

Eye color is primarily determined by melanin abundance within the iris pigment epithelium, which is greater in brown than in blue eyes (), and both the density and distribution of stromal melanocyte cells (). Ratios of the two forms of melanin, eumelanin and pheomelanin, within the iris as well as light absorption and scattering by extracellular components (Tyndall scattering) are additional factors that give irises their color (). Absolute melanin quantity and the eumelanin:pheomelanin ratio are higher in brown irises (), while blue or green irises have very little of both pigments and relatively more pheomelanin. European populations, or those with partial European origin, display the largest diversity of iris color, varying from the lightest blue to darkest brown. The prevalence of blue eyes correlates with geographic latitude across Europe and neighboring areas (), likely as a result of human migration, sexual, and possibly natural selection (–). Similarly, eye color variation with varying degrees of brown irises is seen in Asian populations (), although with a much reduced range compared to brown eye color variation in Europeans. Iris color is highly heritable (). Previous genome-wide association studies (GWASs) have identified various single-nucleotide polymorphisms (SNPs) in and around 10 genes significantly associated with eye color (–), highlighting the polygenic nature of a trait that, in the past, was assumed to be genetically simple (). The strongest genetic influence on eye color is exerted by the neighboring HERC2 and OCA2 genes (, , , ), where a long distance enhancer effect of an intronic SNP in HERC2 was demonstrated interacting with the OCA2 promoter functioning as a molecular switch between light and dark pigmentation (). Previously available genetic knowledge allows accurate prediction of blue and brown eye color, for instance, with a DNA test system based on six SNPs from six genes, including HERC2 and OCA2 (), that has been used in anthropological () and forensic (, ) applications. However, nonblue and nonbrown eye color can be genetically predicted with considerably lower accuracy (, , ), likely because of unknown predictive SNPs and responsible genes. Moreover, the phenotypic variance of eye color not previously explained by GWASs ranged between 26% for a blue versus a brown scale () and 50% for a scale using three categories () across studies. This illustrates the likely presence of yet unknown genes responsible for the noted missing genetic predicting accuracy (, , ) and missing heritability of human eye color (). To overcome these limitations and better understand the genetics of human eye color, we carried out the largest eye color GWAS to date. Our study involved 157,485 individuals of European ancestry in the discovery stage and an additional 35,501 ancestral European individuals in the replication stage, as well as 1636 Asians (of Han Chinese and Indian ancestry) additionally used for replication purposes, in total 194,622 individuals of different ancestries.

RESULTS

Linear regression results were obtained for 11,532,091 SNPs from 157,485 individuals of European ancestry in the discovery dataset from the personal genetics company 23andMe Inc. The Devlin’s genomic factor () was λ = 1.13, while a linkage disequilibrium (LD) score regression intercept of 1.095 with an [(intercept − 1)/(mean(χ2) − 1)] ratio of 0.19 was obtained, which is consistent with expectations based on the large sample size and polygenic architecture (). In total, 12,192 SNPs were associated at genome-wide significance (P < 5 × 10−8) with eye color. These SNPs clustered in 52 distinct genomic regions, 50 across the autosomes and 2 on chromosome X (Table 1 and Fig. 1). While confirming the 10 genomic regions previously associated with human eye color (, , ), the remaining 42 genomic regions identified represent novel discoveries.
Table 1

Strongest associated SNPs from the 52 genomic regions independently associated with eye color in the European discovery cohort (N = 157,485).

A full list of 115 independently associated SNPs from conditional analysis at these loci can be found in table ST1. SNPs with novel associations for eye color identified here are highlighted in bold. SNPs that were associated with other pigmentation traits, but not iris color, in previous studies are indicated by an asterisk (*). Chr is the chromosome for the given SNP, genome position (Pos) refer to the HG build 37, RS is the rsid for the given SNP, Freq is the allele frequency of the stated reference allele, Beta is the effect size for the stated reference allele, SE is the respective standard error for the beta, Alt allele is the alternative allele for the given SNP, and N is the sample size tested for the respective SNP.

RegionChrPosRSNearest geneRef alleleAlt alleleNFreqBetaSEP
119166344rs6693258GPR157TC1574830.216−0.0560.00919.86 × 10−10
2142110888rs6696511HIVEP3TC1574830.3820.1090.00771.35 × 10−45
31212421629rs351385*DTLGA1574830.580−0.0830.00752.31 × 10−28
41236035805rs2385028LYSTTC1574830.2450.0610.00862.43 × 10−12
5246233381rs13016869PRKCEGC1574830.208−0.0580.00922.23 × 10−10
62206950236rs112747614INO80DGC1574830.6960.0540.00813.69 × 10−11
72219755011rs121908120WNT10ATA1574830.9760.1400.02471.54 × 10−8
82223483670rs12614022FARSBGA1574830.6360.0480.00784.26 × 10−10
92239276278rs74409360TRAF3IP1TC1574830.076−0.1280.01408.09 × 10−20
10342762488rs3912104CCDC13TA1574830.5410.0450.00751.47 × 10−9
11369980177rs116359091*MITFGA1574830.974−0.2290.02461.02 × 10−20
12423939399rs4521336PPARGC1ATC1574830.3590.0480.00791.28 × 10−9
13459359559rs141318671No geneGA1574830.993−0.3180.05682.09 × 10−8
14490059434rs6828137TIGD2TG1574830.457−0.0410.00753.93 × 10−8
155311902rs62330021PDCD6, AHRRGA1574830.948−0.3250.01711.36 × 10−80
16533951693rs16891982*SLC45A2GC1574830.958−0.6540.01911.97 × 10−255
17540273518rs348613DAB2GA1574830.0010.7310.12272.54 × 10−9
185123896988rs72777200ZNF608TC1574830.8280.0740.01001.22 × 10−13
195148216187rs11957757ADRB2GA1574830.5470.0420.00752.94 × 10−8
206396321rs12203592*IRF4TC1574830.171−0.3850.01001.61 × 10−321
21610538183rs6910861GCNT2GA1574830.4610.0780.00761.20 × 10−24
226158841725rs341147TULP4GA1574830.343−0.0450.00786.45 × 10−9
23745960645rs2854746IGFBP3GC1574830.587−0.0430.00761.24 × 10−8
24783653553rs6944702SEMA3ATC1574830.5120.0480.00751.71 × 10−10
25812690997rs6997494LONRF1TG1574830.4800.0480.00741.07 × 10−10
26842003663rs12543326*AP3M2GA1574830.8750.0650.01139.07 × 10−9
27881350433rs147068120ZBTB10TC1574830.038−0.1220.01988.34 × 10−10
28912677471rs13297008*TYRP1GA1574830.3880.2600.00774.99 × 10−250
29927366436rs12552712MOB3BTC1574830.604−0.0560.00772.79 × 10−13
309132001056rs12335410IER5LTC1574830.221−0.0580.00901.28 × 10−10
311168831364rs72928978*TPCN2GA1574830.8770.1420.01216.42 × 10−32
321189017961rs1126809*TYRGA1574830.7210.2850.00831.82 × 10−255
331223979791rs9971729*SOX5CA1574830.5660.0820.00765.03 × 10−27
341292567833rs790464BTG1TC1574830.1890.0600.00964.50 × 10−10
351374178399rs2095645KLF12GA1574830.6720.0500.00793.26 × 10−10
361395189401rs9301973*DCTGA1574830.6710.0660.00801.48 × 10−16
371469236136rs138777265ZFP36L1CA1574830.047−0.1090.01766.21 × 10−10
381492780387rs17184180*SLC24A4TA1574830.5670.2710.00751.34 × 10−284
391528211758rs4778218*OCA2GA1574830.834−0.5200.0100<1 × 10−330
401528356859rs1129038*HERC2TC1574830.734−2.6050.0058<1 × 10−330
411548426484rs1426654SLC24A5GA1574830.1020.6840.06442.72 × 10−26
42161376386rs761063UBE2IGC1574830.277−0.0500.00842.07 × 10−9
43171966889rs4790309SMG6TC1574830.4560.0410.00753.71 × 10−8
441767497367rs3809761MAP2K6GA1574830.8110.0590.01015.08 × 10−9
451779612397rs6420484*TSPAN10GA1574830.645−0.1610.00807.53 × 10−90
46197581625rs73488486ZNF358TG1574830.0970.0860.01282.40 × 10−11
47204948248rs2748901SLC23A2GA1574830.463−0.0470.00753.37 × 10−10
482138568882rs2835660TTC3GC1574830.545−0.0640.00751.62 × 10−17
492144783287rs622330*SIK1GA1574830.5120.1110.00753.87 × 10−49
502246369657rs35051352WNT7BGC1478750.549−0.0620.00813.33 × 10−14
512348118832rs78542430SSX1TA1574830.450−0.0950.00625.93 × 10−53
5223119439335rs5957354TMEM255ATA1574830.861−0.0900.00925.43 × 10−23
Fig. 1

Manhattan plot of eye color GWAS results in the European discovery cohort (N = 157,485).

All results with P values <5 × 10−8 are indicated in red.

Strongest associated SNPs from the 52 genomic regions independently associated with eye color in the European discovery cohort (N = 157,485).

A full list of 115 independently associated SNPs from conditional analysis at these loci can be found in table ST1. SNPs with novel associations for eye color identified here are highlighted in bold. SNPs that were associated with other pigmentation traits, but not iris color, in previous studies are indicated by an asterisk (*). Chr is the chromosome for the given SNP, genome position (Pos) refer to the HG build 37, RS is the rsid for the given SNP, Freq is the allele frequency of the stated reference allele, Beta is the effect size for the stated reference allele, SE is the respective standard error for the beta, Alt allele is the alternative allele for the given SNP, and N is the sample size tested for the respective SNP.

Manhattan plot of eye color GWAS results in the European discovery cohort (N = 157,485).

All results with P values <5 × 10−8 are indicated in red. As expected, many of the strongest associations were observed for SNPs within HERC2 (rs1129038, P < 10−330), the strongest eye color–associated region known previously, and genes involved in five of the seven known types of oculocutaneous albinism (OCA): OCA1 (TYR, rs1126809, P = 1.8 × 10−255), OCA2 (OCA2, rs1800407, P < 10−330), OCA3 (TYRP1, rs13297008; P = 5.0 × 10−250), OCA4 (SLC45A2, rs16891982; P = 2.0 × 10−255), and OCA6 (SLC24A5, rs1426654; P = 2.7 × 10−26). HERC2, TYR, OCA2, TYRP1, and SLC45A2 have previously been associated with eye color (, , , , ), while previous studies found association with hair and skin pigmentation for SLC24A5 () and eye color in a South Asian population (), but not with eye color in Europeans as we showed here. No significant eye color associations were found for either the OCA5 (the 4q24 region) or the OCA7 (C10ORF11) locus, the other two genes involved in OCA, in our large discovery sample; however, C10ORF11 has recently been associated with human eyebrow color (). The remaining four previously reported loci () were also strongly associated with eye color in the present study: LYST (rs2385028, P = 2.4 × 10−12), IRF4 (rs12203592, P = 1.6 × 10−321), SLC24A4 (rs17184180, P = 1.3 × 10−284), TSPAN10 (previously reported as NPLOC4, rs6420484; P = 7.5 × 10−90), and TTC3 (rs2835660, P = 1.6 × 10−17). Among the 41 novel genetic loci identified in our discovery analysis, we detected significant eye color association for SNPs within TPCN2 (rs72928978, P = 6.42 × 10−32) and MITF (rs75114713, P = 8.11 × 10−24). These two genetic loci and five others (near the DTL, AP3M2, SOX5, DCT, and SIK1 genes; Table 1) were associated with hair and skin pigmentation in previous GWASs (highlighted with asterisk in Table 1) (, –), while their eye color association is reported here. A comparison of our results with those from the GWAS Catalog () revealed that 34 (81%) of the 41 novel eye color–associated loci are unique to eye color and not shared with other pigmentation traits (Table 1), which is 36 (69%) when considering all 52 eye color–associated loci we identified here including the 11 previously known loci. The unique novel eye color loci include SNPs in TRAF3IP1 (rs74409360, P = 8.1 × 10−20) and SEMA3A (rs6944702, P = 1.7 × 10−10). TRAF3IP1 has been previously associated with iris furrows, while SEMA3A was associated with iris crypt variation (). These findings suggest that eye color may, at least in part considered by the eye color phenotyping done here, be mediated through structural effects within the iris. Significant association was also observed for SNPs within HIVEP3 (rs6696511, P = 1.3 × 10−45), which was previously associated with refractive error and myopia (). Novel eye color association within PPARGC1A (rs4521336, P = 1.3 × 10−9) and within MAP2K6 (rs3809761, P = 5.1 × 10−9) likely arises from their participation in pigmentation pathways, the PGC1-α protein, the product of the PPARGC1A gene. PGC1-α activates the MITF promoter () acting as a regulator in the pigment pathway of the tanning response, while overexpression of MAP2K6 increases melanocyte dendricity (), allowing greater transportation of melanosomes. We report here eye color associations for genetic loci located on the X chromosome. DNA variants clustering around the SSX1 (rs78542430, P = 5.93 × 10−53) and TMEM255A (rs5957354, P = 5.43 × 10−23) genes were significantly associated with eye color. Little is published about the TMEM255A and SSX1 genes and their potential functional roles relating to the eye, and further studies are required to understand the role of these genes in iris pigmentation. Notably, a recently published large GWAS on hair color in Europeans was the first to discover X-chromosomal loci to be involved in human pigmentation (), albeit with another gene (COL46A, 12 Mbp upstream from nearest associated locus in our study) than we found to be associated with eye color here. Notably, SNPs in or near the MC1R gene, known to be involved in light skin and red hair color, did not show eye color association in our study. Next, we conducted a conditional analysis to identify SNPs associated with eye color after adjusting for the effect of the main association signal at a respective locus. This analysis highlighted 115 conditionally associated SNPs distributed across the novel and previously known 52 genomic regions (table ST1; a secondary list of 489 high-quality control (QC) scoring, significantly associated SNPs is provided in table ST2) that improve the phenotypic variability explained by genetic factors compared to the lead SNPs. Of the 115 independently associated SNPs, 9 were missense mutations (table ST3) (), which may suggest their causal role. Using additional independent data from 35,501 individuals of European ancestry enrolled in nine different studies collected by the International Visible Trait Genetics (VisiGen) Consortium and its study partners, we sought to replicate the findings from the discovery analysis. For this, we tested the strongest associated SNP for each of the 52 regions (lead SNPs) identified in the discovery GWAS. To minimize data fragmentation and therefore maximize statistical power, results of regression analyses from each of the nine VisiGen studies were meta-analyzed together, and the outcome of this meta-analysis was used to replicate the discovery stage findings. Because of the differences in sample size, genotyping platforms, and imputation methods across the VisiGen studies, several SNPs highlighted by the discovery GWAS were not available or did not pass quality control in the replication dataset, particularly variants of low minor allele frequency (MAF). Some VisiGen studies had no availability of X chromosome data. Therefore, replication was attempted for 48 lead SNPs from 50 autosomal regions highlighted in the discovery analysis (table ST4). Overall, for 47 (98%) of the 48 lead SNPs, the direction of effect was the same in the replication analysis as previously seen in the discovery analysis (Fig. 2 and table ST4). For 27 (56%) lead SNPs, we obtained significant replication after applying a strict Bonferroni adjustment for multiple testing (0.05/48 = 1.04 × 10−3), including 18 (47%) of the 38 novel lead SNPs. An additional nine (24%) of the novel lead SNPs were associated at nominal (P < 0.05) significance. There was minimal heterogeneity between cohorts for these SNPs; a histogram of the heterogeneity I-scores is provided in fig. S1.
Fig. 2

Comparison of SNP beta and Z scores between the European discovery (23andMe, N = 157,485) and the European replication (VisiGen, N = 35,501) cohorts.

Color codes represent significance in the replication cohort: red = P < 1.04 × 10−3; green = P < 0.05; blue = P > 0.05.

Comparison of SNP beta and Z scores between the European discovery (23andMe, N = 157,485) and the European replication (VisiGen, N = 35,501) cohorts.

Color codes represent significance in the replication cohort: red = P < 1.04 × 10−3; green = P < 0.05; blue = P > 0.05. Results from both the discovery and replication studies were then meta-analyzed including a total of 192,986 Europeans from 10 populations. The Devlin’s genomic factor was relatively unchanged in this analysis (λ = 1.14). With the added statistical power, we found genome-wide significant associations for SNPs in an additional nine separate genomic regions (table ST5). All nine genetic loci were novel, not previously associated with human eye color. Three of these, PDE4D (rs62370541, P = 5.8 × 10−9), JAZF1 (rs849142, 3.0 × 10−9), and SOX6 (rs2351061, P = 4.0 × 10−8), have recently been associated with hair color (). PDE4D inhibitors have been shown to increase melanin pigment in the skin of mouse models (), as PDE4D is a target of the melanin-stimulating hormone/cyclic adenosine monophosphate (AMP)/melanocyte-inducing transcription factor (MITF) pathway (). Next, we compared results obtained from European subjects with association observations from a meta-analysis of 1636 Asian individuals (959 Han Chinese and 677 Indians from Singapore), for which quantitative measurements of the iris color were available (see the Supplementary Methods for information about quantitative phenotyping). Reliable SNP data were available for 44 of the lead autosomal SNPs from the 52 genetic loci identified by conditional analysis in the European discovery cohort. The remaining eight lead SNPs had a MAF smaller than 1% in this cohort and thus were excluded on account of the much smaller sample size. Thirty-one SNPs (70%) had the same direction of effect as the European analysis, and 5 (11%) of these 44 SNPs were significantly associated with eye color in Asians, after adjustment for multiple testing (Bonferroni-adjusted P value: 0.05/44 = 1.13 × 10−3). This included SNPs from two of the newly identified genes GPR157 (rs6693258, P = 3.6 × 10−5) and SIK1 (rs622330, P = 1.7 × 10−4) (table ST6), as well as from the previously known HERC2 gene (rs1129038, P = 2.2 × 10−4), which is the most strongly eye color–associated SNP in Europeans (). For this marker, we observed considerable MAF differences between Europeans and Asians (T allele: 0.05 in Asians and 0.73 in Europeans). The strongest association in the Singaporean cohorts, however, was found within SLC24A5 (rs1426654, P = 1.9 × 10−7) representing the OCA type 6 (OCA6) locus. Notably, OCA6 was first reported in a Chinese family (). However, the HERC2 and SLC24A5 SNPs were the only two variants to display heterogeneity between the two Singaporean cohorts (table ST6), with association primarily being driven by the cohort of South Asian ancestry. This is likely a due to South Asian and European populations being more closely related than East Asian and European populations. The MAFs for rs1426654 were considerably different between Asians (0.80 overall, 0.99 in East Asians, and 0.52 in South Asians) and Europeans (0.10) for the G allele, explaining why this SNP is often used as a DNA marker for biogeographic ancestry (). Previously, SLC24A5 rs1426654 not only explained a considerable proportion of the variation in skin pigmentation between different continental populations () but also was associated with skin and eye color variation within a South Asian population (, ). Despite such drastic differences in allele frequency for some SNP alleles, the shared eye color effect between populations from two different continents demonstrates the value of our multiethnic study. DNA variants identified via GWAS are markers of statistical association and not necessarily causative. Because of the differences in LD across continental populations, association may not be universally replicable. Therefore, we next assessed the presence of association not just for the lead SNPs in each region but also by testing other SNPs located within the 52 genomic regions identified in the European discovery analysis. Several SNPs within these regions (table ST7) showed evidence for eye color association in both European and Asian populations, with genome-wide significance in Europeans and suggestive levels of genome-wide association (P < 1 × 10−5) in Asians, despite a much smaller sample size. These SNPs also showed no significant heterogeneity between the two Asian cohorts. It is therefore possible that, against the background of much stronger effects of European-only alleles or due to population-specific differences in MAF, some polymorphisms contributing to European eye color variation are also relevant for eye color variability in non-European populations, such as variation of brown eye color in Asians tested here. In Europeans, the 112 autosomal SNPs identified through conditional analysis (all autosomal SNPs shown in table S1) explained 99.96% (SE = 6.5%, P = 4.8 × 10−279) of the liability scale for blue eyes (against brown eyes) and 38.5% (SE = 5.7%, P = 2.2 × 10−130) for intermediate eyes in the TwinsUK cohort, which was one of the VisiGen cohorts used for replication. Using the same linear scale as the GWAS analysis, these autosomal SNPs explained 53.2% (SE = 4.0%, P = 1.2 × 10−322) of the total phenotypic variation in eye color in TwinsUK. Last, we performed in silico analyses to explore the putative function of the genetic loci our study highlighted with significant eye color association using the conditional SNPs. Gene set enrichment analysis identified multiple pathways with significant enrichment (table ST8). As expected, this included several pigmentation process pathways, with “Developmental Pigmentation” the most significant (P = 7 × 10−6), followed by “Frizzled Binding” (P = 2.0 × 10−4) and “Melanin Metabolic Process” (P = 2.6 × 10−4). We also examined the potential effects of the identified eye color–associated SNPs on gene expression using data from the GTEx Consortium (). Despite the lack of iris tissue in the GTEx repository, many SNPs showed significant eQTL (expression quantitative trait loci) effects in multiple cell types and tissues (), as seen for the associated SNPs across 38 (79%) of the 48 tissues in the GTEx dataset (table ST9). Most of the strongest effects were seen in nerve and sun-exposed skin tissue (P = 8.47 × 10−62 and P = 4.84 × 10−49, respectively) for rs2835660, where the C allele is significantly associated with a decrease in TTC3 expression. TTC3 is in proximity to DSCR9, a gene whose polymorphisms were previously associated with eye color (). These results implicate TTC3 as a more likely candidate gene influencing eye color at this genetic locus than DSCR9 and that its effect on eye color is likely mediated through variation in gene expression. The lack of iris tissue information in GTEx likely explains the absence of stronger eQTL effects, such as for SNPs with regulatory effects over gene transcription ().

DISCUSSION

We report the results of the largest GWAS for human eye color to date. In addition to confirming the association of SNPs in 11 previously known eye color genes (, , , , ), the identification of 50 novel eye color–associated genetic loci helps explain previously missing heritability of eye color variability in European populations. Moreover, because of the multiethnic design of our study, we demonstrate that several of the genetic loci discovered in Europeans also have an effect on eye color in Asians. Eight of the genes in or near the loci newly associated with eye color in our study were previously reported for genetic associations with other pigmentation traits, such as hair and skin color, for instance, TPCN2, MITF, and DCT (, , , ). The commonality of associated DNA variants across the three pigmentation traits helps explain why the different pigmentation traits frequently (but not completely) intercorrelate in European populations. While many significant genetic associations are shared between iris color and other pigmentation traits, there are also notable differences. Although DNA variants within the MC1R gene are strongly associated with light skin and red hair color (), no detectable association with eye color was found in our large GWAS, in line with previous albeit smaller-sized GWASs of more limited statistical power (, , ). Similarly, other DNA variants strongly associated with skin and hair color within genes, such as SILV, ASIP, and POMC (), showed no statistically significant effect on eye color in this study, nor in previous studies. Moreover, we also identified 34 genetic loci that were significantly associated with eye color, but for which there is no report of significant association with hair and/or skin color. This is remarkable as the statistical power of the recent GWASs on hair color (, ) and sun sensitivity () were similar to that of our current eye color GWAS. Significant associations for SNPs in/near genes involved in iris structure, such as TRAF3IP1 and SEMA3A, suggest that they exert their effects with changes in Tyndall scattering, rather than through alterations of melanin metabolism. Overall, this demonstrates that although many genes overlap between eye, hair, and skin color, the different human pigmentation traits are not completely determined by the same genes as we showed. The major strengths of our study compared with previous eye color GWASs arise from the larger sample size, which translated into increased statistical power and also the ability to lower the threshold of MAF for which sufficient power to detect association is available. Rare SNPs are often a source of considerable phenotypic variation (). For instance, seven (6%) of the independently associated SNPs identified by conditional analysis in the discovery cohort had a MAF between 0.1 and 1%. Despite their low frequency, however, five (71%) of these rare SNPs were in the same region as other, more common conditional SNPs that did replicate. The remaining two loci (DAB2 and an intronic region on chromosome 4) that were not formally replicated should therefore be considered only as strong candidates with respect to their association with eye color, pending independent validation in future studies. Another strength of this work is the inclusion of European and non-European populations. Non-European populations are underrepresented in the GWAS literature in general, including in pigmentation GWASs, but their study is important for the understanding of the genetic basis of human phenotypes (). Although eye color variation is typically attributed to individuals of (at least partial) European descent, or those originating from areas nearby Europe, more subtle variation in brown eyes is also observed in Asian populations without European admixture (). Our results from the Asian cohorts showed remarkable consistence in the genetic architecture of eye color among individuals of different continental ancestries with Asian replication for the two major European genes OCA2 and HERC2. Moreover, our findings also suggest that while a single regulatory variant in HERC2 is responsible for most blue/brown variation in Europeans (), many additional DNA variants across both OCA2 and HERC2 seem to have independent effects. This hypothesis is further supported by our conditional analysis in the European discovery cohort, identifying independent associations spanning ~14 mbp across both genes rather than a concentrated cluster centered at HERC2 rs1129038. This is remarkable given the large eye color variation from the lightest blue to the darkest brown in Europeans, compared with the more limited variation within brown eye color in Asians. In conclusion, our work has identified numerous novel genetic loci associated with human eye color in Europeans, of which a subset also shows effects in Asians, despite their largely reduced phenotypic eye color variation compared with Europeans. The genetic loci we identified explain the majority (53.2%) of eye color phenotypic variation (classified using a three-category scale) in Europeans and a large proportion of the previously noted missing heritability of eye color. Our findings clearly demonstrate that eye color is a genetically highly complex human trait, similar to hair () and skin color (), as highlighted recently in large European GWASs. The large number of novel eye color–associated genetic loci identified here provide a valuable resource for future functional studies, aiming to understand the molecular mechanisms that explain their eye color association, and for future genetic prediction studies, aiming to improve DNA-based eye color prediction in anthropological and forensic applications.

METHODS

We performed a GWAS using information from two sources: 157,485 research participants of European ancestry recruited among the customer base of the personal genomics company 23andMe Inc. (Sunnyvale, CA, USA) () and a meta-analysis of 35,501 European individuals from nine populations collected by members of the International Visible Trait Genetics (VisiGen) Consortium () and their study partners. As a secondary analysis and for comparative purposes, we included an additional set of 1636 individuals of Asian ancestry from two populations (959 Han Chinese and 677 Indians from Singapore).

Populations and participants

23andMe

Participants

Participants included for this analysis were 23andMe consumers who consented to participate in research and were determined to have greater than 97% European ancestry using genotype clustering. A segmented identity-by-descent algorithm () was then applied to obtain the maximal subset (157,485) of unrelated individuals to be used as our discovery cohort (full details on ancestry and relatedness calculations are provided in the Supplementary Methods, and principal component plots are provided in fig. S2).

Genotyping

Participants were genotyped on one of 23andMe’s own custom-designed SNP arrays. The V4 platform is a fully customized array containing a subset of SNPs from previous versions, while the V3 is a customized platform based on the OmniExpress+ BeadChip, and the V1 and V2 platforms are customized variants of the Illumina HumanHap550+ BeadChip (full descriptions of platforms and genotyping procedure are provided in the Supplementary Methods). Imputation of additional variants was completed using 1000 Genomes phase 1 as reference haplotypes ().

Phenotyping

Phenotyping for iris color in this cohort was derived from questionnaires. Participants were asked to self-categorize eye color into one of seven distinct groups ranging from blue to dark brown based on color matching (full details are provided in the Supplementary Methods). This categorization allowed greater phenotypic resolution than previous three-color categories while maintaining groups distinctive enough to reduce misclassification. These groups were converted into a numerical scale ranging from 0 (blue) to 6 (dark brown) and used as the outcome variable for linear regression–based GWAS analysis.

VisiGen consortium and study partners

Specific recruitment criteria varied for each cohort (Supplementary Materials), with participants from the United Kingdom, The Netherlands, Italy, and European descendants in the United States and Australia. Principal components analyses examined the genetic ancestry for each cohort and confirmed each was of nonadmixed European ancestry (specific details for each cohort are provided in the Supplementary Methods). Eye color categorization varied between cohorts, with most of the European cohorts adopting a 3- or 5-point categorical scale (details for each cohorts’ classification are provided in the Supplementary Methods).

Singaporean cohorts

Full GWAS summary statistics were available for 959 participants of Chinese descent recruited at the Singapore Polyclinic and for 677 participants in the Singapore Indian Eye Study (SINDI). Phenotypes in these cohorts were ascertained using image analysis of digital photographs, better suited to capture variation within these populations (full details are provided in the Supplementary Methods).

Statistical analyses

Association tests

We performed an independent linear regression for each cohort under the assumption of an additive model for allelic effects, with adjustments made for age, sex, and the first five principal components. Adjustment was made also for the genotyping platform (V1 to V4), and additional measures controlling for relatedness were implemented for the VisiGen family cohorts (full details for each cohort are provided in the Supplementary Methods). The use of a linear scale may be limited by the assumption that there is an equal difference between each group; however, as eye color phenotypically follows linear scale across the color spectrum, this choice of statistical model is appropriate and in accordance with previous studies on eye color (, , ).

Meta-analyses

The QC’ed summary statistics from the VisiGen and Singaporean cohorts were pooled into two separate meta-analyses: the first was conducted using data from the nine VisiGen European cohorts, and the second using data from the two Singaporean cohorts. Both meta-analyses were performed using METAL () applying a weighted Z-score approach, given the differing phenotypic scales used across individual cohorts, with genomic control adjustments () made for the VisiGen meta-analysis (see the Supplementary Methods). A final meta-analysis was then conducted between the 23andMe cohort and the European VisiGen cohorts following the same procedures with a weighted Z score approach and genomic control adjustments.

Definition of genomic region and significance

We adopted the customary definition for a GWAS significance threshold of 5 × 10−08. For this work, an “associated region” was a genomic region demarcated by consecutive significantly associated genomic markers, separated by a nonassociated region greater than 1 million base pairs.

LD score regression

LD Hub () was used to perform LD score regression () on the summary statistics from our GWASs to test for possible inflation. This method is more suited to our study than the Devlin () genomic inflation lambda because of high trait polygenicity and the large cohort sizes ().

Conditional analysis

Conditional analysis was conducted using summary statistics with genome-wide complex trait analysis (GCTA) () following the same procedure described by Yang et al. (). Genotypic data from unrelated participants in TwinsUK were used as a reference for LD structure, a distance of 1000 kb was set as an assumption of complete linkage equilibrium, and a collinearity threshold was set at R2 > 0.9.

Variant effect prediction

The predicted effects of variants were performed using the Ensembl Variant Effect Predictor (VEP) ().

Heritability explained by the independent SNPs

Population liability scale heritability () explained by associated SNPs identified in the 23andMe discovery cohort was calculated using an unrelated sample from the TwinsUK cohort using restricted maximum likelihood analysis () under the assumption that the population prevalence (taken from the 23andMe cohort) is 30.3% for blue eyes and 38.3% for intermediate colors.

Pathway analysis

Genetic pathway analysis for iris color was performed with MAGENTA () using the 23andMe discovery SNP association results as input (full details are provided in the Supplementary Methods). Gene set definitions defined by Gene Ontology () were obtained from the Molecular Signatures Database (version MSigDB v6.1) () for this analysis.

Members of consortia and affiliations

23andMe research team

23andMe Inc., Sunnyvale, California, USA—M. Agee, A. Auton, R. K. Bell, K. Bryc, S. L. Elson, P. Fontanillas, K. E. Huber, A. Kleinman, N. K. Litterman, M. H. McIntyre, J. L. Mountain, E. S. Noblin, C. A. M. Northover, S. J. Pitts, O. V. Sazonova, J. F. Shelton, S. Shringarpure, C. Tian, J. Y. Tung, and V. Vacic.

International Visible Trait Genetics Consortium

Department of Twins Research and Genetic Epidemiology, King’s College London, London, United Kingdom—P. G. Hysi and T. D. Spector. Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, The Netherlands—M. Kayser and F. Liu. QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia—D. L. Duffy and N. G. Martin. University of Queensland Diamantina Institute, University of Queensland, Brisbane, Queensland, Australia—D. M. Evans.
  84 in total

1.  HEREDITY OF EYE-COLOR IN MAN.

Authors:  G C Davenport; C B Davenport
Journal:  Science       Date:  1907-11-01       Impact factor: 47.728

2.  Estimating missing heritability for disease from genome-wide association studies.

Authors:  Sang Hong Lee; Naomi R Wray; Michael E Goddard; Peter M Visscher
Journal:  Am J Hum Genet       Date:  2011-03-03       Impact factor: 11.025

3.  Association of iris surface features with iris parameters assessed by swept-source optical coherence tomography in Asian eyes.

Authors:  Tin A Tun; Jacqueline Chua; Yuan Shi; Elizabeth Sidhartha; Sri Gowtham Thakku; William Shei; Marcus Chiang Lee Tan; Joanne Hui Min Quah; Tin Aung; Ching-Yu Cheng
Journal:  Br J Ophthalmol       Date:  2016-03-18       Impact factor: 4.638

4.  HERC2 rs12913832 modulates human pigmentation by attenuating chromatin-loop formation between a long-range enhancer and the OCA2 promoter.

Authors:  Mijke Visser; Manfred Kayser; Robert-Jan Palstra
Journal:  Genome Res       Date:  2012-01-10       Impact factor: 9.043

5.  SLC24A5, a putative cation exchanger, affects pigmentation in zebrafish and humans.

Authors:  Rebecca L Lamason; Manzoor-Ali P K Mohideen; Jason R Mest; Andrew C Wong; Heather L Norton; Michele C Aros; Michael J Jurynec; Xianyun Mao; Vanessa R Humphreville; Jasper E Humbert; Soniya Sinha; Jessica L Moore; Pudur Jagadeeswaran; Wei Zhao; Gang Ning; Izabela Makalowska; Paul M McKeigue; David O'donnell; Rick Kittles; Esteban J Parra; Nancy J Mangini; David J Grunwald; Mark D Shriver; Victor A Canfield; Keith C Cheng
Journal:  Science       Date:  2005-12-16       Impact factor: 47.728

6.  Genome-wide efficient mixed-model analysis for association studies.

Authors:  Xiang Zhou; Matthew Stephens
Journal:  Nat Genet       Date:  2012-06-17       Impact factor: 38.330

7.  Genome-wide association meta-analysis of individuals of European ancestry identifies new loci explaining a substantial fraction of hair color variation and heritability.

Authors:  Pirro G Hysi; Ana M Valdes; Fan Liu; Nicholas A Furlotte; David M Evans; Veronique Bataille; Alessia Visconti; Gibran Hemani; George McMahon; Susan M Ring; George Davey Smith; David L Duffy; Gu Zhu; Scott D Gordon; Sarah E Medland; Bochao D Lin; Gonneke Willemsen; Jouke Jan Hottenga; Dragana Vuckovic; Giorgia Girotto; Ilaria Gandin; Cinzia Sala; Maria Pina Concas; Marco Brumat; Paolo Gasparini; Daniela Toniolo; Massimiliano Cocca; Antonietta Robino; Seyhan Yazar; Alex W Hewitt; Yan Chen; Changqing Zeng; Andre G Uitterlinden; M Arfan Ikram; Merel A Hamer; Cornelia M van Duijn; Tamar Nijsten; David A Mackey; Mario Falchi; Dorret I Boomsma; Nicholas G Martin; David A Hinds; Manfred Kayser; Timothy D Spector
Journal:  Nat Genet       Date:  2018-04-16       Impact factor: 38.330

8.  A Genome-Wide Association Study of Skin and Iris Pigmentation among Individuals of South Asian Ancestry.

Authors:  Manjari Jonnalagadda; Muhammad Ashhad Faizan; Shantanu Ozarkar; Richa Ashma; Shaunak Kulkarni; Heather L Norton; Esteban Parra
Journal:  Genome Biol Evol       Date:  2019-04-01       Impact factor: 3.416

9.  Haplotypes in SLC24A5 Gene as Ancestry Informative Markers in Different Populations.

Authors:  Emiliano Giardina; Ilenia Pietrangeli; Cristina Martínez-Labarga; Claudia Martone; Flavio de Angelis; Aldo Spinella; Gianfranco De Stefano; Olga Rickards; Giuseppe Novelli
Journal:  Curr Genomics       Date:  2008-04       Impact factor: 2.236

10.  Low-Frequency and Rare-Coding Variation Contributes to Multiple Sclerosis Risk.

Authors: 
Journal:  Cell       Date:  2018-10-18       Impact factor: 41.582

View more
  10 in total

1.  Genome-Wide Association Study of Potential Meat Quality Trait Loci in Ducks.

Authors:  Qixin Guo; Lan Huang; Hao Bai; Zhixiu Wang; Yulin Bi; Guohong Chen; Yong Jiang; Guobin Chang
Journal:  Genes (Basel)       Date:  2022-05-31       Impact factor: 4.141

Review 2.  MITF in Normal Melanocytes, Cutaneous and Uveal Melanoma: A Delicate Balance.

Authors:  Maria Chiara Gelmi; Laurien E Houtzagers; Thomas Strub; Imène Krossa; Martine J Jager
Journal:  Int J Mol Sci       Date:  2022-05-26       Impact factor: 6.208

3.  Investigating the genetic architecture of eye colour in a Canadian cohort.

Authors:  Frida Lona-Durazo; Rohit Thakur; Erola Pairo-Castineira; Karen Funderburk; Tongwu Zhang; Michael A Kovacs; Jiyeon Choi; Ian J Jackson; Kevin M Brown; Esteban J Parra
Journal:  iScience       Date:  2022-05-30

4.  Macular thickness varies with age-related macular degeneration genetic risk variants in the UK Biobank cohort.

Authors:  Rebecca A Kaye; Karina Patasova; Praveen J Patel; Pirro Hysi; Andrew J Lotery
Journal:  Sci Rep       Date:  2021-12-01       Impact factor: 4.379

5.  A large Canadian cohort provides insights into the genetic architecture of human hair colour.

Authors:  Frida Lona-Durazo; Marla Mendes; Rohit Thakur; Karen Funderburk; Tongwu Zhang; Michael A Kovacs; Jiyeon Choi; Kevin M Brown; Esteban J Parra
Journal:  Commun Biol       Date:  2021-11-04

6.  A transcriptome atlas of the mouse iris at single-cell resolution defines cell types and the genomic response to pupil dilation.

Authors:  Jie Wang; Amir Rattner; Jeremy Nathans
Journal:  Elife       Date:  2021-11-16       Impact factor: 8.140

7.  A new approach to broaden the range of eye colour identifiable by IrisPlex in DNA phenotyping.

Authors:  Ersilia Paparazzo; Anzor Gozalishvili; Vincenzo Lagani; Silvana Geracitano; Alessia Bauleo; Elena Falcone; Giuseppe Passarino; Alberto Montesanto
Journal:  Sci Rep       Date:  2022-07-27       Impact factor: 4.996

8.  Genome-Wide Analysis Identifies Candidate Genes Encoding Beak Color of Duck.

Authors:  Qixin Guo; Yong Jiang; Zhixiu Wang; Yulin Bi; Guohong Chen; Hao Bai; Guobin Chang
Journal:  Genes (Basel)       Date:  2022-07-18       Impact factor: 4.141

Review 9.  Predicting Physical Appearance from DNA Data-Towards Genomic Solutions.

Authors:  Ewelina Pośpiech; Paweł Teisseyre; Jan Mielniczuk; Wojciech Branicki
Journal:  Genes (Basel)       Date:  2022-01-10       Impact factor: 4.096

Review 10.  What colour are your eyes? Teaching the genetics of eye colour & colour vision. Edridge Green Lecture RCOphth Annual Congress Glasgow May 2019.

Authors:  David A Mackey
Journal:  Eye (Lond)       Date:  2021-08-23       Impact factor: 3.775

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.