| Literature DB >> 29198719 |
John R Shaffer1, Jinxi Li2, Myoung Keun Lee3, Jasmien Roosenboom3, Ekaterina Orlova1, Kaustabh Adhikari4, Carla Gallo5, Giovanni Poletti5, Lavinia Schuler-Faccini6, Maria-Cátira Bortolini6, Samuel Canizales-Quinteros7, Francisco Rothhammer8, Gabriel Bedoya9, Rolando González-José10, Paige E Pfeffer11, Christopher A Wollenschlaeger12, Jacqueline T Hecht13, George L Wehby14, Lina M Moreno15, Anan Ding2, Li Jin16, Yajun Yang17, Jenna C Carlson18, Elizabeth J Leslie3, Eleanor Feingold18, Mary L Marazita19, David A Hinds20, Timothy C Cox21, Sijia Wang22, Andrés Ruiz-Linares23, Seth M Weinberg24.
Abstract
The genetic basis of earlobe attachment has been a matter of debate since the early 20th century, such that geneticists argue both for and against polygenic inheritance. Recent genetic studies have identified a few loci associated with the trait, but large-scale analyses are still lacking. Here, we performed a genome-wide association study of lobe attachment in a multiethnic sample of 74,660 individuals from four cohorts (three with the trait scored by an expert rater and one with the trait self-reported). Meta-analysis of the three expert-rater-scored cohorts revealed six associated loci harboring numerous candidate genes, including EDAR, SP5, MRPS22, ADGRG6 (GPR126), KIAA1217, and PAX9. The large self-reported 23andMe cohort recapitulated each of these six loci. Moreover, meta-analysis across all four cohorts revealed a total of 49 significant (p < 5 × 10-8) loci. Annotation and enrichment analyses of these 49 loci showed strong evidence of genes involved in ear development and syndromes with auricular phenotypes. RNA sequencing data from both human fetal ear and mouse second branchial arch tissue confirmed that genes located among associated loci showed evidence of expression. These results provide strong evidence for the polygenic nature of earlobe attachment and offer insights into the biological basis of normal and abnormal ear development.Entities:
Keywords: attached earlobe; complex trait genetics; epistasis; genome-wide association study; multigenic; pharyngeal arch; pinna; trans-ethnic; unattached earlobe
Mesh:
Substances:
Year: 2017 PMID: 29198719 PMCID: PMC5812923 DOI: 10.1016/j.ajhg.2017.10.001
Source DB: PubMed Journal: Am J Hum Genet ISSN: 0002-9297 Impact factor: 11.025
Figure 1Genome-wide Scans
(Left) Manhattan plots showing the –log10-transformed p values (y axis) by physical genomic position (x axis) for each SNP in (A) the meta-analysis of the three rater-scored cohorts and (C) the meta-analysis of all four cohorts. The horizontal line represents the threshold for genome-wide significance (p < 5 × 10−8). (A) Six significant loci (green) were observed, and genes near the lead SNP in each locus are annotated. (C) 49 associated loci were observed: the same six loci in (A) are shown in green, and these reached genome-wide significance in more than one cohort; the 15 loci in blue showed genome-wide significance in one cohort and replication-level significance (p < 0.001) in at least one additional cohort or the meta-analysis of expert-rater-scored cohorts; and the 28 loci in red were observed via meta-analysis.
(Right) Quantile-quantile plots showing the observed distribution of –log10-transformed p values (y axis) against the expected distribution (x axis) under the null hypothesis of no association (diagonal line) for (B) the meta-analysis of the three rater-scored cohorts (genomic inflation factor = 1.066) and (D) the meta-analysis of all four cohorts (genomic inflation factor = 1.563). The presence of significantly associated loci is indicated by the deviation of observed p values from the tail of the null distribution, as shown by points above the diagonal in the upper right of the plots.
Evidence of Association for the Lead SNP in Each Significant (p < 5 × 10−8) Locus Nominated in the Meta-analysis across the Rater-Scored Cohorts
| Gene candidate(s) | ||||||
| Base position | 109,513,601 | 171,542,573 | 138,997,688 | 142,921,276 | 24,506,439 | 37,209,698 |
| Functional position | missense | intronic | intergenic | intronic | intronic | intronic |
| Data source | genotyped | imputed | genotyped | imputed | imputed | imputed |
| Minor/major alleles | G/A | C/G | A/G | A/G | C/T | C/A |
| MAF | 0.015 | 0.413 | 0.257 | 0.223 | 0.451 | 0.386 |
| Beta | 0.028 | 0.073 | −0.041 | 0.119 | −0.034 | 0.086 |
| SE | 0.081 | 0.020 | 0.022 | 0.023 | 0.019 | 0.020 |
| p value | 0.733 | 1.84 × 10−4 | 0.059 | 2.78 × 10−7 | 0.077 | 2.40 × 10−5 |
| MAF | 0.404 | 0.661 | 0.550 | 0.185 | 0.490 | 0.359 |
| Beta | 0.062 | 0.088 | −0.025 | 0.067 | −0.043 | 0.027 |
| SE | 0.012 | 0.011 | 0.011 | 0.013 | 0.010 | 0.010 |
| p value | 7.67 × 10−8 | 7.29 × 10−16 | 0.019 | 2.34 × 10−7 | 2.06 × 10−5 | 9.41 × 10−3 |
| MAF | 0.946 | 0.419 | 0.429 | 0.267 | 0.303 | 0.295 |
| Beta | 0.129 | 0.101 | −0.128 | 0.051 | −0.052 | 0.065 |
| SE | 0.032 | 0.015 | 0.015 | 0.017 | 0.016 | 0.016 |
| p value | 5.66 × 10−5 | 9.84 × 10−12 | 2.60 × 10−18 | 0.002 | 0.001 | 8.04 × 10−5 |
| p value | 6.65 × 10−10 | 1.13 × 10−28 | 5.27 × 10−13 | 2.49 × 10−14 | 2.44 × 10−8 | 4.82 × 10−9 |
| MAF | 0.007 | 0.410 | 0.249 | 0.238 | 0.451 | 0.389 |
| OR | 1.490 | 1.270 | 0.837 | 1.335 | 0.813 | 1.237 |
| CI | (1.299, 1.709) | (1.238, 1.303) | (0.813, 0.862) | (1.296, 1.375) | (0.793, 0.834) | (1.206, 1.269) |
| p value | 2.04 × 10−8 | 4.90 × 10−76 | 4.42 × 10−33 | 3.31 × 10−87 | 8.48 × 10−59 | 8.26 × 10−59 |
| p value | 1.16 × 10−13 | 1.65 × 10−99 | 4.78 × 10−43 | 7.64 × 10−100 | 1.56 × 10−65 | 2.85 × 10−66 |
Abbreviations are as follows: MAF, minor allele frequency; SE, standard error of the beta-coefficient; OR, odds ratio; and CI, 95% confidence interval of the odds ratio.
Figure 2Regional Association Plots Showing Significant Associations Observed in the Meta-analysis of the Three Rater-Scored Cohorts
Regional plots near (A) EDAR, (B) SP5, (C) MRPS22, (D) ADGRG6 (GPR126), (E) KIAA1217, and (F) PAX9 show –log10-transformed p values (left y axis) by physical position (x axis). Shading denotes the LD (r2) between each SNP and the lead SNP (purple). The blue overlay represents the recombination rate (right y axis). Gene positions are indicated under each plot.
Figure 3Heatmaps of Gene Expression
Heatmaps of gene expression in second branchial arch tissue from embryonic mice (wild-type mice and dmbo and sbse mutants) and fetal human pinna (at days 57 and 59) and for genes at the six loci observed in the meta-analysis of rater-scored cohorts; normalized total read counts are on the top, and fold changes between mutant mice and wild-type mice are on the bottom. The shading scale is shown for Z scores of expression and fold change. Genes for which expression data were not measured are shown in gray.