| Literature DB >> 33547071 |
Betty Bonfante1, Pierre Faux1, Nicolas Navarro2,3, Javier Mendoza-Revilla4,5, Morgane Dubied2, Charlotte Montillot6, Emma Wentworth7, Lauriane Poloni2,3, Ceferino Varón-González8, Philip Jones9, Ziyi Xiong10, Macarena Fuentes-Guajardo11, Sagnik Palmal1, Juan Camilo Chacón-Duque12, Malena Hurtado4, Valeria Villegas4, Vanessa Granja4, Claudia Jaramillo13, William Arias13, Rodrigo Barquera14,15, Paola Everardo-Martínez14, Mirsha Sánchez-Quinto16, Jorge Gómez-Valdés14, Hugo Villamil-Ramírez17, Caio C Silva de Cerqueira18, Tábita Hünemeier19, Virginia Ramallo20,21, Fan Liu10,22,23, Seth M Weinberg24,25,26, John R Shaffer24,25, Evie Stergiakouli27,28, Laurence J Howe27, Pirro G Hysi29, Timothy D Spector29, Rolando Gonzalez-José21, Lavinia Schüler-Faccini20, Maria-Cátira Bortolini20, Victor Acuña-Alonzo14, Samuel Canizales-Quinteros17, Carla Gallo4, Giovanni Poletti4, Gabriel Bedoya13, Francisco Rothhammer30, Christel Thauvin-Robinet6,31, Laurence Faivre6,31, Caroline Costedoat1, David Balding9,32, Timothy Cox33, Manfred Kayser10, Laurence Duplomb6, Binnaz Yalcin6, Justin Cotney7, Kaustubh Adhikari34,9, Andrés Ruiz-Linares35,9,36.
Abstract
To characterize the genetic basis of facial features in Latin Americans, we performed a genome-wide association study (GWAS) of more than 6000 individuals using 59 landmark-based measurements from two-dimensional profile photographs and ~9,000,000 genotyped or imputed single-nucleotide polymorphisms. We detected significant association of 32 traits with at least 1 (and up to 6) of 32 different genomic regions, more than doubling the number of robustly associated face morphology loci reported until now (from 11 to 23). These GWAS hits are strongly enriched in regulatory sequences active specifically during craniofacial development. The associated region in 1p12 includes a tract of archaic adaptive introgression, with a Denisovan haplotype common in Native Americans affecting particularly lip thickness. Among the nine previously unidentified face morphology loci we identified is the VPS13B gene region, and we show that variants in this region also affect midfacial morphology in mice.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33547071 PMCID: PMC7864580 DOI: 10.1126/sciadv.abc6160
Source DB: PubMed Journal: Sci Adv ISSN: 2375-2548 Impact factor: 14.136
Fig. 1Face profile features showing genome-wide significant association.
Top: Drawings indicate the features for which the 32 traits listed below were measured in the CANDELA individuals (as defined in table S1). Bottom: Aggregate of the GWAS signals detected (across all traits). The chromosomal region (and nearest candidate gene) showing strongest association to a trait is indicated above each GWAS peak (bold type marks the previously unidentified face morphology regions identified here). Curved lines in the middle of the figure connect the previously unidentified face morphology region to their associated traits.
Fig. 2Thirty-two chromosomal regions associated with face profile traits.
The 32 traits showing genome-wide significant association are listed at the top (Fig. 1 and table S1). For ease of visualization, association P values are represented by colored boxes as follows: red, genome-wide significant (<5 × 10−8); orange, genome-wide suggestive (<10−5) (P values are provided in table S5). Chromosomal regions in bold were previously unidentified (Fig. 1 and Table 1). Prior evidence: GWASa, associated with a similar face trait; GWASb, associated with a different facial trait; GWASc, associated with a facial development disorder [e.g., nonsyndromic cleft lip/palate (NSCL/P)]; HDD, human developmental defect reported (e.g., Mendelian disorder); MDD, model developmental defect (related phenotype involving a homologous region reported in animal models).
Features of the nine chromosomal regions newly associated with face profile traits in the CANDELA sample.
Candidate genes where the index SNP is intragenic are underlined. Association P values in bold are genome-wide significant (P ≤ 5 × 10−8). Rep., European meta-analysis; AFR, sub-Saharan African; EUR, European (from 1000 genomes); NAM, Native American (); CAN, CANDELA. “# SNPs” refers to the total number of genome-wide significant SNPs in that chromosomal region (P value ≤ 5 × 10−8).
| 3q13.32 | rs7633584 | 7.3 × 10−3 | 5.2 × 10−1 | 4.2 × 10−1 | 1.6 × 10−1 | 3.8 × 10−1 | 2.0 × 10−1 | 8.2 × 10−2 | 2.6 × 10−1 | 0.29 | 0.05 | 0.39 | 0.22 | 11 | ||||
| 6q24.3 | rs6570789 | 9.8 × 10−1 | 8.0 × 10−1 | 2.0 × 10−1 | 2.2 × 10−1 | 4.1 × 10−2 | 7.9 × 10−2 | 1.2 × 10−2 | 4.2 × 10−2 | 3.3 × 10−3 | 9.2 × 10−3 | 0.40 | 0.55 | 0.92 | 0.68 | 11 | ||
| 7p12.1 | INTERGENIC | rs17134499 | 6.2 × 10−1 | 9.9 × 10−1 | 6.4 × 10−1 | 1.3 × 10−1 | 5.6 × 10−1 | 6.5 × 10−1 | 6.6 × 10−1 | 6.3 × 10−1 | 9.8 × 10−1 | 6.6 × 10−2 | 0.32 | 0.02 | 0.03 | 0.03 | 3 | |
| 7p21.1 | rs1178103 | 9.4 × 10−1 | 1.1 × 10−4 | 2.4 × 10−4 | 1.2 × 10−4 | 1.9 × 10−2 | 4.1 × 10−3 | 7.4 × 10−3 | 3.0 × 10−3 | 2.6 × 10−1 | 0.13 | 0.18 | 0.38 | 0.26 | 1 | |||
| 7q31.31 | rs6950680 | 4.9 × 10−4 | 7.2 × 10−1 | 9.8 × 10−1 | 5.6 × 10−1 | 2.1 × 10−1 | 6.9 × 10−1 | 4.7 × 10−1 | 3.4 × 10−1 | 8.2 × 10−1 | 0.21 | 0.36 | 0.04 | 0.21 | 2 | |||
| 8q22.2 | rs75976055 | 5.7 × 10−2 | 8.3 × 10−1 | 8.8 × 10−2 | 3.0 × 10−2 | 8.3 × 10−1 | 2.8 × 10−2 | 9.3 × 10−2 | 3.7 × 10−2 | 3.0 × 10−1 | 0.08 | 0.13 | 0.31 | 0.38 | 598 | |||
| 8q22.3 | 8:101737776:AT:ATT | 1.0 × 10−2 | 8.5 × 10−1 | 3.6 × 10−3 | 1.7 × 10−1 | 6.9 × 10−1 | 9.6 × 10−1 | 3.9 × 10−1 | 1.3 × 10−1 | – | 0.23 | 0.00 | 0.00 | 0.02 | 17 | |||
| 11p15.5 | rs907613 | 4.9 × 10−1 | 3.7 × 10−1 | 7.7 × 10−1 | 5.4 × 10−1 | 7.1 × 10−1 | 4.9 × 10−1 | 9.8 × 10−1 | 4.0 × 10−2 | 0.09 | 0.33 | 0.41 | 0.39 | 8 | ||||
| 17p13.3 | rs8069947 | 1.8 × 10−1 | 2.2 × 10−1 | 3.8 × 10−1 | 6.8 × 10−1 | 2.7 × 10−2 | 2.6 × 10−2 | 5.0 × 10−2 | 4.2 × 10−1 | 7.1 × 10−1 | 2.3 × 10−2 | 0.26 | 0.49 | 0.59 | 0.54 | 41 | ||
Fig. 3Evidence of association, Denisovan introgression, and selection in the WARS2-TBX15 region.
(A) Association P values (right y-axis) with lip thickness ratio 2 and the location and frequency (left y-axis) of introgressed haplotypes inferred in the CANDELA data. Filled dots indicate the SNPs retained for introgression analysis. Dots are plotted accordingly to the significance of their association and have been colored to reflect r2 with the index SNP for association with that trait (rs3790553). The location of the index SNPs associated previously with ear morphology (), chin dimples (), and forehead shape () is indicated. SNPs associated with facial features in the European meta-analysis () are outside the region shown here. The green bins show the introgressed region, and their height displays the allelic frequency and their shade refers to the confidence level at which the tract was called (pale, 20%; dark, 90%). The dashed lines denote the bounds of the Denisovan introgression region previously estimated with the same approach used here (green) () or using a Conditional Random Field approach (purple) (). (B) Standardized PBS scores obtained for genotyped SNPs when contrasting Native Americans (NAM) to CHB + CEU (red triangles) or Greenland Inuits (GI) to CHB + CEU (blue triangles). The location of the TBX15 and WARS2 genes is indicated by black arrows along with various annotations inferred from human embryonic tissue () (the color key for combined craniofacial annotations can be found in table S6).
Fig. 4Features of four genomic regions newly associated with face features in the CANDELA sample and replicating in Europeans.
Regional association P values (index SNP is labeled) are shown at the top. Regulatory annotations from human embryonic tissue () (combined chromatin states at various Carnegie stages and craniofacial-specific enhancers and superenhancers; the color key for combined craniofacial annotations is in table S6) are provided below each association plot. A heatmap of pairwise LD (r2) in the region is shown at the bottom of each panel. Similar plots for the other previously unidentified regions, detected in the CANDELA sample but not replicating in the European meta-analysis, are shown in Supplementary Notes.
Fig. 5Vps13b and mouse craniofacial shape.
(A) Shape changes in the Vps13b KO mutant male mice relative to WT. Effect is predicted according to the expected marginal means at the populational average centroid size. Darker colors represent strong positive (blue) or negative (brown) deviation from the WT, given in Procrustes distance. Positive distance represents expansion from the WT, whereas negative distance represents contraction from the WT. Mesh warping is amplified by a factor of 2 to facilitate the visualization (see also movie S1). (B) Regional association plot for the VPS13 homologous region in chromosome 15 among outbred mice. Significantly associated SNPs are shown as green dots (FDR < 5%). Ensembl annotations for genes (brown) and enhancers active in embryonic facial prominence (orange) are shown at the top. (C) Phenotypic effect associated with allele dosage at the Vps13b index SNP among outbred mice. Skull zones with an expansion/contraction relative to the mean shape are shown in brown/blue (see also movie S2). To facilitate visualization, shape variation has been magnified by ×20.