| Literature DB >> 35047866 |
Megan Null1,2, Feyza Yilmaz3,4, David Astling5, Hung-Chun Yu3, Joanne B Cole3,6, Benedikt Hallgrímsson7, Stephanie A Santorico1,6,8, Richard A Spritz3,6, Tamim H Shaikh3,6, Audrey E Hendricks1,6,8.
Abstract
Similarity in facial characteristics between relatives suggests a strong genetic component underlies facial variation. While there have been numerous studies of the genetics of facial abnormalities and, more recently, single nucleotide polymorphism (SNP) genome-wide association studies (GWASs) of normal facial variation, little is known about the role of genetic structural variation in determining facial shape. In a sample of Bantu African children, we found that only 9% of common copy number variants (CNVs) and 10-kb CNV analysis windows are well tagged by SNPs (r2 ≥ 0.8), indicating that associations with our internally called CNVs were not captured by previous SNP-based GWASs. Here, we present a GWAS and gene set analysis of the relationship between normal facial variation and CNVs in a sample of Bantu African children. We report the top five regions, which had p values ≤ 9.35 × 10-6 and find nominal evidence of independent CNV association (p < 0.05) in three regions previously identified in SNP-based GWASs. The CNV region with strongest association (p = 1.16 × 10-6, 55 losses and seven gains) contains NFATC1, which has been linked to facial morphogenesis and Cherubism, a syndrome involving abnormal lower facial development. Genomic loss in the region is associated with smaller average lower facial depth. Importantly, new loci identified here were not identified in a SNP-based GWAS, suggesting that CNVs are likely involved in determining facial shape variation. Given the plethora of SNP-based GWASs, calling CNVs from existing data may be a relatively inexpensive way to aid in the study of complex traits.Entities:
Keywords: African; Bantu; CNV; copy number variants; face shape; normal facial variation
Year: 2021 PMID: 35047866 PMCID: PMC8756499 DOI: 10.1016/j.xhgg.2021.100082
Source DB: PubMed Journal: HGG Adv ISSN: 2666-2477
Figure 1Three dimensional photographs with annotated landmarks. (A) 3D landmarks used for geometric morphometric quantification of facial shape as previously described. The full landmark set was used to calculate the shape variables (PCs and allometry) as well as facial size (centroid size). The red landmarks are those used to obtain specific interlandmark distances for analysis. (B) The full set of distances used (Table S1).
Figure 2Analysis flowchart. Flow chart describing the analysis process. ∗CNVs called in at least two algorithms with ≥10% overlap with CNVs called in all three algorithms; ∗∗10-Kb tiling windows with a 3-Kb overlap; ∗∗∗ 15 gene sets from FaceBase phenotype groups and three gene sets from SNP GWAS of normal facial variation.
Figure 3Region plot, chromosome 18. (A) A canonical variates plot for facial shape variation by CNV variant. The 3D morphs show the mean face for the common variant (no CNV) as well as exaggerated shape contrasts (×3) for the loss and gain variants. (B) Heatmaps for the shape contrasts between the common variant and each of the CNV variants. (C) Loss (red) and gain (blue) CNVs. Each line represents a unique CNV allele from one subject with the genes in the region shown below. A zoom plot of the subset of CNVs overlapping the window with the lowest p value is also shown. The CNV analysis region is shown in black. (D) Test statistic t values (effect estimate/standard error of effect estimate) across the region with 95% confidence intervals in the directional model (top) and absent/present model (bottom). Phenotypes with at least one window with p value < 5 × 10−4 are shown: lower facial depth (GN_T), head circumference (HC), principal component 1: upper facial height and mid facial width (PC1), and upper lip height (SN_STO).
Top CNV regions associated with facial phenotypes
| Region (hg19) | Win (n) | Associated phenotype | Minimum p value | Loss (n); gain (n) | Overlapping genes | r2 | |
|---|---|---|---|---|---|---|---|
| Absent/present | Directional | ||||||
| Chr18: 77,147,000– 77283000 | 19 | head circumference | 1.31 × 10−3 | 1.16 × 10−6 | 73; 12 | 0.035 | |
| lower facial depth (average) | 1.03 × 10−4 | 7.03 × 10−3 | 55; 7 | ||||
| upper lip height | 3.47 × 10−4 | 5.80 × 10−3 | 55; 7 | ||||
| PC1 | 3.71 × 10−4 | 3.05 × 10−2 | 55; 7 | ||||
| Chr10: 111,034,000–111058000 | 3 | upper facial depth (average) | 2.64 × 10−4 | 2.64 × 10−6 | 13; 0 | 0.040 | |
| Chr4: 3,423,000–3538000 | 16 | upper facial depth (average) | 5.20 × 10−6 | 1.51 × 10−1 | 41; 7 | 0.028 | |
| midfacial depth (average) | 4.79 × 10−5 | 2.21 × 10−1 | 41; 7 | ||||
| Chr2: 34,230,000–34324000 | 13 | subnasal width | 2.47 × 10−5 | 5.23 × 10−6 | 19; 1 | 0.063 | |
| nasal width | 7.82 × 10−5 | 5.71 × 10−5 | 19; 1 | ||||
| midface PC1 | 6.63 × 10−4 | 4.65 × 10−4 | 19; 1 | ||||
| Chr16: 1,225,000–1508000 | 40 | nasal ala length (average) | 9.35 × 10−6 | 6.26 × 10−5 | 1; 9 | 0.100 | |
| subnasal width | 5.12 × 10−5 | 1.10 × 10−4 | 1; 9 | ||||
| nasal width | 5.60 × 10−5 | 1.05 × 10−4 | 1; 9 | ||||
| midfacial depth (average) | 2.85 × 10−4 | 3.32 × 10−2 | 12; 7 | ||||
Reported for the window with lowest p value in the region. Details from each window in the region, as well as genes within a 10-kb flanking region are in Table S8.
Associated phenotypes with minimum region p value < 5 × 10−4 in at least one model are reported.
Maximum pairwise r2 between SNP and CNV window in the region is reported.
CNV associations in gene regions previously identified by SNP GWAS
| SNP GWAS | Gene | Association | Phenotype | p value | p value (SNP study) | With SNP | |
|---|---|---|---|---|---|---|---|
| Cole et al. | reported SNP | rs114189713 | mouth width | 7.26 × 10−8 | 1.87 × 10−7 | 0.067 | |
| CNV region | 7: 153,860,000–153,870,000 | lower lip height | 2.93 × 10−3 | ||||
| lower facial depth (average) | 1.78 × 10−2 | ||||||
| Claes et al. | Reported SNP | rs970797 | nose width; mouth and philtrum | 5.47 × 10−1 | 6.17 × 10−11 | 0.254 | |
| CNV region | 2: 176,918,000–176,977,000 | head circumference | 1.93 × 10−3 | ||||
| lower facial height | 4.64 × 10−3 | ||||||
| nasal width | 7.58 × 10−3 | ||||||
| inner canthal width | 9.02 × 10−3 | ||||||
| PC5 | 2.45 × 10−2 | ||||||
| White et al. | Reported SNP | rs74921869 | quadrant 2: region of the nose | 8.95 × 10−1 | 3.51 × 10−11 | 0.0002 | |
| CNV region | 4: 931,000–1,060,000 | centroid size | 2.20 × 10−2 | ||||
| PC5 | 3.87 × 10−2 | ||||||
| philtrum length | 2.44 × 10−2 | ||||||
| nasal width | 3.89 × 10−2 |
p value calculated with our sample. Due to CNV QC, our sample is slightly different than that of Cole et al.
Maximum SNP-window pairwise r2 within the region for the reported SNP.
Analysis windows with p values < 0.05 reported. Other two were reported for <2.5 × 10−2.
Figure 4LD between SNPs and CNVs. (A) Bar chart showing frequency of common CNVs and CNV regions by strength of maximum pairwise r2 with SNPs within 1 Mb of the region. (B) Windows with more than 50% losses (light gray, n = 3,878), windows with more than 50% gains (dark gray, n = 2,743). Eighty-four windows that had an equal number of loss and gains had r2 < 0.2 and are not included.