| Literature DB >> 33983923 |
Hanne Hoskens1,2, Dongjing Liu3, Sahin Naqvi4,5, Myoung Keun Lee6, Ryan J Eller7, Karlijne Indencleef2,8,9, Julie D White10, Jiarui Li2,8, Maarten H D Larmuseau1,11,12, Greet Hens9, Joanna Wysocka4,13,14, Susan Walsh7, Stephen Richmond15, Mark D Shriver10, John R Shaffer3,6, Hilde Peeters1, Seth M Weinberg3,6,16, Peter Claes1,2,8,17.
Abstract
The analysis of contemporary genomic data typically operates on one-dimensional phenotypic measurements (e.g. standing height). Here we report on a data-driven, family-informed strategy to facial phenotyping that searches for biologically relevant traits and reduces multivariate 3D facial shape variability into amendable univariate measurements, while preserving its structurally complex nature. We performed a biometric identification of siblings in a sample of 424 children, defining 1,048 sib-shared facial traits. Subsequent quantification and analyses in an independent European cohort (n = 8,246) demonstrated significant heritability for a subset of traits (0.17-0.53) and highlighted 218 genome-wide significant loci (38 also study-wide) associated with facial variation shared by siblings. These loci showed preferential enrichment for active chromatin marks in cranial neural crest cells and embryonic craniofacial tissues and several regions harbor putative craniofacial genes, thereby enhancing our knowledge on the genetic architecture of normal-range facial variation.Entities:
Mesh:
Year: 2021 PMID: 33983923 PMCID: PMC8118281 DOI: 10.1371/journal.pgen.1009528
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Workflow of the study.
Fig 2Global-to-local matching of siblings.
(A) Global-to-local segmentation of 3D facial shape obtained using hierarchical spectral clustering of the EURO cohort. Segments are colored per quadrant, represented by the roman numerals. (B) Phenotypically similar sibling pairs were identified in a biometric identification setup, which involves the comparison of facial shape between siblings and with unrelated individuals. Matching performance using different similarity measures and facial features was evaluated using cumulative match characteristic (CMC) curves. Plotted is the percentage of sibling pairs that were correctly identified (y-axis) within the top-k% matches (x-axis) using the Mahalanobis angle. Curves are colored based on the facial features that were used to match siblings. For each quadrant, the highest and lowest identification rates per rank are shown, with the area between the two shaded.
Fig 3Genetic loci associated with the sib-shared traits.
(A) Number of sib-shared traits extracted per facial segment, corresponding to the number of sibling pairs that matched close to perfect within a given segment using the Mahalanobis angle. A total of 1,048 traits were extracted across all 63 segments, comprising 322 independent traits. The structure of the rosette plot corresponds to the polar dendrogram displaying the facial segments in Fig 2A. (B) Ideogram of the genetic loci that contribute to variation in the sib-shared traits, as identified by the association analysis of genome-wide common variants, depicted by circles and squares (i.e. overlapping and novel loci, respectively), and exome-wide low-frequency variants, depicted by triangles. For each locus, the color of the symbol represents the quadrant in which the top associated effect (i.e. lowest p-value) was observed.
Fig 4Preferential activity in CNCCs and embryonic craniofacial tissues.
Boxplots of the distribution of H3K27ac ChIP-seq signals in 20 kb regions around the 218 genome-wide significant lead SNPs in various adult, embryonic and in vitro–derived cell types. Samples corresponding to CNCCs (blue), embryonic craniofacial tissue (orange) and surface ectoderm (green) are highlighted.