| Literature DB >> 27678121 |
Muthukrishnan Eaaswarkhanth1, Duo Xu1, Colin Flanagan1, Margarita Rzhetskaya2, M Geoffrey Hayes2, Ran Blekhman3, Nina G Jablonski4, Omer Gokcumen5.
Abstract
Human skin has evolved rapidly, leaving evolutionary signatures in the genome. The filaggrin (FLG) gene is widely studied for its skin-barrier function in humans. The extensive genetic variation in this gene, especially common loss-of-function (LoF) mutations, has been established as primary risk factors for atopic dermatitis. To investigate the evolution of this gene, we analyzed 2,504 human genomes and genotyped the copy number variation of filaggrin repeats within FLG in 126 individuals from diverse ancestral backgrounds. We were unable to replicate a recent study claiming that LoF of FLG is adaptive in northern latitudes with lower ultraviolet light exposure. Instead, we present multiple lines of evidence suggesting that FLG genetic variation, including LoF variants, have little or no effect on fitness in modern humans. Haplotype-level scrutinization of the locus revealed signatures of a recent selective sweep in Asia, which increased the allele frequency of a haplotype group (Huxian haplogroup) in Asian populations. Functionally, we found that the Huxian haplogroup carries dozens of functional variants in FLG and hornerin (HRNR) genes, including those that are associated with atopic dermatitis susceptibility, HRNR expression levels and microbiome diversity on the skin. Our results suggest that the target of the adaptive sweep is HRNR gene function, and the functional FLG variants that involve susceptibility to atopic dermatitis, seem to hitchhike the selective sweep on HRNR. Our study presents a novel case of a locus that harbors clinically relevant common genetic variation with complex evolutionary trajectories.Entities:
Keywords: copy number variation; positive selection; skin evolution; structural variants; tandem repeats
Mesh:
Substances:
Year: 2016 PMID: 27678121 PMCID: PMC5174745 DOI: 10.1093/gbe/evw242
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
F(A) Gene structure and organization. The top figure shows the organization of FLG. The thin line indicates introns and the thicker lines indicate exonic sequences. The 5′- and 3′-UTRs, and the two coding exons are labeled and with the sizes in base pairs based on reference genome is indicated. The organizational/functional units of the gene are color-coded as shown on the lower left of the figure. At the lower part of the figure, genetic variation involving subexonic filaggrin repeats is shown.
FAllele frequency distribution of FLG loss-of-function SNPs. The map on top shows the distribution of loss-of-function alleles across the world. The orange portion of the pies corresponds to cumulative allele frequency of loss-of-function alleles detected in the 1000 Genomes Phase 3 dataset. The bottom graph highlights the most common loss-of-function variants in different continental populations.
F(A) The number of FLG loss-of-function variants standout from the chromosome-wide expectations. The x-axis of this plot indicates the log number of loss-of-function variants observed in each gene (normalized by gene size), on chromosome 1 using the 1000 Genomes Phase 3 dataset. Note that genes with no loss-of-function variants reported were not plotted. The number of FLG loss-of-function variants is significantly higher than the chromosome-wide expectation.
F(A) Linkage disequilibrium (LD) in and around FLG gene. The x-axis of this plot shows the chromosomal location. The y-axis shows the LD R2 between all reported single nucleotide variants in CHB population and rs77422831, which is the variant with highest iHS score in CHB population. Top panel shows the approximate locations of genes, including FLG and HRNR. The red to blue color gradient indicates decreasing LD with rs77422831.
FPutatively functional variants are linked with Huxian haplogroup. The x-axis of this plot shows the chromosomal location. The y-axis shows the LD R2 between all reported single nucleotide variants in CHB population and rs77422831, which is the defining variant for Huxian haplogroup. Top panel shows the approximate locations of genes. In each panel, a different functional aspect of the variation was considered. In the top panel, the variants with CADD scores in the 90th percentile (CADD = 6.714355) were indicated with red points. In the middle panel, the variants that are significantly (P < 10−5) associated with the expression levels of HRNR were indicated with red points. In the bottom panel, the variants that are associated with skin microbiome diversity (P < 0.001) were indicated by red points.