| Literature DB >> 30788143 |
Lawrence H Uricchio1, Hugo C Kitano2, Alexander Gusev3, Noah A Zaitlen4,5.
Abstract
Selection and mutation shape the genetic variation underlying human traits, but the specific evolutionary mechanisms driving complex trait variation are largely unknown. We developed a statistical method that uses polarized genome-wide association study (GWAS) summary statistics from a single population to detect signals of mutational bias and selection. We found evidence for nonneutral signals on variation underlying several traits (body mass index [BMI], schizophrenia, Crohn's disease, educational attainment, and height). We then used simulations that incorporate simultaneous negative and positive selection to show that these signals are consistent with mutational bias and shifts in the fitness-phenotype relationship, but not stabilizing selection or mutational bias alone. We additionally replicate two of our top three signals (BMI and educational attainment) in an external cohort, and show that population stratification may have confounded GWAS summary statistics for height in the GIANT cohort. Our results provide a flexible and powerful framework for evolutionary analysis of complex phenotypes in humans and other species, and offer insights into the evolutionary mechanisms driving variation in human polygenic traits.Entities:
Keywords: Complex traits; mutation bias; natural selection; polygenic selection
Year: 2019 PMID: 30788143 PMCID: PMC6369964 DOI: 10.1002/evl3.97
Source DB: PubMed Journal: Evol Lett ISSN: 2056-3744
Figure 1Panels A‐B are schematics of the trait model, while C‐E show simulation results. A: fitness impact of a mutation, assuming a symmetric fitness function. At equilibrium, the trait distribution is symmetric about the optimal value of the phenotype, . The dashed line at indicates the dividing line between individuals with increased fitness f after a mutation () from those with decreased fitness (). B: schematic of the relationship between effect size and fitness effect. At time , the optimal trait value increases, and trait‐decreasing alleles have decreased fitness while trait‐increasing alleles have increased fitness. Still, only trait‐increasing alleles of small effect are on average fitness‐increasing (inset). C: Mean trait value as a function of time for four simulated trait models, differentiated by the time of a shift in selection pressure. The simulated European demographic model is plotted in the background (not to scale) D: as a function of derived allele frequency (DAF) for each model simulated in C. Points represent the mean value of β computed over 100 independent simulations E: as a function of DAF for each model plotted in C. D and E represent the mean over 100 independent simulations. (Abbreviations: AE: ancestral expansion, OOA: out‐of‐Africa, FE: founding of Europe, DM: demographic model).
Figure 2A: as a function of DAF for BMI and educational attainment (EA). B: for the same data. C: neutral null distribution of obtained by permutations. The vertical dashed line indicates the observed value of in the GWAS summary data.
P‐values corresponding to GWAS summary statistics for nine phenotypes that we hypothesized may be under selection. Values in the first column include all alleles, while the second and third columns correspond to tests including only alleles with MAF > 1% and MAF > 5%, respectively. The UK Biobank tests were performed on all alleles above 1% in frequency
| Phenotype |
|
|
| UKBB replication |
|---|---|---|---|---|
| Height | <0.0005 | <0.0005 | <0.0005 | 0.416 |
| BMI | <0.0005 | <0.0005 | <0.0005 | 0.0095 |
| Education | <0.0005 | <0.0005 | <0.0005 | <0.0005 |
| WHR‐BMI | 0.566 | 0.8325 | 0.341 | NA |
| GLL | 0.4655 | 0.434 | 0.5645 | NA |
| Crohn's disease | 0.0025 | 0.0075 | 0.018 | NA |
| Menopause onset | 0.1585 | 0.46 | 0.0475 | NA |
| Depression | 0.3915 | 0.01 | 0.0305 | NA |
| Schizophrenia | 0.0085 | 0.0035 | 0.0625 | NA |
*Tests that pass a multiple testing correction ().
**Tests that were marginally significant ().
Figure 3A‐B: Inferred mutation bias (A) and selection shift (B) parameters as a function of true parameter values for our rejection sampling method. C‐D: Power of our rejection sampling method to correctly identify the direction of mutation bias (C) and shift in optimal phenotype value (D), as a function of the true parameter value. E‐F: Inferred approximate posterior distributions for five phenotypes that were identified as non‐neutral. G: Out‐of‐sample simulations using parameters inferred in E‐F, plotted with the data used to fit each model. Gray envelopes represent the middle 50% of simulation replicates, while the black points and curves show the observed data for each phenotype.