| Literature DB >> 35290122 |
Yuchang Wu1,2, Shiro Furuya3, Zihang Wang4, Jenna E Nobles2,3, Jason M Fletcher2,3,5, Qiongshi Lu1,2,4.
Abstract
Following more than a century of phenotypic measurement of natural selection processes, much recent work explores relationships between molecular genetic measurements and realized fitness in the next generation. We take an innovative approach to the study of contemporary selective pressure by examining which genetic variants are “sustained” in populations as mortality exposure increases. Specifically, we deploy a so-called “regional GWAS” (genome-wide association study) that links the infant mortality rate (IMR) by place and year in the United Kingdom with common genetic variants among birth cohorts in the UK Biobank. These cohorts (born between 1936 and 1970) saw a decline in IMR from above 65 to under 20 deaths per 1,000 live births, with substantial subnational variations and spikes alongside wartime exposures. Our results show several genome-wide significant loci, including LCT and TLR10/1/6, related to area-level cohort IMR exposure during gestation and infancy. Genetic correlations are found across multiple domains, including fertility, cognition, health behaviors, and health outcomes, suggesting an important role for cohort selection in modern populations.Entities:
Keywords: infant mortality; recent natural selection; regional GWAS
Mesh:
Year: 2022 PMID: 35290122 PMCID: PMC8944929 DOI: 10.1073/pnas.2117312119
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 12.779
Fig. 1.IMR between 1936 and 1970 in England and Wales. (A) Heatmap of average county-level IMR between 1936 and 1970 in England and Wales. IMR is defined as the percentage of deaths under 1 y of age (per 1,000 live births). The denominator of the IMR does not include stillbirths. (B) Heatmap of IMR in each county and year.
Fig. 2.Genetic associations for birth year IMR. (A) Manhattan plot for birth year IMR. The horizontal lines mark the genome-wide significance cutoff of 5.0e-8 and a suggestive cutoff of 1.0e-6, respectively. (B) Genetic associations at the LCT locus. (C) Genetic associations at the TLR1/6/10 locus.
Genome-wide significant loci associated with birth year IMR
| CHR | SNP | BP (hg19) | A1 | A2 | EAF | BETA | SE | P |
|---|---|---|---|---|---|---|---|---|
| 2 | rs1446585 | 136407479 | G | A | 0.232 | −0.157 | 0.020 | 1.26E-15 |
| 4 | rs5743618 | 38798648 | A | C | 0.233 | −0.176 | 0.020 | 3.12E-19 |
*Frequency of A1. Chromosome (CHR), variant ID, base-pair coordinate (BP) based on the genome reference build hg19, allele 1 (A1), allele 2 (A2), effect allele frequency (EAF), effect size (BETA), standard error (SE), and P value for the two genome-wide significant loci.
Fig. 3.Selection patterns at IMR-associated genomic loci. (A) Associations with birth year IMR at 12 target loci for selection in Europeans (10) (also see Dataset S2). Genetic loci reaching genome-wide significance in the IMR GWAS are highlighted in dark blue. (B) Correlation between GWAS associations of birth year IMR and SDS matched with IMR-associated alleles. The fitted linear regression line is shown in red.
Fig. 4.Selection on LCT and TLR1/6/10 loci during “The Blitz.” (A) The effect size of the lead SNP (rs1446585) at the LCT locus peaks in 1942 and eventually bounces back to zero. The lead SNP (rs5743618) at the TLR1/6/10 locus follows a similar trend. Dots and intervals indicate GWAS effect size estimates and SEs. Years with n < 5,000 were excluded from the analysis. (B) Minor allele frequency (MAF) of rs1446585 in UKB birth cohort in 1939–1943. Major allele at this locus is known to associate with lactase persistence. Counties with high/low IMR were defined based on IMR in 1939. The MAFs between 1942 and 1943 are statistically different at 0.05 level (*P = 0.018).
Fig. 5.Genetic correlation between birth year IMR and 50 complex traits. Dots and intervals indicate genetic correlation estimates and SEs. Significant correlations at a false discovery rate cutoff of 0.05 are highlighted with circles. ADHD: attention-deficit/hyperactivity disorder; LDL-C and HDL-C: low (high) density lipoprotein cholesterol.