| Literature DB >> 32678081 |
Peter K Joshi1, Joris Deelen2,3, Paul R H J Timmers4, James F Wilson5,6.
Abstract
Ageing phenotypes, such as years lived in good health (healthspan), total years lived (lifespan), and survival until an exceptional old age (longevity), are of interest to us all but require exceptionally large sample sizes to study genetically. Here we combine existing genome-wide association summary statistics for healthspan, parental lifespan, and longevity in a multivariate framework, increasing statistical power, and identify 10 genomic loci which influence all three phenotypes, of which five (near FOXO3, SLC4A7, LINC02513, ZW10, and FGD6) have not been reported previously at genome-wide significance. The majority of these 10 loci are associated with cardiovascular disease and some affect the expression of genes known to change their activity with age. In total, we implicate 78 genes, and find these to be enriched for ageing pathways previously highlighted in model organisms, such as the response to DNA damage, apoptosis, and homeostasis. Finally, we identify a pathway worthy of further study: haem metabolism.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32678081 PMCID: PMC7366647 DOI: 10.1038/s41467-020-17312-3
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Healthspan, parental lifespan, and longevity are highly genetically correlated.
a Pairwise genetic correlation between human ageing studies. b Genetic correlations of age-stratified parental lifespan against healthspan and longevity. c Genetic correlations (rg) of ageing traits with traits related to development, behaviour, and disease. In bold are traits with heterogeneous correlations (Phet < 0.05). Displayed here are 17 traits which have at least one significant (FDR < 5%) genetic correlation with healthspan, parental lifespan, or longevity, out of the 27 traits tested. The 17 traits are clustered by Euclidean distance based on their genetic correlation with all tested traits (30 in total). See Supplementary Data 1 for a full list of correlations and Supplementary Table 1 for the number of SNPs used to calculate each pairwise correlation. Blank squares represent correlations which did not pass multiple testing correction. Note that fewer correlations with longevity will pass this threshold due to the smaller sample size of this GWAS. Error bars represent 95% confidence intervals of the correlation estimates. COPD: chronic obstructive pulmonary disease.
Fig. 2Twenty-four multivariate loci identified at genome-wide significance.
Manhattan plot showing the nominal strength of association −log10(P value) (two-sided) on the y-axis against the chromosomal position of SNPs on the x-axis, where the null hypothesis is no association with healthspan, parental lifespan, and longevity. The red line represents the genome-wide significance threshold (5 × 10–8). Annotated are the nearest gene(s) to the lead SNP (in red) of each locus. The y-axis has been capped at 5 × 10−30 to aid legibility; SNPs passing this cap are represented as triangles: LPA, P = 3.8 × 10−30, APOE, P = 9.6 × 10−127.
Ten loci act across all three ageing traits, reaching nominal significance in each dataset.
| Nearest Gene | rsID | A1 | Freq1 | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| rs10455872 | A | 0.93 | 0.057 (0.009) | 1E−10 | 0.076 (0.007) | 9E−25 | 0.124 (0.045) | 7E−03 | 4E−30 | |
| rs7859727 | C | 0.51 | 0.031 (0.005) | 3E−10 | 0.025 (0.004) | 1E−10 | 0.066 (0.019) | 6E−04 | 4E−18 | |
| rs4783780 | A | 0.53 | 0.023 (0.005) | 2E−06 | 0.014 (0.004) | 3E−04 | 0.052 (0.019) | 6E−03 | 1E−08 | |
| rs6511720 | T | 0.12 | 0.015 (0.007) | 4E−02 | 0.034 (0.006) | 2E−08 | 0.093 (0.030) | 2E−03 | 4E−09 | |
| rs429358 | T | 0.85 | 0.014 (0.007) | 4E−02 | 0.106 (0.005) | 3E−83 | 0.510 (0.032) | 1E−56 | 1E−126 |
β effect size of the A1 allele with the standard error in parentheses.
For healthspan and parental lifespan units are the negative log of the hazard ratio, for longevity this is the log odds of reaching an exceptional old age (90th percentile).
In bold are loci which contain SNPs that are not reported at genome-wide significance in any individual dataset. The remaining loci contain one or more genome-wide significant SNPs within 500 kb of the lead SNP in one of the individual datasets (Supplementary Data 4).
Nearest gene: gene closest to the index SNP, rsID the SNP with the lowest P value in the multivariate analysis, A1 the effect allele, increasing healthspan, parental lifespan, and odds to become long-lived, Freq1 frequency of the A1 allele. P P value of the trait association. For MANOVA, this is the P value against the null hypothesis of association with neither healthspan, parental lifespan, nor longevity.
eQTL for 78 genes colocalise with the GWAS signal at 9 out of 10 loci of interest.
| Locus | Chr | Position | ||
|---|---|---|---|---|
| 3 | 27562988 | |||
| 4 | 38385479 | |||
| 6 | 109006838 | |||
| 6 | 161010118 | |||
| 9 | 22102165 | |||
| 11 | 113639842 | |||
| 12 | 95580818 | |||
| 19 | 11202306 | |||
| 19 | 45411941 |
Genes which showed a significant effect (FDR < 5%) of gene expression on ageing traits are displayed here.
Gene names are annotated with the direction of effect, where + and – indicate whether the life-extending association of the locus is linked with higher or lower gene expression, respectively.
Locus: nearest gene to lead variant in the multivariate analysis, Chr: chromosome, Position: base-pair position of lead variant (GRCh37), Cis-genes: genes in physical proximity (<500 kb) to the lead variant of the locus which colocalise with the multivariate signal, Trans-genes: genes located more than 500 kb from the lead variant of the locus.
Fig. 3Seven hallmark gene pathways are enriched for ageing-related genes.
N number of genes of interest vs. total number of genes in the gene set for which eQTL are available. P Nominal P value of the hypergeometric test for enrichment (against 24,670 background genes). Pbonf Bonferroni-corrected P value for testing seven hallmark pathways (containing at least three genes). The figure shows individual genes on the x-axis and hallmark pathways are listed on the y-axis, matching the order of the table. Squares represent the presence of a gene in the gene set.
Multivariate MR of iron-related traits on healthspan, parental lifespan, and longevity shows a protective effect for transferrin and a deleterious effect for serum iron.
| Exposure | SE | ||||||
|---|---|---|---|---|---|---|---|
| Serum iron | −0.79 | 0.242 | 1E−03 | 4E−03 | −1.10 (0.58) | −1.17 (0.63) | −5.07 (2.42) |
| Transferrin saturation | 0.80 | 0.252 | 1E−03 | 4E−03 | 1.11 (0.61) | 1.16 (0.66) | 5.15 (2.52) |
| Transferrin | 0.32 | 0.100 | 2E−03 | 4E−03 | 0.48 (0.24) | 0.46 (0.26) | 2.02 (1.00) |
| Ferritin | −0.01 | 0.024 | 0.5380 | 1.0000 | 0.13 (0.06) | −0.02 (0.06) | −0.26 (0.24) |
The effects of 15 SNPs genome-wide significant for one or more iron-related traits were tested against the effects of our GWAS meta-analysis and individual healthspan, parental lifespan, and longevity GWAS in an inverse variance-weighted regression.
Coefficients are derived from a model with a fixed regression intercept, as a sensitivity analysis showed a non-significant intercept centred around zero for all traits (Pintercept ≥ 0.76). Although the causal effect sizes appear large, in practice, homeostatic effects prevent large variation in one of the exposures independent of the others.
βMR: the causal effect of one standard deviation increase in the exposure on the healthspan/parental lifespan/longevity meta-analysis (in standard deviation units), conditional on the other exposures, P: nominal P value for the MR effect, Padj: multiple testing-corrected P value, βhealthspan, βlifespan, βlongevity: the conditional effect of one standard deviation increase in the exposure on healthspan (in -logHR units), parental lifespan (in -logHR units), or longevity (in logOR units), with the standard error reported in parentheses.