| Literature DB >> 33031577 |
Benoit Lehallier1,2,3, Maxim N Shokhirev4, Tony Wyss-Coray1,2,3,5, Adiv A Johnson6.
Abstract
We previously identified 529 proteins that had been reported by multiple different studies to change their expression level with age in human plasma. In the present study, we measured the q-value and age coefficient of these proteins in a plasma proteomic dataset derived from 4263 individuals. A bioinformatics enrichment analysis of proteins that significantly trend toward increased expression with age strongly implicated diverse inflammatory processes. A literature search revealed that at least 64 of these 529 proteins are capable of regulating life span in an animal model. Nine of these proteins (AKT2, GDF11, GDF15, GHR, NAMPT, PAPPA, PLAU, PTEN, and SHC1) significantly extend life span when manipulated in mice or fish. By performing machine-learning modeling in a plasma proteomic dataset derived from 3301 individuals, we discover an ultra-predictive aging clock comprised of 491 protein entries. The Pearson correlation for this clock was 0.98 in the learning set and 0.96 in the test set while the median absolute error was 1.84 years in the learning set and 2.44 years in the test set. Using this clock, we demonstrate that aerobic-exercised trained individuals have a younger predicted age than physically sedentary subjects. By testing clocks associated with 1565 different Reactome pathways, we also show that proteins associated with signal transduction or the immune system are especially capable of predicting human age. We additionally generate a multitude of age predictors that reflect different aspects of aging. For example, a clock comprised of proteins that regulate life span in animal models accurately predicts age.Entities:
Keywords: age-related disease; aging; aging clock; health span; life span; longevity
Year: 2020 PMID: 33031577 PMCID: PMC7681068 DOI: 10.1111/acel.13256
Source DB: PubMed Journal: Aging Cell ISSN: 1474-9718 Impact factor: 9.304
Figure 1529 proteins that were previously identified to change their expression level with age in human plasma were analyzed in a large, proteomic dataset derived from 4263 healthy individuals with an age range of 18–95 years. The six proteins that exhibited the most significant change in plasma expression level with age were CGA.FSHB (a), SOST (b), GDF15 (c), MLN (d), RET (e), and PTN (f). The expression trend over time is visually shown for each protein. RFU = relative fluorescent unit
20 examples of common aging plasma proteins with highly intriguing links to aging and/or disease
| Protein |
| Intriguing connections to aging and/or disease |
|---|---|---|
| ADAMTS5 | 7.69E−65, 1.88E−03 |
Mice lacking
Wwp2 promotes the maintenance of cartilage homeostasis via the suppression of Adamts5 in mice (Mokuda et al., |
| BDNF | 2.78E−30, 2.84E−03 |
Treating Huntington's disease mice with human mesenchymal stem cells that overexpress BDNF extends life span and increases neurogenesis‐like activity (Pollock et al., Exercise elevates BDNF levels and induces adult hippocampal neurogenesis in Alzheimer's disease mice (S. H. Choi et al., In a zebrafish model of Alzheimer's disease, BDNF enhances neurogenesis and neural stem cell plasticity (Bhattarai et al., |
| CCL11 | 8.87E−94, 3.34E−03 |
In a cohort of non‐diabetic women, plasma levels of CCL11 are associated with central obesity and are reduced in response to an exercise program (Choi et al., Injecting recombinant Ccl11 into young mice reduces neurogenesis and impairs both memory and learning (Villeda et al., Administering recombinant Ccl11 to young mice results in synaptic loss and increased microglial reactivity (Das et al., |
| CGA.FSHB | 2.89E−320, 1.64E−02 |
Long‐lived mice deficient in growth hormone receptor exhibit decreased plasma levels of follicle‐stimulating hormone (V. Chandrashekar et al., Bone loss is mitigated in ovariectomized mice treated with an antibody specific to the β‐subunit of follicle‐stimulating hormone (Zhu et al., An antibody specific to the β‐subunit of follicle‐stimulating hormone decreases body fat, stimulates brown adipose tissue, and promotes thermogenesis in mice (Liu et al., |
| FGA.FGB.FGG | 8.38E−11, 7.25E−04 |
Treating mice with fibrinogen causes demyelination via the induction of adaptive immune responses and the recruitment of peripheral macrophages (Ryu et al., Inhibiting fibrin with the monoclonal antibody 5B8 attenuates neurodegeneration and innate immunity in mouse models of multiple sclerosis and Alzheimer's disease (Ryu et al., In Alzheimer's disease mice, genetically deleting a binding motif in fibrinogen reduces neuroinflammation and cognitive decline (Merlini et al., |
| IL15RA | 1.31E−43, 1.57E−03 |
Mice lacking Fast skeletal muscles in
|
| IL6 | 4.13E−05, 7.16E−04 |
The ability to ward off bacterial or viral infection is impaired in Genetically disrupting Transgenic mice overexpressing human |
| LIFR | 5.43E−08, −6.27E−04 |
Increasing the expression of Inoculating mice with breast cancer cells lacking Mouse Lifr contains separate protein domains that either maintain stem cell self‐renewal or induce differentiation (X. J. Wang et al., |
| LILRB2 | 9.22E−21, 1.07E−03 |
The genetic deletion of Small molecule inhibitors targeting the binding site of LILRB2 disrupt LILRB2‐Aβ interactions and reduce Aβ cytotoxicity (Cao et al., The anti‐tumor effects of T‐cell immune checkpoint inhibitors are enhanced by the blockade of LILRB2 (Chen et al., |
| MMP12 | 2.53E−92, 3.64E−03 |
A single nucleotide polymorphism in Large artery atherosclerosis is associated with a genetic variant in the In mice deficient in |
| NAB1 | 1.14E−26, −2.01E−03 |
NAB1 is upregulated in human heart failure and mice overexpressing In dogs with moderate heart failure, treatment with rosuvastatin reduces the expression of NAB1 in left ventricular tissue (Zaca et al., A single nucleotide polymorphism in |
| NTN1 | 2.09E−50, 2.32E−03 |
Overexpressing In mice lacking the low‐density lipoprotein receptor, deleting In a mouse model of obesity, the hematopoietic deletion of |
| PAK4 | 2.47E−04, 9.28E−04 |
Knocking down Overexpressing or depleting Growth is suppressed and invasive potential is decreased by the inhibition of |
| PLA2G2A | 1.56E−03, 7.11E−04 |
The size and multiplicity of intestinal tumors are reduced in mice overexpressing The expression of In |
| PLXNB2 | 9.33E−40, 1.17E−03 |
Inhibiting PLXNB2 suppresses the development of xenograft tumors in mice (Yu et al., Inhibiting PLXNB2 makes prostate cancer stem cells more sensitive to chemotherapy (Li et al., Motor sensory recovery following spinal cord injury is impaired in mice lacking |
| POMC | 1.53E−07, 9.34E−04 |
Mutations in Blocking the expression of In obese patients with defects in |
| PRKAA1.PRKAB1.PRKAG1 | 4.11E−02, 3.24E−04 |
Worms constitutively expressing Ampk elevates cellular NAD+ levels and enhances the activity of Sirt1 in mouse skeletal muscle (Canto et al., Overexpressing |
| RBM3 | 6.61E−20, 2.21E−03 |
Cold stress increases the expression level of Overexpressing In response to hypoxic ischemia, Rbm3 promotes the proliferation of neural stem/progenitor cells in the subgranular zone (X. Zhu et al., |
| SIRT5 | 9.61E−10, 8.53E−04 |
Creating a Knocking out Mice deficient in |
| UFM1 | 2.51E−03, 5.82E−04 |
Deletion mutations that affect the ufm‐1 cascade result in reduced fecundity and life span in worms (Hertel et al., RNAi knockdown against A homozygous mutation in |
For each protein, the q‐value and age coefficient (measured in a human proteomic dataset derived from 4263 individuals aged 18–95 years) as well as three relevant connections to aging and/or disease are provided.
Examples of common aging plasma proteins that can significantly extend life span in a vertebrate animal model when manipulated
| Protein |
| Life span effect |
|---|---|---|
| AKT2 | 1.61E−16, 1.04E−03 | Mice deficient in |
| GDF11 | 1.92E−02, −7.20E−04 | In killifish, levels of gdf11 decrease with age and treating aged animals with recombinant gdf11 lengthens |
| GDF15 | 1.71E−249, 5.26E−03 | The overexpression of human GDF15 in female mice extends |
| GHR | 7.56E−24, −1.53E−03 |
|
| NAMPT | 5.39E−04, 1.12E−03 | Wheel‐running activity is enhanced and longevity is boosted ( |
| PAPPA | 9.29E−05, 8.09E−04 | The incidence of spontaneous tumors is reduced and life is prolonged ( |
| PLAU | 6.46E−11, 8.67E−04 | Overexpressing |
| PTEN | 2.41E−02, 4.06E−04 | Longevity is enhanced ( |
| SHC1 | 7.18E−04, 8.53E−04 |
|
For each protein, the q‐value and age coefficient (measured in a human proteomic dataset derived from 4263 individuals aged 18–95 years) as well as the life span effect are included. Bolded words and numbers highlight the lifespan effect in response to a given intervention.
A follow‐up study assessed life span in Shc1 knockout mice at two different locations. At one location, Shc1 −/− mice on a 40% calorie restriction diet exhibited a survival benefit (median 70th percentile survival was increased by 8%). At the other site, no longevity benefit was observed in Shc1 knockout mice fed ad libitum (Ramsey et al., 2014).
Figure 2An overrepresentation analysis in the Gene Ontology Biological Process database was performed for all proteins that significantly (q < 0.05) change their expression level with age in human plasma and have a positive age coefficient. The top 30 enrichment results are presented as –log10(fdr)
Figure 3The ability of 13 different protein sets to predict age in a plasma proteomic dataset derived from 3301 human participants (age range of 18–76 years) was tested using machine learning. For each clock, the learning set utilized 2178 subjects and the test set utilized 1123 subjects. LASSO modeling was also performed for each clock to determine if a smaller set of proteins within the larger set could accurately predict human age. For each of these clocks, the Pearson correlation (a) and median absolute error (b) are reported. The two numbers in parenthesis for each clock indicate the number of available SOMAmers used for the subset of proteins identified by LASSO modeling or the full list of proteins
Figure 4Plots of predicted age vs. chronological age are shown for the most predictive aging clock identified. The most accurate aging clock was identified by LASSO modeling of all 2978 proteins available for measurement in the plasma proteomic dataset derived from 3301 human participants (age range of 18–76 years). This clock used 491 SOMAmers, had a Pearson correlation of 0.98 in the learning set (a), a median absolute error of 1.84 years in the learning set (a), a Pearson correlation of 0.96 in the test set (b), and a median absolute error of 2.44 years in the test set (b). 2178 subjects were utilized for the learning set (a) and 1123 subjects were utilized for the test set (b). MAE = median absolute error
Figure 5We used our ultra‐predictive aging clock to predict age in a human plasma proteomic dataset containing sedentary subjects as well as individuals that are aerobic exercise‐trained. For sedentary subjects, their respective chronological and predicted ages were 37.54 ± 20.88 and 46.34 ± 26.48 years. For aerobic exercise‐trained subjects, their respective chronological and predicted ages were 37.35 ± 19.82 and 40.91 ± 18.48 years. Results are presented as mean ± standard deviation. The difference in delta age (i.e., the difference between chronological and predicted age) between sedentary and aerobic exercise‐trained subjects was statistically significant (p‐value = 6.7E−5)
Figure 6The ability of 1565 protein sets associated with different Reactome pathways to predict age in a plasma proteomic dataset derived from 3301 human participants (age range of 18–76 years) was tested using machine learning. For each clock, the learning set utilized 2178 subjects and the test set utilized 1123 subjects. LASSO modeling was also performed for each clock to determine if a smaller set of proteins within the larger set could more accurately predict human age. We visualize the Pearson correlation (a) for the 19 pathways with the highest Pearson correlation. We also visualize the median absolute error (b) for the 19 pathways with the lowest median absolute error. The two numbers in parenthesis for each clock indicate the number of available SOMAmers used for the subset of proteins identified by LASSO modeling or the full list of proteins. The full name of the pathway abbreviated with ellipses is “Regulation of insulin‐like growth factor (IGF) transport and uptake by insulin‐like growth factor binding proteins (IGFBPs)”