| Literature DB >> 31746094 |
Bryan J Bunning1, Kévin Contrepois2, Brittany Lee-McMullen2, Gopal Krishna R Dhondalay1, Wenming Zhang1, Dana Tupa1, Olivia Raeber1, Manisha Desai3, Kari C Nadeau1, Michael P Snyder2, Sandra Andorf1.
Abstract
Aging is intimately linked to system-wide metabolic changes that can be captured in blood. Understanding biological processes of aging in humans could help maintain a healthy aging trajectory and promote longevity. We performed untargeted plasma metabolomics quantifying 770 metabolites on a cross-sectional cohort of 268 healthy individuals including 125 twin pairs covering human lifespan (from 6 months to 82 years). Unsupervised clustering of metabolic profiles revealed 6 main aging trajectories throughout life that were associated with key metabolic pathways such as progestin steroids, xanthine metabolism, and long-chain fatty acids. A random forest (RF) model was successful to predict age in adult subjects (≥16 years) using 52 metabolites (R2 = .97). Another RF model selected 54 metabolites to classify pediatric and adult participants (out-of-bag error = 8.58%). These RF models in combination with correlation network analysis were used to explore biological processes of healthy aging. The models highlighted established metabolites, like steroids, amino acids, and free fatty acids as well as novel metabolites and pathways. Finally, we show that metabolic profiles of twins become more dissimilar with age which provides insights into nongenetic age-related variability in metabolic profiles in response to environmental exposure.Entities:
Keywords: LC-MS; aging; machine learning; metabolomics; random forest; twins
Mesh:
Year: 2019 PMID: 31746094 PMCID: PMC6974708 DOI: 10.1111/acel.13073
Source DB: PubMed Journal: Aging Cell ISSN: 1474-9718 Impact factor: 9.304
Figure 1Untargeted metabolomics of aging plasma. (a) Natural log metabolite abundances of all 770 detected metabolites (rows) in 268 individuals (columns) ranging from low (blue) to high (red). Metabolites are ordered by metabolic class and median intensity across all the samples in the study, and individuals are ordered chronologically by age. (b) Principal component analysis using the top two principal components associated with age (i.e., PC4 and PC5). Association with age is calculated via linear modeling of each PC
Cohort demographics
| <16 years | ≥16 years | Total | |
|---|---|---|---|
| Individuals (twin sets) | 53 (25) | 215 | 268 (125) |
| Age in years (mean, | 7.0 ± 4.1 | 42.5 ± 17.3 | 35.5 ± 21.1 |
| Sex % female ( | 45.3% (24) | 69.8% (150) | 64.9% (174) |
| BMI (mean, | 16.0 ± 2.7 | 27.0 ± 6.3 | 24.9 ± 7.2 |
| Monozygotic individuals % of twins ( | 64.0% (32) | 91.0% (182) | 85.6% (214) |
12 Singletons, 1 set of triplets.
Zygosity unknown for 1 twin pair.
Figure 2Metabolic aging trajectories. Fuzzy c‐mean clustering of all 770 metabolite abundances fitted to a loess curve and Z‐score scaled (black lines), adjusted for sex and BMI, as a function of age in years. Average trend of clusters is shown as a red line. Pathways with FDR < 0.1 are considered significant and displayed
Figure 3Regression model Random Forest analysis. (a) Significant metabolites in the regression RF model ordered by importance and color‐coded by cluster. (b) 2D scatter plot representing predicted age vs. actual age. MSE = mean squared error. (c) Correlation network analysis of significant metabolites in the model. Nodes are colored by metabolic class and sized by betweenness centrality. Edges are colored by association direction
Figure 4Classification model Random Forest analysis. (a) Significant metabolites in the classification RF model ordered by importance and color‐coded by cluster. (b) Two‐way table of the classification model. (c) Correlation network analysis of significant metabolites in the model. Nodes are colored by metabolic class and sized by betweenness centrality. Edges are colored by association direction
Figure 5Variability of metabolic profiles in twins. (a) Spearman's correlation of the whole collection of metabolites between various pairs of individuals. p‐value shown was calculated via Kruskal–Wallis test. (b) Spearman's correlation plotted as a function of age shows a significant decreasing trend over time in twins