| Literature DB >> 29581467 |
Timothy V Pyrkov1, Konstantin Slipensky2, Mikhail Barg2, Alexey Kondrashin2, Boris Zhurov3, Alexander Zenin3, Mikhail Pyatnitskiy3, Leonid Menshikov3, Sergei Markov2, Peter O Fedichev4,5.
Abstract
Age-related physiological changes in humans are linearly associated with age. Naturally, linear combinations of physiological measures trained to estimate chronological age have recently emerged as a practical way to quantify aging in the form of biological age. In this work, we used one-week long physical activity records from a 2003-2006 National Health and Nutrition Examination Survey (NHANES) to compare three increasingly accurate biological age models: the unsupervised Principal Components Analysis (PCA) score, a multivariate linear regression, and a state-of-the-art deep convolutional neural network (CNN). We found that the supervised approaches produce better chronological age estimations at the expense of a loss of the association between the aging acceleration and all-cause mortality. Consequently, we turned to the NHANES death register directly and introduced a novel way to train parametric proportional hazards models suitable for out-of-the-box implementation with any modern machine learning software. As a demonstration, we produced a separate deep CNN for mortality risks prediction that outperformed any of the biological age or a simple linear proportional hazards model. Altogether, our findings demonstrate the emerging potential of combined wearable sensors and deep learning technologies for applications involving continuous health risk monitoring and real-time feedback to patients and care providers.Entities:
Mesh:
Year: 2018 PMID: 29581467 PMCID: PMC5980076 DOI: 10.1038/s41598-018-23534-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Accuracy of age-predicting models. The biological age estimation according to deep convolutional neural network CNN_Age (A), the multivariate regression REG_Age (B), and the unsupervised PCA_Age (C) models. The solid lines and the transparent ± standard deviation bands are color coded as green, blue, and red, representing the whole population, the patients diagnosed with diabetes by a doctor, and individuals with self-reported high blood pressure, respectively. All the calculations were produced using the NHANES 2003–2006 cohort wearable accelerometers data, comprising one-week long activity tracks (1 min−1 sampling rate).
Figure 2Performance of the biological age models to distinguish between the longer- and the shorter-living individuals illustrated with Kaplan-Meier survival curves. Each participant was classified into the “high-” and the “low-” risks groups according to the deep convolutional neural network CNN_Age (A), the multivariate regression REG_Age (B), and the unsupervised PCA_Age (C) models. The p-values characterize the survival curves separation significance.
Cox-Gompertz parametric proportional hazards model parameters.
| Covariate | Hazard Ratio, 95% CI, p-value |
|---|---|
| age | 1.090, CI[1.082, 1.099] (p = 2e-109) |
| gender | 1.503, CI[1.281, 1.765] (2.4 yr., p = 6e-07) |
| diabetes | 1.690, CI[1.401, 2.038] (2.2 yr., p = 4e-08) |
| smoking | 2.300, CI[1.866, 2.836] (3.8 yr., p = 6e-15) |
| high blood pressure | 1.198, CI[1.018, 1.409] (1.0 yr., p = 3e-02) |
| log(activity) | 2.378, CI[2.051, 2.756] (4.9 yr., p = 1e-30) |
Association of the biological age- and all-cause mortality -predicting models with all-cause mortality.
| HRA Covariates | Age, Gender, Smoking, | Age, Gender, Smoking, |
|---|---|---|
| Model | Hazard Ratio, 95% CI, p-value | Hazard Ratio, 95% CI, p-value |
| PCA Age | 1.45, | 1.07, |
| REG Age | 1.31, | 1.01, |
| CNN Age | 1.05, | 0.94, |
| Hazard Weighted regr | 1.27, | 1.08, |
| Hazard CNN | 1.53, | 1.15, |
Two class of models were evaluated: one including the HRA parameters: age, gender, diabetes, smoking and hypertension as covariates (the left column) and the other including additionally the negative logarithm of the average daily physical activity (the biological age proxy, the right column). The corresponding 95% hazard ratio intervals are given along with the significance p-value and the effect levels, expressed in years of life lost (see the text for details).
Figure 3Architecture of convolutional neural network. The same network architecture was used to train both models for predicting age and for predicting mortality rates.
Figure 4Learning curves for the CNN age and hazard predictor models. Learning curves for age-predicting model CNN_Age are shown for each of 5 folds for the training (A) and test (B) subsets. Cross-validation learning curves (B) show a minimum of MSE at epochs 500–800, while further training results in model overfitting. Panels (C,D) show the learning curves for the mortality risk-predicting CNN model. Here some of the cross-validation learning curves show a minimum of MSE at epoch 3000. Learning curves are color-coded according to the fold number in the 5-fold training/cross-validation setup.