| Literature DB >> 36192713 |
Zahra Asghari Varzaneh1, Mostafa Shanbehzadeh2, Hadi Kazemi-Arpanahi3,4.
Abstract
BACKGROUND: Aging is a chief risk factor for most chronic illnesses and infirmities. The growth in the aged population increases medical costs, thus imposing a heavy financial burden on families and communities. Successful aging (SA) is a positive and qualitative view of aging. From a biomedical perspective, SA is defined as the absence of diseases or disability disorders. This is distinct from normal aging, which is associated with age-related deterioration in physical and cognitive functions. From a social perspective, SA highlights life satisfaction and individual well-being, usually attained through socialization. It is an abstract and multidimensional concept surrounded by imprecision about its definition and measurement. Our study attempted to find the most effective features of SA as defined by Rowe and Kahn's theory. The determined features were used as input parameters of six machine learning (ML) algorithms to create and validate predictive models for SA.Entities:
Keywords: Aged; Ensemble learning; Machine learning; Quality of life
Mesh:
Year: 2022 PMID: 36192713 PMCID: PMC9527392 DOI: 10.1186/s12911-022-02001-6
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 3.298
Fig. 1An overview of the proposed system
Fig. 2Structure of the confusion matrix
Definition of evaluation criteria
| Performance metrics | Definitions |
|---|---|
| Precision | TP/(TP + FP) |
| Specificity /true negative rate (TNR) | TN/(TN + FP) |
| Sensitivity/true positive rate (TPR) or recall | TP/(TP + FN) |
| Accuracy | (TP + TN)/(TP + TN + FP + FN) |
| F-measure | (2 × precision × recall)/(precision + recall) |
*True Positive (TP), True Negative (TN), False Positive (FP), False Negative (FN)
Hyperparameters for ML algorithms
| ML Algorithms | Hyperparameters |
|---|---|
| DT | min_samples_split = 2, min_samples_leaf = 1, max_features = none |
| SVM | C = 1, kernel = rbf, gamma = scale |
| NB | Alpha = 1.0, |
| ANN | Learning rate = 0.01, Number of Hidden Units = 2 |
| KNN | n_neighbors (K) = 5 |
Results of correlation of factors affecting SA
| No. | Class | Variable name | Variable type | Concept codes | Frequency | chi-square | |
|---|---|---|---|---|---|---|---|
| 1 | Socio-demographic | Age (years) | Polynomial | 1 (65–69) 2 (70–79) 3 (> 80) | 1 (413) 2 (283) 3 (287) | 0.073 | 0.036 |
| 2 | Sex | Binominal | 1 (Female) 2 (Male) | 1 (422) 2 (561) | 0.158 | < 0.01 | |
| 3 | Educational level | Polynomial | 1 (No literacy) 2 (Elementary) 3 (Diploma) 4 (Academic) | 1 (154) 2 (297) 3 (445) 4 (87) | 0.60 | 0.206 | |
| 4 | Marital status | Polynomial | 1 (married) 2 (single) 3 (divorced) 4 (widowed) | 1 (574) 2 (89) 3 (18) 4 (302) | 0.036 | 0.651 | |
| 5 | Occupation | Polynomial | 1 (no job) 2 (housekeeper) 3 (retired) 4 (self-employment) | 1 (112) 2 (324) 3 (443) 4 (104) | 0.068 | 0.117 | |
| 6 | Income level | Binominal | 1 (under the poverty line) 2 (on poverty line) | 1 (521) 2 (426) | 0.138 | < 0.01 | |
| 7 | Insurance status | Binominal | 1 (Have) 2 (Haven’t) | 1 (623) 2 (360) | 0.005 | 0.853 | |
| 8 | Disease comorbidities | Hypertension | Binomial | 1 (Yes) 2 (No) | 1 (714) 2 (269) | 0.275 | < 0.01 |
| 9 | Cardiovascular accident (CVA) | Binomial | 1 (Yes) 2 (No) | 1 (268) 2 (715) | 0.249 | < 0.01 | |
| 10 | Bone disease | Binomial | 1 (Yes) 2 (No) | 1 (399) 2 (584) | 0.070 | 0.013 | |
| 11 | Renal disease | Binomial | 1 (Yes) 2 (No) | 1 (264) 2 (719) | 0.027 | 0.344 | |
| 12 | Liver disease | Binomial | 1 (Yes) 2 (No) | 1 (120) 2 (863) | 0.054 | 0.057 | |
| 13 | Muscle disease | Binomial | 1 (Yes) 2 (No) | 1 (335) 2 (648) | 0.116 | < 0.01 | |
| 14 | Depression | Binomial | 1 (Yes) 2 (No) | 1 (388) 2 (595) | 0.74 | < 0.01 | |
| 15 | Convalescences | Binomial | 1 (Yes) 2 (No) | 1 (170) 2 (813) | 0.058 | 0.038 | |
| 16 | Eye disease | Binomial | 1 (Yes) 2 (No) | 1 (460) 2 (523) | 0.017 | 0.540 | |
| 17 | Diabetes | Binomial | 1 (Yes) 2 (No) | 1 (722) 2 (743) | 0.264 | < 0.01 | |
| 18 | Cancer | Binomial | 1 (Yes) 2 (No) | 1 (88) 2 (895) | 0.107 | < 0.01 | |
| 19 | Other diseases | Binomial | 1 (Yes) 2 (No) | 1 (456) 2 (527) | 0.025 | 0.678 | |
| 20 | Behavioral and psychosocial factors | Tension management | Binomial | 1 (Yes) 2 (No) | 1 (560) 2 (423) | 0.063 | 0.025 |
| 21 | Social and interpersonal relationships | Binomial | 1 (Weak) 2 (Strong) | 9 (399) 10 (584) | 0.062 | 0.028 | |
| 22 | Life satisfaction | Binomial | 1 (Pleasant) 2 (Unpleasant) | 1 (466) 2 (517) | 0.138 | < 0.01 | |
| 23 | Healthy lifestyle | Polynomial | 1 (Low) 2 (Medium) 3 (High) | 1 (377) 2 (538) 3 (68) | 0.075 | 0.029 | |
| 24 | Nutritional status | Binomial | 1 (Bad) 2 (Good) | 1 (351) 2 (632) | 0.132 | < 0.01 | |
| 25 | Ability to perform daily activities | Binomial | 1 (dependence) 2 (independence) | 1 (471) 2 (512) | 0.149 | < 0.01 | |
| 26 | Quality of life | Binomial | 3 (Low) 4 (High) | 1 (465) 2 (518) | 0.205 | < 0.01 | |
| 27 | Physical activity | Binomial | 1 (Low) 2 (High) | 1 (192) 2 (791) | 0.162 | < 0.01 | |
| 28 | Disease prevention activities | Binomial | 1 (Have) 2 (Haven't) | 1 (472) 2 (511) | 0.070 | 0.013 |
Results of univariant regression
| No. | Class | Variable name | Concept codes | Odd ratio (OR) | CI (confidence interval) | |
|---|---|---|---|---|---|---|
| 1 | Socio–demographic | Age (years) | 1 (65–69) | 1.000 | – | – |
| 2 (70–79) | 0.658 | 0.236–0.785 | < 0.001 | |||
| 3 (> 80) | 0.358 | 0.125–0.527 | 0.006 | |||
| 2 | Sex | 1 (Female) | 1.000 | – | – | |
| 2 (Male) | 0.435 | 0.217–0.784 | < 0.001 | |||
| 3 | Income level | 1 (under the poverty line) | 1.000 | – | – | |
| 2 (on poverty line) | 1.365 | 1.215–1.606 | 0.003 | |||
| 4 | Insurance status | 1 (Have) | 1.000 | – | – | |
| 2 (Haven’t) | 0.635 | 0.487–0.963 | 0.013 | |||
| 5 | Disease comorbidities | Hypertension | 1 (Yes) | 1.000 | – | – |
| 2 (No) | 0.321 | 0.221–0.425 | < 0.001 | |||
| 6 | Cardiovascular accident (CVA) | 1 (Yes) | 1.000 | – | – | |
| 2 (No) | 1.369 | 1.023–1.409 | 0.005 | |||
| 7 | Bone disease | 1 (Yes) | 1.000 | – | – | |
| 2 (No) | 1.458 | 1.369–1.548 | < 0.001 | |||
| 8 | Muscle disease | 1 (Yes) | 1.000 | – | – | |
| 2 (No) | 1.745 | 1.678–1.978 | < 0.001 | |||
| 9 | Depression | 1 (Yes) | 1.000 | – | – | |
| 2 (No) | 2.360 | 2.023–2.415 | 0.018 | |||
| 10 | Convalescences | 1 (Yes) | 1.000 | – | – | |
| 2 (No) | 0.368 | 0.234–0.478 | < 0.001 | |||
| 11 | Diabetes | 1 (Yes) | 1.000 | – | – | |
| 2 (No) | 1.369 | 1.023–1.409 | 0.005 | |||
| 12 | Cancer | 1 (Yes) | 1.000 | – | – | |
| 2 (No) | 2.364 | 1.998–2.419 | < 0.001 | |||
| 13 | Behavioral and psychosocial factors | Tension management | 1 (Yes) | 1.000 | – | – |
| 2 (No) | 0.425 | 0.365–0.574 | < 0.001 | |||
| 14 | Social and interpersonal relationships | 1 (Weak) | 1.000 | – | – | |
| 2 (Strong) | 1.785 | 1.561–1.941 | < 0.001 | |||
| 15 | Life satisfaction | 1 (Pleasant) | 1.000 | – | – | |
| 2 (Unpleasant) | 0.369 | 0.236–0.578 | < 0.001 | |||
| 16 | Healthy lifestyle | 1 (Low) | 1 | – | – | |
| 2 (Medium) | 1.236 | 1.010–1.368 | 0.029 | |||
| 3 (High) | 2.360 | 2.263–2.684 | 0.001 | |||
| 17 | Nutritional status | 1 (Bad) | 1.000 | – | – | |
| 2 (Good) | 1.466 | 1.231–1.785 | < 0.001 | |||
| 18 | Ability to perform daily activities | 1 (dependence) | 1.000 | – | – | |
| 2 (independence) | 2.365 | 2.020–2.474 | < 0.001 | |||
| 19 | Quality of life | 3 (Low) | 1.000 | – | – | |
| 4 (High) | 1.386 | 1.120–1.585 | < 0.001 | |||
| 20 | Physical activity | 1 (Low) | 1.000 | – | – | |
| 2 (High) | 1.365 | 1.120–1.485 | < 0.001 | |||
| 21 | Disease prevention activities | 1 (Have) | 1.000 | – | – | |
| 2 (Haven't) | 0.368 | 0.234–0.478 | < 0.001 |
The result of evaluating the efficiency of ML models
| Model | Precision | Recall | Specificity | F-measure | Accuracy | AUC |
|---|---|---|---|---|---|---|
| DT | 74 | 73.2 | 73.7 | 73.6 | 73.5 | 80 |
| 95% CI | (0.72, 0.752) | (0.725,0.74) | (0.719,0.75.2) | (0.71,0.747) | (0.71,0.759) | (0.79, 0.813) |
| Stanandard deviation (SD) | 0.041 | 0.013 | 0.029 | 0.038 | 0.0217 | 0.029 |
| SVM | 78 | 97.5 | 81.6 | 86.3 | 85.1 | 95 |
| 95% CI | (0.763, 0.791) | (0.763,0.78) | (0.792,0.83) | (0.839,0.884) | (0.845,0.87) | (0.937, 0.961) |
| SD | 0.035 | 0.019 | 0.035 | 0.102 | 0.042 | 0.018 |
| NB | 65 | 70.6 | 67.6 | 67.6 | 68.6 | 74 |
| 95% CI | (0.631, 0.67) | (0.69,0.713) | (0.659,0.692) | (0.658,0.685) | (0.669,0.69) | (0.725, 0.763) |
| SD | 0.024 | 0.022 | 0.030 | 0.0173 | 0.015 | 0.037 |
| ANN | 48 | 64.7 | 77.2 | 89.3 | 77.1 | 78.2 |
| 95% CI | (0.462, 0.499) | (0.615,0.67) | (0.764,0.788) | (0.883,0.907) | (0.759,0.78) | (0.761, 0.793) |
| SD | 0.06 | 0.057 | 0.035 | 0.041 | 0.012 | 0.02 |
| KNN | 90 | 72.1 | 86.6 | 80 | 73.3 | 91 |
| 95% CI | (0.886, 0.914) | (0.715,0.73) | (0.852,0.887) | (0.780,0.817) | (0.715,0.74) | (0.89, 0.925) |
| SD | 0.048 | 0.019 | 0.051 | 0.0227 | 0.018 | 0.012 |
| Ensemble 1 (KNN) | 93 | 87.8 | 92.4 | 90.3 | 89.6 | 96 |
| 95% CI | (0.917, 0.941) | (0.86,0.893) | (0.919,0.941) | (0.89,0.917) | (0.874,0.91) | (0.951, 0.973) |
| SD | 0.03 | 0.024 | 0.0107 | 0.0162 | 0.052 | 0.027 |
| Ensemble 2 (Bag Tree) | 82 | 86.3 | 82.8 | 85.8 | 84.4 | 90 |
| 95% CI | (0.802, 0.841) | (0.851,0.87) | (0.812,0.845) | (0.832,0.871) | (0.83,0.861) | (0.891, 0.817) |
| SD | 0.03 | 0.012 | 0.031 | 0.026 | 0.039 | 0.032 |
Fig. 3An overview of the comparison of ML algorithms
Fig. 4Confusion matrix of classifiers (0 indicates a SA class and 1 indicates none-SA class.)
Fig. 5ROC for ML algorithms