| Literature DB >> 34831575 |
Myeounggon Lee1, Yoonjae Noh2, Changhong Youm3, Sangjin Kim2, Hwayoung Park3, Byungjoo Noh4, Bohyun Kim3, Hyejin Choi3, Hyemin Yoon2.
Abstract
The elderly population in South Korea accounted for 15.5% of the total population in 2019. Thus, it is important to study the various elements governing the process of healthy aging. Therefore, this study investigated multiple prediction models to determine the health-related quality of life (HRQoL) in elderly adults based on the demographics, questionnaires, gait ability, and physical fitness. We performed eight physical fitness tests on 775 participants wearing shoe-type inertial measurement units and completing walking tasks at slower, preferred, and faster speeds. The HRQoL for physical and mental components was evaluated using a 36-item, short-form health survey. The prediction models based on multiple linear regression with feature importance were analyzed considering the best physical and mental components. We used 11 variables and 5 variables to form the best subset of features underlying the physical and mental components, respectively. We laid particular emphasis on evaluating the functional endurance, muscle strength, stress level, and falling risk. Furthermore, stress, insomnia severity, number of diseases, lower body strength, and fear of falling were taken into consideration in addition to mental-health-related variables. Thus, the study findings provide reliable and objective results to improve the understanding of HRQoL in elderly adults.Entities:
Keywords: elderly adults; health-related quality of life; machine learning; physical and mental components; prediction model
Mesh:
Year: 2021 PMID: 34831575 PMCID: PMC8624167 DOI: 10.3390/ijerph182211816
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Physical fitness test.
Figure 2Scheme of the study.
Figure 3Average root mean square error (aRMSE) based on the cumulative number of features (CNoF) of the six categorized models. The y-axis indicates the aRMSE of the model; the x-axis indicates the level of the CNoF sorted by the ranking of feature selection methods. Each line graph indicates one of the six categorized models that is identical to the legend. LR_PV, multiple linear regression (LR) model with feature rank based on p-value (PV); LR_FI, multiple LR model with feature rank based on feature importance (FI); RF_PV, random forest (RF) regression with feature rank based on PV; RF_FI, RF regression with feature rank based on FI; SVM_PV, support vector machine (SVM) regression of radial basis function kernel with feature rank based on PV; SVM_PV, SVM regression of radial basis function kernel with feature rank based on FI.
Statistics of the generated models which minimizes the average root mean square error (aRMSE) in both physical component and mental component.
| Component | Model | CNoF | aRMSE | Std RMSE | Min RMSE | RMSE 25% | RMSE 50% | RMSE 75% | Max RMSE |
|---|---|---|---|---|---|---|---|---|---|
| Physical | LR_PV | 33 | 13.92776 | 0.68349 | 11.96545 | 13.50805 | 13.95991 | 14.44462 | 15.38411 |
| (a) LR_FI | 15 | 13.76088 | 0.71617 | 11.86612 | 13.29279 | 13.78654 | 14.24163 | 15.43223 | |
| RF_PV | 31 | 14.39872 | 0.73950 | 12.57450 | 13.91250 | 14.33669 | 14.98490 | 15.94272 | |
| RF_FI | 15 | 14.20864 | 0.68617 | 12.62523 | 13.68770 | 14.26701 | 14.74725 | 15.90077 | |
| SVM_PV | 8 | 14.76165 | 0.81308 | 12.99783 | 14.15996 | 14.79660 | 15.25064 | 17.17015 | |
| SVM_FI | 8 | 14.39575 | 0.82367 | 12.58128 | 13.84777 | 14.39586 | 15.01643 | 16.60752 | |
| Mental | (b) LR_PV | 12 | 11.36805 | 0.61973 | 9.53575 | 10.89313 | 11.37711 | 11.78536 | 12.96427 |
| LR_FI | 6 | 11.37025 | 0.70205 | 9.17258 | 10.86453 | 11.36144 | 11.82792 | 13.24375 | |
| RF_PV | 18 | 11.64159 | 0.59699 | 10.09463 | 11.21322 | 11.66051 | 12.00546 | 13.00478 | |
| RF_FI | 39 | 11.60706 | 0.62697 | 10.11383 | 11.11899 | 11.63986 | 11.98359 | 13.47048 | |
| SVM_PV | 4 | 11.79612 | 0.76560 | 10.14775 | 11.30421 | 11.70546 | 12.23328 | 13.82748 | |
| SVM_FI | 2 | 11.68649 | 0.75505 | 10.05444 | 11.15436 | 11.58823 | 12.22781 | 13.59275 |
In the case of the physical component, model (a) with cumulative number of features (CNoF) of 15 shows the best performance among all the models that can be generated. In the case of the mental component, model (b) with CNoF of 12 minimizes the aRMSE among all the models that can be generated.
Figure 4Box plots of root mean square error (RMSE) with respect to the model with its CNoF minimizing the aRMSE. It depicts the boxplots of RMSE distribution in terms of the CNoF that minimizes the aRMSE. CNoF, cumulative number of features, wherein the number in the parentheses indicates the CNoF maximizing performance within each category (Table 1); green lines in the middle of the boxplot indicate the median.
Description of beta of features in the best performing model.
| Physical Component (LR_FI, CNoF = 15). | Beta | Mental Component (LR_PV, CNoF = 12) | Beta |
|---|---|---|---|
| Demographic characteristics | Demographic characteristics | ||
| Age * | −1.88 | Total number of diseases * | −2.78 |
| Total number of diseases * | −3.63 | ||
| Questionnaires | Questionnaires | ||
| Stress response index-modified form (SRI-MF) * | −6.98 | SRI-MF * | −6.91 |
| Total Insomnia severity index (ISI) * | −5.08 | Total ISI * | −3.95 |
| Total Physical activities (PAs) * | 4.69 | Fear of falling * | −3.71 |
| Fear of falling * | −6.42 | ||
| Gait ability | Gait ability | ||
| Preferred speed_coefficient of variance (CV) | −3.28 | Faster speed_Walking speed | 2.92 |
| Faster speed_CV Single-support phase * | −3.53 | ||
| Slower speed_Stride time * | −0.54 | ||
| Physical fitness | Physical fitness | ||
| 6-min walking test (6MWT) * | 7.66 | 6MWT | 3.96 |
| Standing time (ST) from a long sitting position (LSP) * | −6.02 | Five times sit-to-stand * | −3.86 |
| 3-m Timed-up-and-go test right side | −6.83 | 3-m Timed-up-and-go test left side | −3.81 |
| Single-leg stance | 2.30 | 3-m Timed-up-and-go test right side | −3.61 |
| Five times sit-to-stand | −6.06 | Bicep curls right | 3.33 |
| Handgrip right side * | 5.11 | Bicep curls left | 3.24 |
| ST from LSP | −2.83 |
The best performing models are the multiple linear regression (LR) model with the feature rank based on the feature importance (FI) with cumulative number of features (CNoF) of 15 for the physical component and the LR model with a feature rank based on the p-value (PV) with CNoF of 12 for the mental component. Features masked with a symbol, *, indicate that those features are selected for the best prediction model.
Figure 5Heat-map of the variables in the best prediction model obtained from the Pearson correlation coefficient matrix, which is illustrated by color based on the correlation level. When the correlation levels are close to +1 and −1, the colors of the boxes are close to red and blue, respectively.
Figure 6Network analysis of the edges based on the Pearson correlation. Edges in black, red, and blue represent |0.2| < r < |0.25|, r ≥ 0.25, and r ≤ −0.25, respectively.