| Literature DB >> 35728903 |
Max Olsson1, David C Currow2, Magnus Per Ekström3.
Abstract
OBJECTIVE: To evaluate which factors are the most strongly related to self-perceived health among older men and describe the shape of the association between the related factors and self-perceived health using machine learning. DESIGN ANDEntities:
Keywords: geriatric medicine; health informatics; public health
Mesh:
Year: 2022 PMID: 35728903 PMCID: PMC9214374 DOI: 10.1136/bmjopen-2022-061242
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 3.006
Characteristics of 475 men aged 73 years in the study population
| Factors (% missing values) | Self-perceived health | ||
| Total n=475 | Better n=193 | Worse n=282 | |
| Anthropometrics, lifestyle and demographic factors | |||
| Body mass index (1.1%) | 27.07 (4.1) | 25.77 (2.8) | 27.96 (4.5) |
| University degree (0%) | 96 (20.2%) | 57 (29.5%) | 39 (13.8%) |
| Ever smoker (1.9%) | 313 (65.9%) | 113 (58.5%) | 200 (70.9%) |
| Pack-years of smoking (9.7%) | 8.84 (16.6) | 10.06 (21.6) | 8.00 (11.9) |
| Exercise frequency (1.7%) | |||
| Less than once a week | 82 (17.3%) | 10 (5.2%) | 72 (25.5%) |
| 1–3 times a week | 134 (28.2%) | 42 (21.8%) | 92 (32.6%) |
| 3–6 times a week | 146 (30.7%) | 79 (40.9%) | 67 (23.8%) |
| Everyday | 113 (23.8%) | 62 (32.1%) | 51 (18.1%) |
| Standard units of alcohol (15.8%) | 6.43 (6.3) | 6.88 (5.8) | 6.13 (6.6) |
| Sleep quality (0.6%) | |||
| Very bad | 65 (13.7%) | 4 (2.1%) | 61 (21.6%) |
| Bad | 159 (33.5%) | 36 (18.7%) | 123 (43.6%) |
| Quite good | 137 (28.8%) | 80 (41.5%) | 57 (20.2%) |
| Good | 5 (1.1%) | 0 (0.0%) | 5 (1.8%) |
| Very good | 109 (22.9%) | 73 (37.8%) | 36 (12.8%) |
| Sleep duration (0.6%) | |||
| 4 hours or less | 43 (9.1%) | 8 (4.1%) | 35 (12.4%) |
| 5 hours | 77 (16.2%) | 21 (10.9%) | 56 (19.9%) |
| 6 hours | 177 (37.3%) | 80 (41.5%) | 97 (34.4%) |
| 7 hours | 130 (27.4%) | 68 (35.2%) | 62 (22.0%) |
| 8 hours | 32 (6.7%) | 14 (7.3%) | 18 (6.4%) |
| 9 hours | 5 (1.1%) | 1 (0.5%) | 4 (1.4%) |
| 10 hours or more | 11 (2.3%) | 1 (0.5%) | 10 (3.5%) |
| Symptoms | |||
| Anxiety (2.9%) | 1.03 (2.0) | 0.12 (0.4) | 1.65 (2.4) |
| Appetite (2.5%) | 0.71 (1.7) | 0.05 (0.4) | 1.16 (2.1) |
| Breathlessness (2.9%) | 2.08 (2.6) | 0.50 (1.0) | 3.16 (2.8) |
| Depression (3.2%) | 1.48 (2.2) | 0.25 (0.6) | 2.32 (2.5) |
| Drowsiness (4.0%) | 2.35 (2.4) | 0.73 (0.9) | 3.46 (2.5) |
| Fatigue (3.2%) | 2.76 (2.6) | 0.90 (1.2) | 4.03 (2.5) |
| Nausea (2.9%) | 0.60 (1.5) | 0.09 (0.5) | 0.95 (1.8) |
| Pain (2.7%) | 2.84 (2.6) | 1.05 (1.4) | 4.07 (2.5) |
| Health conditions (4.2%) | |||
| Cancer | 89 (18.7%) | 33 (17.1%) | 56 (19.9%) |
| Cardiovascular disease | 179 (37.7%) | 48 (24.9%) | 131 (46.5%) |
| Diabetes | 79 (16.6%) | 18 (9.3%) | 61 (21.6%) |
| Hypertension | 275 (57.9%) | 91 (47.2%) | 184 (65.2%) |
| Hyperlipidaemia | 127 (26.7%) | 36 (18.7%) | 91 (32.3%) |
| Respiratory disease | 85 (17.9%) | 17 (8.8%) | 68 (24.1%) |
| Rheumatological disease | 30 (6.3%) | 9 (4.7%) | 21 (7.4%) |
All values are presented as the mean (SD) or frequency (%). Values in the table correspond to the values after imputation of missing values by median for continuous variables and mode for categorical variables.
Figure 1Importance of factors for self-perceived health. Ranking of variables for predicting better (excellent/very good) or worse (fair/poor) self-perceived health is measured in SHapley Additive exPlanations (SHAP) absolute mean values and is calculated on the participants in the validation set. A higher SHAP absolute mean value corresponds to greater importance for predicting self-perceived health as better or worse. The variable importance is multivariate and is calculated in relation to all other factors in the dataset. BMI, body mass index.
Figure 2Variable impact on log-odds for self-perceived health. Each dot represents a participant in the validation set. X-axes represent the participants’ scores for the factor. Y-axes represent the change in log-odds of rating self-perceived health as either worse (positive logodds) or better (negative log-odds). The red line is a locally estimated scatterplot smoothing line (LOESS) that represents a summarized value of the participant’s log-odds. The dashed line marks a log-odds of zero. The symptoms are reported using a numerical rating scale (NRS) from 0 (none) to 10 (worst possible intensity of the symptom). Standard units of alcohol correspond to weekly average. BMI, body mass index.