| Literature DB >> 28281663 |
Francisco Félix Caballero1,2,3, George Soulis4, Worrawat Engchuan5, Albert Sánchez-Niubó6,7, Holger Arndt8, José Luis Ayuso-Mateos1,2,3, Josep Maria Haro2,6, Somnath Chatterji9, Demosthenes B Panagiotakos4.
Abstract
A most challenging task for scientists that are involved in the study of ageing is the development of a measure to quantify health status across populations and over time. In the present study, a Bayesian multilevel Item Response Theory approach is used to create a health score that can be compared across different waves in a longitudinal study, using anchor items and items that vary across waves. The same approach can be applied to compare health scores across different longitudinal studies, using items that vary across studies. Data from the English Longitudinal Study of Ageing (ELSA) are employed. Mixed-effects multilevel regression and Machine Learning methods were used to identify relationships between socio-demographics and the health score created. The metric of health was created for 17,886 subjects (54.6% of women) participating in at least one of the first six ELSA waves and correlated well with already known conditions that affect health. Future efforts will implement this approach in a harmonised data set comprising several longitudinal studies of ageing. This will enable valid comparisons between clinical and community dwelling populations and help to generate norms that could be useful in day-to-day clinical practice.Entities:
Mesh:
Year: 2017 PMID: 28281663 PMCID: PMC5345043 DOI: 10.1038/srep43955
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Five-factor solution corresponding to EFA conducted on the developmental sample at ELSA baseline (n = 7756): factor loading estimates after Geomin rotation.
| Variable | F1 | F2 | F3 | F4 | F5 |
|---|---|---|---|---|---|
| 0.03 | 0.13 | −0.02 | −0.07 | ||
| 0.01 | 0.17 | −0.01 | −0.08 | ||
| 0.02 | −0.01 | 0.06 | −0.01 | ||
| 0.01 | 0.01 | 0.06 | −0.02 | ||
| 0.01 | 0.22 | 0.03 | −0.05 | ||
| 0.04 | 0.04 | 0.07 | −0.05 | ||
| 0.01 | 0.12 | −0.06 | −0.08 | ||
| −0.02 | −0.12 | 0.08 | 0.03 | ||
| 0.04 | 0.16 | 0.14 | −0.11 | ||
| −0.01 | 0.10 | 0.04 | 0.01 | ||
| −0.01 | 0.05 | −0.04 | 0.02 | ||
| −0.04 | −0.04 | −0.12 | 0.11 | ||
| 0.02 | −0.05 | −0.07 | 0.01 | ||
| −0.04 | −0.06 | −0.04 | 0.01 | ||
| −0.04 | −0.11 | −0.08 | 0.16 | ||
| 0.04 | 0.17 | 0.18 | −0.06 | ||
| 0.02 | −0.09 | 0.09 | 0.07 | ||
| −0.02 | −0.20 | 0.01 | 0.13 | ||
| 0.04 | 0.06 | 0.13 | |||
| 0.05 | 0.08 | 0.05 | −0.02 | ||
| 0.01 | −0.03 | −0.01 | 0.14 | ||
| 0.05 | −0.01 | 0.16 | |||
| 0.04 | 0.15 | ||||
| 0.07 | 0.07 | 0.12 | |||
| 0.06 | 0.21 | 0.04 | |||
| 0.02 | −0.02 | 0.03 | 0.06 | ||
| −0.01 | 0.18 | 0.06 | −0.01 | ||
| −0.11 | 0.11 | 0.11 | 0.22 | ||
| 0.15 | 0.14 | 0.07 | |||
| 0.01 | −0.01 | 0.01 | 0.01 | ||
| 0.02 | 0.01 | −0.01 | 0.01 | ||
| 0.01 | −0.02 | 0.01 | 0.01 | ||
| −0.05 | −0.16 | 0.10 | 0.08 | ||
| −0.01 | 0.01 | 0.04 | |||
| 0.05 | −0.07 | 0.03 | −0.05 | ||
| −0.03 | −0.08 | −0.05 | −0.03 | ||
| 0.10 | −0.01 | −0.15 | −0.02 | ||
| 0.06 | 0.07 | 0.16 | 0.04 | ||
| −0.01 | −0.16 | −0.02 | 0.06 | ||
| 0.01 | −0.10 | −0.03 | 0.07 | ||
| 0.01 | 0.01 | 0.06 | 0.01 | ||
| 0.02 | 0.03 | 0.12 | −0.05 | ||
| 0.08 | 0.03 | 0.01 | 0.07 | ||
| 0.01 | 0.02 | 0.06 | 0.06 | ||
| 0.04 | 0.09 | −0.14 |
In bold, factor loadings higher than or equal to 0.25.
Figure 1Second-order Confirmatory Factor Analysis conducted over the validation sample at ELSA baseline.
***p < 0.001.
Reliability of the Expected-A-Posteriori (EAP) estimates and Deviance Information Criterion (DIC) values associated to four different Bayesian multilevel IRT models.
| Model | EAP reliability | DIC value |
|---|---|---|
| 1) No intercept variance, no slopes | 0.903 | 1,194,740 |
| 2) Itemwise intercept variance, no slopes | 0.920 | 1,112,449 |
| 3) Homogeneous intercept variance, no slopes | 0.906 | 1,112,414 |
| 4) Intercept variance and slope variances–hierarchical item and slope parameters |
In bold, highest EAP reliability value and lowest DIC value.
Mixed-effect multilevel regression to assess the relationship between different factors and health score.
| Variables | Coefficient (95% CI) | |z| | |
|---|---|---|---|
| Fixed part | |||
| Intercept | 57.27 (56.49, 58.05) | 143.91 | <0.001 |
| Gender (Ref. Male) | −1.54 (−1.87, −1.22) | 9.31 | <0.001 |
| Age group (Ref. <65 years) | |||
| −0.63 (−0.83, −0.42) | 5.96 | <0.001 | |
| −4.43 (−4.79, −4.08) | 24.36 | <0.001 | |
| Quintile of household wealth (Ref. | |||
| 2.46 (2.13, 2.79) | 14.51 | <0.001 | |
| 3.62 (3.28, 3.97) | 20.37 | <0.001 | |
| 4.48 (4.11, 4.84) | 24.21 | <0.001 | |
| 5.03 (4.64, 5.41) | 25.49 | <0.001 | |
| Formal education (Ref. No qualification) | 0.84 (0.55, 1.13) | 5.67 | <0.001 |
| Marital status (Ref. Married) | |||
| 0.18 (−0.41, 0.78) | 0.61 | 0.54 | |
| −1.11 (−1.39, −0.83) | 7.77 | <0.001 | |
| Falls (Ref. No) | −1.61 (−1.77, −1.45) | 19.30 | <0.001 |
| Smoke (Ref. Never smoker) | |||
| −1.48 (−1.77, −1.18) | 9.81 | <0.001 | |
| −0.99 (−1.37, −0.61) | 5.09 | <0.001 | |
| Alcohol consumption (Ref. Not drinking) | |||
| 2.68 (2.40, 2.96) | 18.69 | <0.001 | |
| 3.36 (3.03, 3.68) | 20.07 | <0.001 | |
| Physical activity (Ref. Inactive) | |||
| 5.38 (5.04, 5.72) | 30.80 | <0.001 | |
| 8.20 (7.90, 8.51) | 52.77 | <0.001 | |
| Employment (Ref. Not in work) | 2.88 (2.67, 3.09) | 26.65 | <0.001 |
| Size of the social network (Ref. Small) | |||
| 0.25 (0.09, 0.41) | 3.07 | 0.002 | |
| 0.27 (0.07, 0.47) | 2.62 | 0.009 | |
| Standard deviation of random intercept | 8.43 | 0.07 | (8.30, 8.57) |
| Standard deviation of random effect corresponding to Wave | 1.00 | 0.02 | (0.95, 1.04) |
| Standard deviation of the level-1 residuals | 5.28 | 0.02 | (5.23, 5.33) |
Figure 2Scatter plots corresponding to the relationship between age (X-axis) and health status (Y-axis) in each ELSA wave.
Age was collapsed at 90 years for people older than 90.
Figure 3Relationship between different factors and level of health assessed by machine learning method.
Mean Decrease Accuracy (MDA) values.
Multiple linear regression model to assess the relationship between the presence of different chronic conditions and the health score created.
| Variables | Coefficient (95% CI) | |β| | |
|---|---|---|---|
| Intercept | 85.32 (83.86, 86.78) | <0.001 | — |
| Chronic lung disease (Ref. No) | −6.95 (−7.75, −6.15) | <0.001 | 0.13 |
| Asthma (Ref. No) | −3.01 (−3.61, −2.40) | <0.001 | 0.08 |
| Arthritis (Ref. No) | −8.66 (−9.09, −8.24) | <0.001 | 0.31 |
| Osteoporosis (Ref. No) | −7.68 (−8.60, −6.75) | <0.001 | 0.13 |
| Stroke (Ref. No) | −9.38 (−10.48, −8.27) | <0.001 | 0.13 |
| Cancer or a malignant tumour (Ref. No) | −1.70 (−2.51, −0.89) | <0.001 | 0.03 |
| Diabetes (Ref. No) | −4.50 (−5.35, −3.66) | <0.001 | 0.08 |
| Psychiatric problems (Ref. No) | −4.67 (−5.41, −3.94) | <0.001 | 0.10 |
Sensitivity analysis over the ELSA baseline. The analysis was adjusted for gender, age and formal education.