| Literature DB >> 26694962 |
Mathilde G E Verdam1,2, Frans J Oort3,4, Mirjam A G Sprangers3.
Abstract
PURPOSE: The structural equation modeling (SEM) approach for detection of response shift (Oort in Qual Life Res 14:587-598, 2005. doi: 10.1007/s11136-004-0830-y ) is especially suited for continuous data, e.g., questionnaire scales. The present objective is to explain how the SEM approach can be applied to discrete data and to illustrate response shift detection in items measuring health-related quality of life (HRQL) of cancer patients.Entities:
Keywords: Discrete data; Health-related quality of life (HRQL); Item-level analyses; Response shift; SF-36 health survey; Structural equation modeling (SEM)
Mesh:
Year: 2015 PMID: 26694962 PMCID: PMC4870306 DOI: 10.1007/s11136-015-1195-0
Source DB: PubMed Journal: Qual Life Res ISSN: 0962-9343 Impact factor: 4.147
Stage 1 and Stage 2 of the SEM approach for discrete data
| Stage 1 | Measurement model: observed discrete ordinal scores | ||
|---|---|---|---|
| What | How | Requirements | |
| Step 1 | Test the assumption of underlying, bivariate normally distributed continuous scores for each pair of discrete ordinal variablesa | The likelihood ratio (LR) test statistic can be used to test the hypothesis of underlying bivariate normal distributed continuous variables. The LR test is a test of exact fitb, the root-mean-square error of approximation (RMSEA) can be used to evaluate approximate fit, with the criterion that RMSEA values should not be larger than 0.1 [ | Applicable only with three or more response categoriesc |
| Step 2 | Test the assumption of invariance of thresholds across occasions for each discrete ordinal variabled | The difference in LR test statistics can be used to test the difference in exact fit [ | Applicable only with 4 or more response categoriese |
| Step 3 | Investigate recalibration response shift as indicated by non-invariance of thresholds across occasions in the Stage 1 measurement model | To investigate whether the non-invariance of thresholds can be attributed to specific threshold parameters, the tenability of the equality restrictions across measurement occasions can be evaluated further. For example, by testing the invariance of individual thresholds. The LR test statistics can be used to test the difference in exact fit, and the ECVI difference can be used to test the difference in approximate fit | Applicable only with four or more response categoriesf |
| Step 4 | Assess differences in estimated means of the underlying variables (i.e., true change) across measurement occasions | The effect size can be estimated by | Applicable only with two or more response categories |
aThat is, 2n 2 − n tests for 2n 2 − n pairs of 2n variables
bTo guard against inflation of family wise Type I error, a Bonferroni-corrected significance level can be used to take into account multiple comparisons, where α* = α/(2n 2 − n)
cWhen there are only two response categories, there is not enough information to evaluate the LR test statistic for pairs of items. One can instead test the assumption of underlying, trivariate normally distributed continuous scores for each triplet of dichotomous variables
dThat is, n tests for 2n variables
eWhen there are only two or three response categories, there is not enough information to evaluate the difference in LR test statistic
fWhen there are only two, three or four response categories, it is not possible to attribute possible non-invariance to a specific threshold
gWhen there are only two variables, then we need additional restrictions on model parameters (e.g., equality restriction on factor loadings or restricting the residual covariances to zero) to achieve identification
hWhen the variables have only two response categories then we cannot test the invariance of factor loadings (see Supplement 1.4)
iWhen there are only two variables, it is possible to test the invariance of intercepts but, if significant, it is not possible to identify which of the two variables has response shift
j“True” change is represented by change in common factor means, recalibration is represented by change in the intercepts, and reprioritization and reconceptualization are represented by change in the factor loadings
Fig. 1The models of Stage 1 and Stage 2 of the SEM approach for discrete ordinal data. The pentagons at the bottom represent observed discrete ordinal variables x 1–x 5, the circles with – represent the corresponding underlying continuous variables. The same feature in Stage 2 (top of the figure), as the reflective indicator variables (the circles reflect the fact that they are not directly observed). Each y* is associated with a residual factor . The residual factors represent everything that is specific to the corresponding y*. Residual factors of the same variable are correlated across measurement occasion. The circles at the top are the underlying common factors (ξ) at each measurement occasion and represent everything that – have in common (e.g., health-related quality of life). In Stage 1, each observed discrete variable x is modeled to be reflective of a single underlying continuous variable y*. Assuming a bivariate normal distribution for each pair of y* variables, we can estimate the means and variances and covariances on the basis of observed frequencies in the two-dimensional frequency tables of each pair of x variables. In Stage 2, the means and variances and covariances of y* are modeled using a common factor model with common factors ξ. Across occasion differences in estimates of measurement parameters are indicative of response shift. Specifically, in Stage 1 we investigate invariance of thresholds, and in Stage 2 we investigate invariance of intercepts, factor loadings, and residual variances (see also Table 1)
Fig. 2The estimation of thresholds (δ): observed discrete scores x are representations of underlying continuous scores y*. There are 20, 45 and 35 % observed responses in categories 1, 2 and 3, respectively. The first threshold is located where the area under the curve to the left of the threshold is 20 % (δ 1 = −0.842). The second threshold is located where the area under the curve to the left of the threshold is 65 % (δ 1 = 0.385)
Background and clinical variables of the selected study sample (N = 437) and the group of patients that was excluded due to attrition or due to too many missing values (N = 49)
| Variables | Selected study sample | Excluded sample |
|---|---|---|
| Mean (SD) | Mean (SD) | |
| Age | 57.0 (12.1) | 60.0 (12.0) |
| Karnofsky performance* | 78.4 (13.7) | 74.2 (13.0) |
Significant differences between the selected study sample and the excluded sample were evaluated with independent sample t tests for continuous variables and Chi-square test statistics for categorical variables
* Differences between the groups were significant at α = 0.05
Frequency distributions of the items of the SF-36 at baseline and follow-up that were used for statistical analyses (N = 437)
| Item | Time | Response categories | |||||
|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | ||
| Mental health (MH) | |||||||
| 24 Have you been a very nervous person? | Baseline | 14 (3 %) | 30 (7 %) | 55 (13 %) | 182 (42 %) | 91 (21 %) | 64 (15 %) |
| Follow-up | 10 (2 %) | 16 (4 %) | 35 (8 %) | 154 (35 %) | 118 (27 %) | 103 (24 %) | |
| 25 Have you felt so down in the dumps that nothing could cheer you up? | Baseline | 7 (2 %) | 13 (3 %) | 24 (6 %) | 80 (18 %) | 112 (26 %) | 200 (6 %) |
| Follow-up | 2 (0 %) | 7 (2 %) | 16 (4 %) | 76 (17 %) | 136 (31 %) | 199 (46 %) | |
| 26 Have you felt calm and peaceful? | Baseline | 23 (5 %) | 55 (13 %) | 100 (23 % | 69 (16 %) | 141 (32 %) | 48 (11 %) |
| Follow-up | 20 (5 %) | 45 (10 %) | 114 (26 %) | 45 (10 %) | 167 (38 %) | 45 (10 %) | |
| 28 Have you felt downhearted and blue? | Baseline | 8 (2 %) | 17 (4 %) | 33 (8 %) | 145 (33 %) | 119 (27 %) | 114 (26 %) |
| Follow-up | 7 (2 %) | 12 (3 %) | 22 (5 %) | 153 (35 %) | 120 (28 %) | 122 (28 %) | |
| 30 Have you been a happy person? | Baseline | 20 (5 %) | 22 (5 %) | 85 (20 %) | 48 (11 %) | 135 (31 %) | 126 (29 %) |
| Follow-up | 21 (5 %) | 29 (7 %) | 81 (19 %) | 52 (12 %) | 154 (35 %) | 99 (23 %) | |
| General physical health (GH) | |||||||
| 1 In general, would you say your health is…? | Baseline | 50 (12 %) | 153 (35 %) | 162 (37 %) | 40 (9 %) | 31 (7 %) | |
| Follow-up | 32 (7 %) | 179 (41 %) | 174 (40 %) | 40 (9 %) | 11 (3 %) | ||
| 33 I seem to get sick a little easier than other people | Baseline | 24 (6 %) | 29 (7 %) | 118 (27 %) | 60 (14 %) | 205 (47 %) | |
| Follow-up | 20 (4 %) | 41 (9 %) | 131 (30 %) | 59 (14 %) | 185 (42 %) | ||
| 34 I am as health as anybody I know | Baseline | 94 (22 %) | 100 (23 %) | 102 (23 %) | 76 (17 %) | 64 (15 %) | |
| Follow-up | 99 (23 %) | 91 (21 %) | 125 (29 %) | 73 (17 %) | 48 (11 %) | ||
| 35 I expect my health to get worse | Baseline | 46 (11 %) | 56 (13 %) | 172 (39 %) | 58 (13 %) | 104 (24 %) | |
| Follow-up | 35 (8 %) | 47 (11 %) | 197 (45 %) | 56 (13 %) | 101 (23 %) | ||
| 36 My health is excellent | Baseline | 130 (30 %) | 71 (16 %) | 80 (18 %) | 101 (23 %) | 54 (12 %) | |
| Follow-up | 131 (30 %) | 87 (20 %) | 71 (16 %) | 112 (26 %) | 35 (8 %) | ||
| Physical functioning (PF) | |||||||
| 3 Vigorous activities | Baseline | 274 (63 %) | 138 (32 %) | 25 (6 %) | |||
| Follow-up | 289 (66 %) | 120 (27 %) | 28 (6 %) | ||||
| 4 Moderate activities | Baseline | 142 (33 %) | 181 (41 %) | 114 (26 %) | |||
| Follow-up | 135 (31 %) | 185 (42 %) | 117 (27 %) | ||||
| 5 Lifting or carrying groceries | Baseline | 128 (29 %) | 184 (42 %) | 125 (29 %) | |||
| Follow-up | 114 (24 %) | 161 (37 %) | 172 (39 %) | ||||
| 6 Climbing several flights of stairs | Baseline | 85 (19 %) | 149 (34 %) | 203 (46 %) | |||
| Follow-up | 104 (24 %) | 161 (37 %) | 172 (40 %) | ||||
| 7 Climbing one flight of stairs | Baseline | 31 (7 %) | 117 (27 %) | 289 (66 %) | |||
| Follow-up | 30 (7 %) | 128 (29 %) | 279 (64 %) | ||||
| 8 Bending, kneeling, or stooping | Baseline | 57 (13 %) | 151 (35 %) | 229 (52 %) | |||
| Follow-up | 58 (13 %) | 150 (34 %) | 229 (52 %) | ||||
| 9 Walking more than a mile | Baseline | 115 (26 %) | 129 (30 %) | 193 (44 %) | |||
| Follow-up | 126 (29 %) | 127 (29 %) | 184 (42 %) | ||||
| 10 Walking several blocks | Baseline | 54 (12 %) | 95 (22 %) | 288 (66 %) | |||
| Follow-up | 68 (16 %) | 97 (22 %) | 272 (62 %) | ||||
| 11 Walking one block | Baseline | 35 (8 %) | 75 (17 %) | 327 (75 %) | |||
| Follow-up | 41 (9 %) | 73 (17 %) | 323 (74 %) | ||||
| 12 Bathing or dressing yourself | Baseline | 11 (3 %) | 63 (14 %) | 363 (83 %) | |||
| Follow-up | 19 (4 %) | 47 (11 %) | 371 (85 %) | ||||
| Role limitations due to physical health (RP) | |||||||
| 13 Did you cut down on the amount of time you spent on work or other activities? | Baseline | 306 (70 %) | 131 (30 %) | ||||
| Follow-up | 290 (66 %) | 147 (34 %) | |||||
| 14 Did you accomplished less than you would like? | Baseline | 259 (59 %) | 178 (41 %) | ||||
| Follow-up | 254 (58 %) | 183 (42 %) | |||||
| 15 Were you limited in the kind of work or other activities? | Baseline | 293 (67 %) | 144 (33 %) | ||||
| Follow-up | 303 (69 %) | 134 (31 %) | |||||
| 16 Did you have difficulty performing the work or other activities? | Baseline | 273 (62 %) | 164 (38 %) | ||||
| Follow-up | 294 (67 %) | 143 (33 %) | |||||
| Bodily pain (BP) | |||||||
| 21 How much bodily pain have you had? | Baseline | 3 (1 %) | 20 (5 %) | 97 (22 %) | 78 (18 %) | 88 (20 %) | 151 (35 %) |
| Follow-up | 7 (2 %) | 21 (5 %) | 93 (21 %) | 95 (22 %) | 81 (19 %) | 140 (32 %) | |
| 22 How much did pain interfere with your normal work? | Baseline | 17 (4 %) | 27 (6 %) | 89 (20 %) | 120 (28 %) | 184 (42 %) | |
| Follow-up | 13 (3 %) | 23 (5 %) | 49 (11 %) | 125 (29 %) | 227 (52 %) | ||
| Social functioning (SF) | |||||||
| 20 To what extent have your physical health or emotional problems interfered with your normal social activities with family, friends, neighbors, or groups? | Baseline | 9 (2 %) | 25 (6 %) | 43 (10 %) | 131 (30 %) | 229 (52 %) | |
| Follow-up | 13 (3 %) | 23 (5 %) | 49 (11 %) | 125 (29 %) | 227 (52 %) | ||
| 32 How much of the time has your physical health or emotional problems interfered with your social activities? | Baseline | 24 (5 %) | 36 (8 %) | 145 (33 %) | 68 (16 %) | 164 (38 %) | |
| Follow-up | 34 (8 %) | 41 (9 %) | 132 (30 %) | 74 (17 %) | 156 (36 %) | ||
| Role limitations due to emotional problems (RE) | |||||||
| 17 Did you cut down on the amount of time you spent on work or other activities? | Baseline | 195 (45 %) | 242 (55 %) | ||||
| Follow-up | 175 (40 %) | 262 (60 %) | |||||
| 18 Did you accomplished less than you would like? | Baseline | 190 (44 %) | 247 (57 %) | ||||
| Follow-up | 176 (40 %) | 261 (60 %) | |||||
| 19 Did you do work or other activities less carefully than usual? | Baseline | 153 (35 %) | 284 (65 %) | ||||
| Follow-up | 147 (34 %) | 290 (66 %) | |||||
| Vitality (VT) | |||||||
| 23 Did you feel full of pep? | Baseline | 16 (4 %) | 32 (7 %) | 105 (24 %) | 58 (13 %) | 145 (33 %) | 81 (19 %) |
| Follow-up | 21 (5 %) | 42 (10 %) | 104 (24 %) | 60 (14 %) | 155 (35 %) | 55 (13) | |
| 27 Did you have a lot of energy? | Baseline | 26 (6 %) | 73 (17 %) | 133 (30 %) | 56 (13 %) | 94 (22 %) | 55 (13 %) |
| Follow-up | 35 (8 %) | 96 (22 %) | 134 (31 %) | 53 (12 %) | 83 (19 %) | 36 (8 %) | |
| 29 Did you feel worn out? | Baseline | 13 (3 %) | 19 (4 %) | 48 (11 %) | 135 (90 %) | 90 (2 %) | 132 (30 %) |
| Follow-up | 11 (3 %) | 28 (6 %) | 56 (13 %) | 147 (34 %) | 100 (23 %) | 95 (22 %) | |
| 31 Did you feel tired? | Baseline | 29 (7 %) | 52 (12 %) | 77 (18 %) | 166 (38 %) | 61 (14 %) | 52 (12 %) |
| Follow-up | 37 (8 %) | 53 (12 %) | 106 (24 %) | 155 (35 %) | 56 (13 %) | 20 (7 %) | |
| Health comparison (HC) | |||||||
| 2 Compared to 1 year ago, how would you rate your health in general now? | Baseline | 32 (7 %) | 83 (19 %) | 272 (62 %) | 43 (10 %) | 7 (2 %) | |
| Follow-up | 34 (8 %) | 69 (16 %) | 243 (56 %) | 78 (18 %) | 13 (3 %) | ||
Hypothesis tests and parameter estimates of Steps 1–3 from Stage 1
| Step 1 | Step 2 | Step 3 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BVN |
| Chisqdiff |
| Thresholds | Meansa | SDsa |
| |||||||
| 1 | 2 | 3 | 4 | 5 | Pre | Post | Pre | Post | ||||||
| MH | ||||||||||||||
| 24 | ✓ | 3 | 4.14 | 0.25 | −1.96 | −1.41 | −0.90 | 0.19 | 0.85 | 3.23 | 3.83 | 1.74 | 1.85 | 0.59 |
| 25 | ✓ | 3 | 0.59 | 0.90 | −2.34 | −1.84 | −1.14 | −0.63 | 0.10 | 4.70 | 4.75 | 2.19 | 1.84 | 0.61 |
| 26b | ✓ | 3 | 15.6 | <0.01 | ||||||||||
| 26pre | −1.62 | −0.92 | −0.23 | 0.16 | 1.20 | |||||||||
| 26post | −1.69 | −1.03 | −0.23 | 0.03 | 1.24 | |||||||||
| 28 | ✓ | 3 | 5.52 | 0.14 | −2.16 | −1.64 | −1.16 | −0.13 | 0.60 | 4.09 | 4.24 | 1.96 | 1.90 | 0.53 |
| 30 | ✓ | 3 | 5.41 | 0.14 | −1.68 | −1.28 | −0.51 | −0.21 | 0.62 | 4.40 | 4.12 | 2.61 | 2.47 | 0.64 |
| GH | ||||||||||||||
| 1 | ✓ | 2 | 3.61 | 0.16 | −1.31 | −0.07 | 1.10 | 1.65 | 1.08 | 1.04 | 0.90 | 0.71 | 0.62 | |
| 33 | ✓ | 2 | 3.63 | 0.16 | −1.62 | −1.17 | −0.23 | 0.14 | 3.72 | 3.40 | 2.32 | 2.05 | 0.55 | |
| 34 | ✓ | 2 | 4.88 | 0.09 | −0.77 | −0.10 | 0.52 | 1.13 | 1.19 | 1.11 | 1.56 | 1.41 | 0.49 | |
| 35 | ✓ | 2 | 2.25 | 0.32 | −1.34 | −0.79 | 0.31 | 0.72 | 2.39 | 2.46 | 1.91 | 1.72 | 0.56 | |
| 36 | ✓ | 2 | 4.91 | 0.09 | −0.53 | −0.07 | 0.44 | 1.26 | 1.22 | 1.07 | 2.29 | 2.02 | 0.62 | |
| PF | ||||||||||||||
| 3 | ✓ | n/a | 0.37 | 1.55 | −0.26 | −0.38 | 0.80 | 0.91 | 0.60 | |||||
| 4 | ✓ | n/a | −0.48 | 0.63 | 0.42 | 0.45 | 0.91 | 0.89 | 0.65 | |||||
| 5 | ✓ | n/a | −0.59 | 0.59 | 0.49 | 0.51 | 0.90 | 0.79 | 0.72 | |||||
| 6 | ✓ | n/a | −0.79 | 0.18 | 0.91 | 0.73 | 1.05 | 1.02 | 0.74 | |||||
| 7 | ✓ | n/a | −1.48 | −0.39 | 1.40 | 1.31 | 0.95 | 0.88 | 0.71 | |||||
| 8 | ✓ | n/a | −1.12 | −0.06 | 1.06 | 1.06 | 0.94 | 0.95 | 0.73 | |||||
| 9 | ✓ | n/a | −0.60 | 0.17 | 0.81 | 0.74 | 1.28 | 1.32 | 0.74 | |||||
| 10 | ✓ | n/a | −1.08 | −0.36 | 1.55 | 1.45 | 1.34 | 1.43 | 0.70 | |||||
| 11 | ✓ | n/a | −1.36 | −0.65 | 1.91 | 1.95 | 1.36 | 1.48 | 0.67 | |||||
| 12 | ✓ | n/a | −1.78 | −0.98 | 1.96 | 2.52 | 1.00 | 1.47 | 0.66 | |||||
| RP | ||||||||||||||
| 13 | n/a | n/a | 0.47 | −0.53 | −0.42 | 1.00 | 1.00 | 0.52 | ||||||
| 14 | n/a | n/a | 0.22 | −0.23 | −0.21 | 1.00 | 1.00 | 0.51 | ||||||
| 15 | n/a | n/a | 0.47 | −0.44 | −0.51 | 1.00 | 1.00 | 0.55 | ||||||
| 16 | n/a | n/a | 0.38 | −0.32 | −0.45 | 1.00 | 1.00 | 0.49 | ||||||
| BP | ||||||||||||||
| 21 | ✓ | 3 | 9.77 | 0.02 | −2.34 | −1.53 | −0.55 | −0.84 | 0.41 | 2.92 | 2.85 | 1.18 | 1.28 | 0.55 |
| 22 | ✓ | 2 | 0.58 | 0.75 | −1.74 | −1.23 | −0.56 | 0.11 | 3.63 | 2.85 | 2.06 | 1.28 | 0.51 | |
| SF | ||||||||||||||
| 20 | ✓ | 2 | 1.48 | 0.48 | −1.98 | −1.38 | −0.90 | −0.06 | 3.28 | 3.28 | 1.61 | 1.71 | 0.42 | |
| 32 | ✓ | 2 | 3.09 | 0.21 | −1.51 | −1.02 | −0.05 | 0.33 | 3.16 | 3.06 | 1.98 | 2.15 | 0.48 | |
| RE | ||||||||||||||
| 17 | n/a | n/a | −0.19 | 0.14 | 0.25 | 1.00 | 1.00 | 0.52 | ||||||
| 18 | n/a | n/a | −0.21 | 0.16 | 0.25 | 1.00 | 1.00 | 0.60 | ||||||
| 19 | n/a | n/a | −0.40 | 0.39 | 0.42 | 1.00 | 1.00 | 0.47 | ||||||
| VT | ||||||||||||||
| 23 | ✓ | 3 | 6.67 | 0.08 | −1.74 | −1.17 | −0.31 | 0.04 | 0.99 | 3.18 | 2.90 | 1.77 | 1.72 | 0.56 |
| 27 | ✓ | 3 | 1.05 | 0.79 | −1.48 | −0.66 | 0.18 | 0.52 | 1.26 | 1.93 | 1.68 | 1.24 | 1.21 | 0.58 |
| 29 | ✓ | 3 | 3.46 | 0.33 | −1.89 | −1.43 | −0.86 | 0.07 | 0.64 | 4.36 | 3.95 | 2.31 | 2.08 | 0.45 |
| 31 | ✓ | 3 | 5.86 | 0.12 | −1.46 | −0.83 | −0.27 | 0.77 | 1.32 | 2.47 | 2.16 | 1.64 | 1.64 | 0.52 |
| HC | ||||||||||||||
| 2 | ✓ | 2 | 6.96 | 0.03 | −0.68 | 1.07 | 1.96 | 1.77 | 1.97 | 1.22 | 1.35 | 0.03 | ||
BVN bivariate normality; the underlying assumption of bivariate normality was evaluated for each item, and considered to be tenable (✓) if the assumption holds for all item pairs according to the RMSEA (see Table 1)
Thresholds were estimated to be equal across measurement occasions using the standard parameterization, where the means and variances of the underlying variables at two consecutive measurement occasions are then defined by: μ 1 + μ 2 = 0 and
n/a not applicable, see also Table 1. MH mental health, GH general physical health, PF physical functioning, RP role limitations due to physical health, BP bodily pain, SF social functioning, RE role limitations due to emotional health, VT vitality, and HC health comparison
aThe alternative parameterization was used to estimate the means and standard deviations of the underlying variables under equal thresholds that were used for subsequent analyses. This entails that identification of the model is achieved by fixing the first two threshold values at zero and one, instead of restricting the sum of the means and variances of the underlying variables. This parameterization is equivalent to the standard parameterization; the linear transformation of the estimates is described in detail by Jöreskog [21]
bThe means and standard deviations of the underlying variables of Item 26 are not given as they cannot be readily compared across measurements due to recalibration response shift
Goodness of overall model fit and difference in model fit of the models in Stage 2
| Model |
|
| RMSEA [90 % CI] | ECVI [90 % CI] | Compared to |
| CHISQdiff | ECVIdiff [90 % CI] |
|---|---|---|---|---|---|---|---|---|
| Mental health (MH) | ||||||||
| 1a Measurement model | 25 | 61.559 | 0.058 [0.040; 0.076] | 0.279 [0.235; 0.341] | ||||
| 1b No response shift model | 31 | 158.28 | 0.097 [0.082; 0.112] | 0.386 [0.304; 0.485] | Model 1a | 6 | 96.72 | 0.194 [0.123; 0.276] |
| 1c Response shift model | 28 | 62.979 | 0.054 [0.036; 0.071] | 0.268 [0.224; 0.330] | Model 1a | 3 | 1.320 | −0.011 [−0.007; 0.003] |
| General physical health (GH) | ||||||||
| 2a Measurement model | 29 | 61.286 | 0.047 [0.031; 0.063] | 0.162 [0.115; 0.227] | ||||
| 2b No response shift model | 37 | 72.601 | 0.051 [0.033; 0.068] | 0.173 [0.130; 0.233] | Model 2a | 8 | 11.32 | −0.011 [−0.018; 0.019] |
| Physical functioning (PF)a | ||||||||
| 3a Measurement Model | 151 | 339.06 | 0.053 [0.046; 0.061] | 1.048 [0.935; 1.180] | ||||
| 3b No response shift model | 169 | 477.64 | 0.065 [0.058; 0.072] | 1.284 [1.143; 1.442] | Model 3a | 18 | 380.7 | 0.791 [0.654; 0945] |
| 3c Response shift model | 166 | 374.98 | 0.054 [0.047; 0.061] | 1.062 [0.942; 1.200] | Model 3a | 15 | 46.75 | 0.038 [−0.001; 0.095] |
| Role limitations due to physical health (RP) | ||||||||
| 4a Measurement model | 15 | 29.727 | 0.048 [0.021; 0.072] | 0.165 [0.138; 0.210] | ||||
| 4b No response shift model | 18 | 72.543 | 0.083 [0.064; 0.104] | 0.249 [0.120; 0.318] | ||||
| 4c Response shift model | 17 | 51.313 | 0.068 [0.047; 0.090] | 0.205 [0.164; 0.263] | ||||
| Bodily pain (BP) | ||||||||
| 5a Measurement model | 1 | 1.798 | 0.043 [0; 0.143] | 0.045 [0.044; 0.064] | ||||
| 5b No response shift model | 3 | 39.766 | 0.168 [0.124; 0.216] | 0.123 [0.085; 0.179] | Model 5a | 2 | 37.968 | 0.078 [0.040; 0.133] |
| 5c Response Shift Model | 2 | 5.941 | 0.067 [0; 0.133] | 0.073 [0.038; 0.125] | Model 5a | 1 | 4.143 | 0.005 [−0.002; 0.029] |
| Social functioning (SF) | ||||||||
| 6a Measurement Model | 1 | 0.143 | 0 [0; 0.092] | 0.042 [0.044; 0.052] | ||||
| 6b No response shift model | 2 | 1.303 | 0 [0; 0.084] | 0.040 [0.041; 0.055] | Model 6a | 1 | 1.16 | −0.002 [−0.002; 0.015] |
| Role limitations due to emotional problems (RE) | ||||||||
| 7a Measurement model | 5 | 13.022 | 0.061 [0.021; 0.102] | 0.103 [0.087; 0.137] | ||||
| 7b No response shift model | 7 | 17.834 | 0.060 [0.026; 0.095] | 0.105 [0.085; 0.143] | ||||
| Vitality (VT) | ||||||||
| 8a Measurement model | 11 | 4.7300 | 0 [0; 0.009] | 0.126 [0.140; 0.141] | ||||
| 8b No response shift model | 17 | 12.326 | 0 [0; 0.030] | 0.116 [0.126; 0.141] | Model 8a | 6 | 7.596 | −0.010 [−0.014; 0.016] |
N = 437; overall model fit and difference in fit was evaluated using WLS Chi-square values that are provided in the standard LISREL output (denoted C2_NNT)
aFor the subscale PF the WLS Chi-square values did not appear stable, and overall model fit was therefore evaluated using the Satorra–Bentler mean adjusted Chi-square values (denoted C3 in the standard LISREL output), and difference of model fit was evaluated using the difference in uncorrected (DWLS) Chi-square values (denoted C1 in the standard LISREL output)
Assessment of change in the items of the SF-36: results from Step 4 of Stage 1 and Stage 2, expressed as effect sizes (standardized differences)
| Item | Stage 1 | Stage 2 | |||
|---|---|---|---|---|---|
| Observed change in variables | True change in underlying variables | Modeled change in variables | Response shift change | True change | |
| Mental health (MH) | |||||
| 24 | 0.33** | 0.37** | 0.36** | 0.30c**/0.01d | 0.04 |
| 25 | 0.12* | 0.03 | 0.06 | 0.06 | |
| 26b | 0.06 | ||||
| 28 | 0.08 | 0.08 | 0.05 | 0.05 | |
| 30 | −0.08 | −0.13* | −0.13* | −0.16c** | 0.03 |
| General physical health (GH) | |||||
| 1 | −0.08 | −0.05 | −0.08 | −0.08 | |
| 33 | −0.08 | −0.15* | −0.04 | −0.04 | |
| 34 | −0.06 | −0.06 | −0.07 | −0.07 | |
| 35 | 0.05 | 0.04 | −0.05 | −0.05 | |
| 36 | −0.08 | −0.08 | −0.11* | −0.11* | |
| Physical functioning (PF) | |||||
| 3 | −0.04 | −0.15* | −0.04 | −0.00d | −0.04 |
| 4 | 0.03 | 0.04 | −0.04 | −0.04 | |
| 5 | 0.02 | 0.02 | −0.04 | −0.04 | |
| 6 | −0.17** | −0.24** | −0.05 | −0.05 | |
| 7 | −0.04 | −0.12* | −0.05 | −0.05 | |
| 8 | 0.00 | 0.00 | −0.05 | −0.05 | |
| 9 | −0.06 | −0.08 | −0.05 | −0.05 | |
| 10 | −0.10* | −0.10* | −0.06 | −0.06 | |
| 11 | −0.04 | 0.03 | −0.05 | −0.05 | |
| 12 | 0.00 | 0.51** | 0.46** | 0.51c**/−0.02d | −0.03 |
| Role limitations due to physical health (RP) | |||||
| 13 | 0.07 | 0.11* | 0.02 | 0.08c | −0.06 |
| 14 | 0.02 | 0.03 | −0.06 | −0.06 | |
| 15 | −0.04 | −0.07 | −0.07 | −0.07 | |
| 16 | −0.09 | −0.13* | −0.06 | −0.06 | |
| Bodily pain (BP) | |||||
| 21 | −0.07 | −0.06 | −0.06 | −0.23** | 0.17** |
| 22 | 0.08 | 0.16** | 0.16** | 0.16** | |
| Social functioning (SF) | |||||
| 20 | −0.03 | 0.00 | −0.04 | −0.04 | |
| 32 | −0.06 | −0.05 | −0.03 | −0.03 | |
| Role limitations due to emotional problems (RE) | |||||
| 17 | 0.08 | 0.12* | 0.09 | 0.09 | |
| 18 | 0.06 | 0.09 | 0.10* | 0.10* | |
| 19 | 0.02 | 0.04 | 0.08 | 0.08 | |
| Vitality (VT) | |||||
| 23 | −0.13* | −0.17** | −0.19** | −0.19** | |
| 27 | −0.20** | −0.22** | −0.27** | −0.27** | |
| 29 | −0.14* | −0.18** | −0.16** | −0.16** | |
| 31 | −0.18** | −0.20** | −0.20** | −0.20** | |
| Health comparison (HC) | |||||
| 2 | 0.11* | 0.11* | |||
N = 437; standardized mean differences of 0.2, 0.5, and 0.8 indicate small, medium, and large differences [12]
* p < .05; ** p < .01
aObserved change was calculated by considering the ordinal discrete response scale as a proxy for an interval response scale, and comparing baseline and follow-up measurements using paired t tests
bResults of Stage 2 for Item 26 cannot be interpreted because recalibration response shift was detected for this item in Stage 1
cRecalibration
dReprioritization