| Literature DB >> 25840902 |
Marcelo Coca Perraillon1, Ya-Chen Tina Shih2, Ronald A Thisted3.
Abstract
BACKGROUND: . When data on preferences are not available, analysts rely on condition-specific or generic measures of health status like the SF-12 for predicting or mapping preferences. Such prediction is challenging because of the characteristics of preference data, which are bounded, have multiple modes, and have a large proportion of observations clustered at values of 1.Entities:
Keywords: EQ-5D; SF-12; Tobit; health-related quality of life; mapping; mixture models; prediction
Mesh:
Year: 2015 PMID: 25840902 PMCID: PMC4574086 DOI: 10.1177/0272989X15577362
Source DB: PubMed Journal: Med Decis Making ISSN: 0272-989X Impact factor: 2.583
Figure 1Distribution of EQ-5D-3L by age group and medical condition. Data source: MEPS, 2000. All sample (A); by age group (B–E), and for selected self-reported conditions (F–I). “Any condition” refers to those who have heart disease, stroke, and/or diabetes. Some individuals have more than 1 condition.
Baseline Characteristics by Sample
| Estimation Sample ( | Validation Sample ( | |||||||
|---|---|---|---|---|---|---|---|---|
| % |
| % |
| |||||
| EQ-5D-3L | PCS | MCS | EQ-5D-3L | PCS | MCS | |||
| Age, years | ||||||||
| 18–40 | 45.53 | 0.88 (0.18) | 52.54 (7.10) | 51.31 (8.96) | 44.49 | 0.86 (0.20) | 52.29 (7.49) | 50.95 (9.30) |
| 41–65 | 39.87 | 0.79 (0.25) | 48.03 (10.59) | 51.12 (9.57) | 40.48 | 0.79 (0.25) | 48.29 (10.37) | 50.92 (9.56) |
| 66–80 | 11.33 | 0.71 (0.27) | 41.40 (12.10) | 51.83 (10.16) | 12.02 | 0.71 (0.26) | 41.42 (11.86) | 51.40 (9.95) |
| >80 | 3.26 | 0.60 (0.33) | 35.60 (11.60) | 50.70 (11.11) | 2.65 | 0.61 (0.29) | 34.73 (11.60) | 49.88 (11.46) |
| | 44.79 (17.4) | 44.91 (17.20) | ||||||
| Sex | ||||||||
| Male | 46.21 | 0.83 (0.23) | 49.86 (9.74) | 52.34 (8.68) | 46.22 | 0.84 (0.22) | 49.69 (9.85) | 52.09 (8.81) |
| Female | 53.79 | 0.80 (0.25) | 48.12 (10.80) | 50.36 (9.92) | 53.78 | 0.79 (0.25) | 48.18 (10.67) | 49.99 (1.03) |
| Race | ||||||||
| White | 83.08 | 0.81 (0.24) | 48.96 (10.33) | 51.31 (9.40) | 83.68 | 0.84 (0.24) | 48.90 (10.34) | 50.98 (9.52) |
| Black | 13.65 | 0.80 (0.26) | 48.33 (10.60) | 51.09 (9.55) | 12.89 | 0.80 (0.26) | 48.48 (10.35) | 50.98 (9.76) |
| Other | 3.27 | 0.84 (0.23) | 50.49 (9.59) | 51.20 (9.38) | 3.43 | 0.83 (0.24) | 49.81 (9.81) | 50.41 (9.45) |
| Education | ||||||||
| Less than HS | 56.78 | 0.78 (0.26) | 47.43 (11.00) | 50.68 (9.95) | 57.03 | 0.78 (0.26) | 47.44 (11.03) | 50.25 (10.06) |
| HS or more | 43.22 | 0.86 (0.20) | 50.89 (9.08) | 52.06 (8.61) | 42.97 | 0.85 (0.19) | 50.79 (8.95) | 51.90 (8.72) |
| Asthma | 8.46 | 0.73 (0.28) | 45.37 (12.07) | 49.20 (10.84) | 8.76 | 0.72 (0.30) | 45.26 (12.22) | 48.42 (11.19) |
| Current smoker | 21.94 | 0.78 (0.26) | 48.39 (10.52) | 49.54 (10.28) | 22.67 | 0.78 (0.26) | 48.29 (10.70) | 49.09 (10.61) |
| Diabetes | 6.63 | 0.65 (0.33) | 40.20 (12.57) | 48.31 (11.14) | 6.74 | 0.67 (0.31) | 40.54 (12.09) | 48.84 (10.75) |
| Emphysema | 1.38 | 0.53 (0.32) | 31.69 (11.01) | 45.74 (12.30) | 1.43 | 0.61 (0.30) | 33.83 (11.53) | 47.27 (11.99) |
| Heart disease | 9.44 | 0.63 (0.33) | 38.75 (13.11) | 48.48 (11.12) | 10.05 | 0.66 (0.31) | 39.64 (12.48) | 48.91 (11.34) |
| Hypertension | 22.78 | 0.70 (0.29) | 42.55 (12.30) | 49.90 (10.62) | 23.58 | 0.72 (0.28) | 43.21 (11.79) | 49.79 (1.54) |
| Joint pain | 30.9 | 0.68 (0.29) | 43.93 (12.33) | 49.41 (10.77) | 32.4 | 0.69 (0.28) | 43.36 (12.05) | 49.17 (10.65) |
| Stroke | 2.02 | 0.53 (0.36) | 35.18 (12.09) | 46.35 (11.93) | 2.43 | 0.57 (0.34) | 35.17 (11.52) | 47.85 (12.38) |
Note: Data source: MEPS 2000. HS = high school; MCS = SF-12 mental component; PCS = SF-12 physical component.
Model Parameters by Method
| Variable | OLS | Two-Part Model | Mixture 1 | Mixture 2 | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| First Part | Second Part | Class 1 | Pr1 | Class 2 | Pr2 | Class 1 | Pr1 | Class 2 | Pr2 | ||
| 0.6328 | 0.3671 | ||||||||||
| (0.0070) | (0.0071) | ||||||||||
| Age | −0.0023 | 0.2368 | −0.0056 | −0.192 | −0.0058 | 0.0264* | 0.0212* | ||||
| (0.0021) | (0.0437) | (0.0025) | (0.0072) | (0.0011) | (0.0021) | (0.0040) | |||||
| Age2 | 0.002 | −0.0241 | 0.0001 | 0.0052 | 0.0002 | ||||||
| (0.0005) | (0.0107) | (0.0007) | (0.0021) | (0.0003) | |||||||
| Male | −0.005 | −0.0295 | −0.0094 | −0.008 | −0.0013 | ||||||
| (0.0036) | (0.0633) | (0.0052) | (0.0153) | (0.0020) | |||||||
| HS or more | 0.016 | −0.2655 | 0.0172 | 0.0522 | 0.0078 | ||||||
| (0.0037) | (0.0642) | (0.0054) | (0.0159) | (0.0020) | |||||||
| PCS | 0.0115 | 0.0046 | −0.2152 | 0.0096 | −0.3208 | ||||||
| (0.0003) | (0.0001) | (0.0073) | (0.0006) | (0.0091) | |||||||
| PCS2 | −0.0001 | ||||||||||
| (0.0001) | |||||||||||
| MCS | 0.0085 | 0.002 | −0.1414 | 0.0046 | −0.2491 | ||||||
| (0.0002) | (0.0001) | (0.0053) | (0.0006) | (0.0074) | |||||||
| PCS2 | −0.0001 | ||||||||||
| (0.0001) | |||||||||||
| PCS + MCS | −0.1661 | 0.0044 | 0.0359 | 0.0028 | |||||||
| (0.0044) | (0.0003) | (0.0011) | (0.0001) | ||||||||
| (PCS + MCS)2 | −0.0018 | −0.0002 | 0.0003 | −0.0001 | |||||||
| (0.0003) | (0.0001) | (0.0001) | (0.00001) | ||||||||
| Intercept | 0.8172 | 1.9158 | 0.7476 | 0.9908 | 0.7521 | 0.7692 | 1.5638 | 0.3351 | −1.955 | ||
| (0.0043) | (0.0807) | (0.0052) | (0.0176) | (0.0022) | (0.0010) | (0.0678) | (0.0160) | (0.1439) | |||
Note: Data source: MEPS 2000. HS = high school; PCS = SF-12 physical component centered at 50; MCS = SF-12 mental component centered at 50; OLS = ordinary least squares; Pr1 = probability of class 1; Pr2 = probability of class 2. (PCS+MCS) centered at 100. In two-part models, the first part is a logistic regression for Pr(EQ-5D-3L<1) and the second part is OLS. Numbers in parentheses are standard errors.
P < 0.05. **P < 0.01. ***P < 0.001.
Model Comparisons
| Mixture 1 | Mixture 2 | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Linear (OLS) | Two-Part | Estimation | Validation | Estimation | Validation | |||||
| Estimation | Validation | Estimation | Validation | WA | WA | WA | CEC | WA | CEC | |
| RMSE | 0.148 | 0.149 | 0.148 | 0.149 | 0.155 | 0.157 | 0.147 | 0.169 | 0.146 | 0.166 |
| MAE | 0.107 | 0.108 | 0.105 | 0.106 | 0.121 | 0.122 | 0.105 | 0.095 | 0.104 | 0.093 |
| Predicted | ||||||||||
| | 0.814 | 0.810 | 0.814 | 0.810 | 0.811 | 0.808 | 0.814 | 0.837 | 0.811 | 0.833 |
| | 0.180 | 0.188 | 0.181 | 0.188 | 0.181 | 0.182 | 0.190 | 0.218 | 0.189 | 0.216 |
| Minimum | 0.045 | 0.001 | −0.152 | −0.208 | 0.177 | 0.175 | 0.013 | −0.036 | −0.007 | −0.051 |
| Maximum | 1.035 | 1.033 | 0.994 | 0.994 | 0.939 | 0.938 | 0.989 | 1.000 | 0.989 | 1.000 |
Note: CEC = conditional on estimated class; MAE = mean absolute error; OLS = ordinary least squares; RMSE = root mean squared error; WA = weighted average prediction. Mixture 1 does not include covariates in the probability model. Mixture 2 model includes SF-12 mental and physical components and age as predictors of mixture probabilities. Mean, standard deviation, maximum, and minimum for estimation and validation samples are 0.814, 0.240, –0.594, 1 and 0.813, 0.239, –0.594, 1, respectively.
Prediction Performance in Validation Sample by Level of the EQ-5D-3L Index
| Mixture 2 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Linear (OLS) | Two-Part | Mixture 1 | WA | CEC | |||||||
| EQ-5D-3L level |
| MAE | RMSE | MAE | RMSE | MAE | RMSE | MAE | RMSE | MAE | RMSE |
| All sample | |||||||||||
| <0 | 134 | 0.436 | 0.466 | 0.386 | 0.434 | 0.468 | 0.492 | 0.390 | 0.430 | 0.301 | 0.400 |
| 0–0.699 | 1241 | 0.179 | 0.226 | 0.177 | 0.235 | 0.194 | 0.239 | 0.174 | 0.232 | 0.191 | 0.286 |
| 0.7–0.899 | 2673 | 0.099 | 0.118 | 0.093 | 0.110 | 0.107 | 0.122 | 0.093 | 0.110 | 0.096 | 0.126 |
| 0.9–1 | 3073 | 0.072 | 0.095 | 0.076 | 0.100 | 0.090 | 0.100 | 0.072 | 0.095 | 0.040 | 0.100 |
| All | 7121 | 0.108 | 0.149 | 0.106 | 0.149 | 0.122 | 0.157 | 0.104 | 0.146 | 0.093 | 0.166 |
| Diabetes, heart disease, or stroke | |||||||||||
| <0 | 62 | 0.366 | 0.400 | 0.333 | 0.378 | 0.241 | 0.351 | ||||
| 0–0.699 | 437 | 0.179 | 0.224 | 0.173 | 0.226 | 0.189 | 0.288 | ||||
| 0.7–0.899 | 411 | 0.106 | 0.130 | 0.092 | 0.114 | 0.073 | 0.100 | ||||
| 0.9–1 | 240 | 0.103 | 0.130 | 0.104 | 0.130 | 0.078 | 0.134 | ||||
| All | 1110 | 0.147 | 0.192 | 0.138 | 0.188 | 0.127 | 0.214 | ||||
Note: CEC = conditional on estimated class; MAE = mean absolute error; OLS = ordinary least squares; RMSE = root mean squared error; WA = weighted average prediction.. Mixture 1 does not include covariates in the probability model. Mixture 2 model includes SF-12 mental and physical components and age as predictors of mixture probabilities.
Classification Based on Posterior and Predicted Probabilities for Mixture 2 Model
| Class Based on Posterior Probabilities | Class Based on Estimated Probabilities | ||
|---|---|---|---|
| 0 | 1 | 2 | |
| 0 | 2538 | 534 | 1 |
| (82.59) | (17.38) | (0.03) | |
| 1 | 880 | 2566 | 108 |
| (24.76) | (72.20) | (3.04) | |
| 2 | 6 | 269 | 219 |
| (1.21) | (54.45) | (44.33) | |
Note: Numbers in parentheses are row percentages. Classifications based on posterior probabilities use the observed EQ-5D-3L, while classifications based on estimated probabilities assume that the EQ-5D-3L is not observed. Both classifications assign each individual to the class with maximum probability.
Model Comparisons for Those with Diabetes, Heart Disease, or Stroke
| Mixture 2 | ||||||
|---|---|---|---|---|---|---|
| Linear (OLS) | Estimation | Validation | ||||
| Estimation | Validation | WA | CEC | WA | CEC | |
| RMSE | 0.197 | 0.193 | 0.194 | 0.225 | 0.188 | 0.214 |
| MAE | 0.150 | 0.147 | 0.144 | 0.137 | 0.138 | 0.127 |
| Predicted | ||||||
| | 0.654 | 0.658 | 0.654 | 0.670 | 0.658 | 0.677 |
| | 0.252 | 0.245 | 0.252 | 0.300 | 0.246 | 0.288 |
| Minimum | −0.003 | −0.003 | −0.035 | −0.086 | −0.035 | −0.086 |
| Maximum | 1.031 | 1.043 | 0.985 | 1.000 | 0.989 | 1.000 |
Note: CEC = conditional on estimated class; MAE = mean absolute error; OLS = ordinary least squares; RMSE = root mean squared error; WA = weighted average prediction. Mixture 1 does not include covariates in the probability model. Mixture 2 model includes SF-12 mental and physical components and age as predictors of mixture probabilities. Mean, standard deviation, maximum, and minimum for estimation and validation samples are 0.654, 0.320, –0.594, 1 and 0.673, 0.305, –0.594, 1, respectively.
Figure 2Histogram of predicted EQ-5D-3L scores by model. CEC = conditional on estimated class; WA = weighted average prediction. Mixture 1 does not include covariates in the probability model. Mixture 2 model includes SF-12 mental and physical components and age as predictors of mixture probabilities. Any condition includes respondents who reported having diabetes, heart disease, or stroke.