| Literature DB >> 19419553 |
Stavros Petrou1, Jane Morrell, Helen Spiby.
Abstract
BACKGROUND: Multi-attribute utility measures are preference-based health-related quality of life measures that have been developed to inform economic evaluations of health care interventions. The objective of this study was to compare the empirical validity of two multi-attribute utility measures (EQ-5D and SF-6D) based on hypothetical preferences in a large maternity population in England.Entities:
Mesh:
Year: 2009 PMID: 19419553 PMCID: PMC2687423 DOI: 10.1186/1477-7525-7-40
Source DB: PubMed Journal: Health Qual Life Outcomes ISSN: 1477-7525 Impact factor: 3.186
Descriptive statistics of EQ-5D and SF-6D utility scores (n = 493)
| EQ-5D utility score | SF-6D utility score | |
| Mean | 0.861 | 0.809 |
| Standard deviation | (0.181) | (0.140) |
| Median | 0.848 | 0.830 |
| Inter-quartile range | (0.796, 1.000) | (0.706, 0.938) |
| Minimum | -0.077 | 0.374 |
| Maximum | 1.000 | 1.000 |
| 95% CI | (0.844, 0.877) | (0.796, 0.822) |
| 99% CI | (0.838, 0.883) | (0.792, 0.826) |
| Mean difference (95% CI) | 0.052 (0.040, 0.064)* | |
CI denotes confidence interval.
* p-value < 0.001
Relationship between mean EQ-5D utility scores and self-reported health status (n = 493)
| Group | Overall utility score | Self-reported health status | |||||
| Excellent | Very good | Good | Fair | Poor | |||
| 0.861 | 0.964 | 0.908 | 0.819 | 0.651 | 0.366 | <0.001 | |
| 17–24 | 0.848 | 0.949 | 0.887 | 0.815 | 0.583 | -0.077 | <0.001 |
| 25–34 | 0.866 | 0.973 | 0.912 | 0.813 | 0.672 | 0.678 | <0.001 |
| 35–44 | 0.855 | 0.954 | 0.923 | 0.861 | 0.659 | 0.121 | <0.001 |
| White | 0.864 | 0.968 | 0.911 | 0.821 | 0.659 | 0.366 | <0.001 |
| Non-white | 0.793 | 0.840 | 0.835 | 0.804 | 0.508 | - | 0.125 |
| Yes | 0.876 | 0.963 | 0.915 | 0.825 | 0.714 | 0.455 | <0.001 |
| No | 0.798 | 0.977 | 0.875 | 0.804 | 0.475 | -0.077 | <0.001 |
| Owner occupier | 0.880 | 0.964 | 0.913 | 0.821 | 0.723 | 0.490 | <0.001 |
| Rented | 0.807 | 0.965 | 0.888 | 0.817 | 0.565 | 0.243 | <0.001 |
| Yes | 0.894 | 0.971 | 0.912 | 0.852 | 0.712 | 0.678 | <0.001 |
| No | 0.800 | 0.941 | 0.894 | 0.775 | 0.626 | 0.055 | <0.001 |
| Singleton | 0.859 | 0.964 | 0.907 | 0.818 | 0.647 | 0.366 | <0.001 |
| Twin | 0.970 | 1.000 | 1.000 | 1.000 | 0.848 | - | - |
| Yes | 0.867 | 0.965 | 0.907 | 0.824 | 0.651 | - | <0.001 |
| No | 0.847 | 0.961 | 0.910 | 0.809 | 0.623 | 0.366 | <0.001 |
* ANOVA.
All tests for linear trend were statistically significant (p < 0.05) with the exception of non-white ethnic (p = 0.068) and twin (p = 0.053) sub-groups.
Relationship between mean SF-6D utility scores and self-reported health status (n = 493)
| Group | Overall utility score | Self-reported health status | |||||
| 0.809 | 0.916 | 0.856 | 0.741 | 0.652 | 0.507 | <0.001 | |
| 17–24 | 0.815 | 0.916 | 0.849 | 0.742 | 0.656 | 0.374 | <0.001 |
| 25–34 | 0.806 | 0.911 | 0.856 | 0.735 | 0.655 | 0.583 | <0.001 |
| 35–44 | 0.810 | 0.954 | 0.866 | 0.767 | 0.631 | 0.461 | <0.001 |
| White | 0.816 | 0.924 | 0.859 | 0.750 | 0.656 | 0.507 | <0.001 |
| Non-white | 0.690 | 0.709 | 0.790 | 0.666 | 0.564 | - | 0.217 |
| Yes | 0.817 | 0.914 | 0.862 | 0.734 | 0.676 | 0.534 | <0.001 |
| No | 0.776 | 0.928 | 0.828 | 0.759 | 0.583 | 0.374 | <0.001 |
| Owner occupier | 0.821 | 0.919 | 0.860 | 0.728 | 0.683 | 0.553 | <0.001 |
| Rented | 0.778 | 0.903 | 0.837 | 0.771 | 0.613 | 0.461 | <0.001 |
| Yes | 0.831 | 0.924 | 0.856 | 0.758 | 0.678 | 0.583 | <0.001 |
| No | 0.767 | 0.886 | 0.852 | 0.716 | 0.638 | 0.432 | <0.001 |
| Singleton | 0.809 | 0.915 | 0.855 | 0.741 | 0.652 | 0.507 | <0.001 |
| Twin | 0.822 | 1.000 | 0.902 | 0.681 | 0.628 | - | 0.109 |
| Yes | 0.814 | 0.915 | 0.858 | 0.749 | 0.641 | - | <0.001 |
| No | 0.799 | 0.919 | 0.852 | 0.724 | 0.698 | 0.507 | <0.001 |
* ANOVA.
All tests for linear trend were statistically significant (p < 0.05) with the exception of the non-white ethnic sub-group (p = 0.077).
Figure 1Scatter plot of paired EQ-5D and SF-6D utility scores.
Efficiency of multi-attribute utility measures to detect differences in self-reported health status; all women (n = 493)
| Measure | Categorisation of self-reported health status | Utility score | Relative efficiencyb | ROC curve | ||||
| Mean | (SD) | Areac | 95% CI | |||||
| EQ-5D | Excellent | 0.964 | (0.085) | 9.334 | <0.001 | 1.000 | 0.721* | (0.666, 0.776) |
| Very good, good, fair or poor | 0.837 | (0.189) | ||||||
| SF-6D | Excellent | 0.916 | (0.091) | 10.604 | <0.001 | 1.291 | 0.798* | (0.748, 0.849) |
| Very good, good, fair or poor | 0.784 | (0.138) | ||||||
| EQ-5D | Excellent or very good | 0.925 | (0.119) | 9.156 | <0.001 | 1.000 | 0.756* | (0.709, 0.802) |
| Good, fair or poor | 0.765 | (0.213) | ||||||
| SF-6D | Excellent or very good | 0.874 | (0.108) | 14.205 | <0.001 | 2.407 | 0.841* | (0.804, 0.877) |
| Good, fair or poor | 0.712 | (0.125) | ||||||
| EQ-5D | Excellent, very good or good | 0.890 | (0.145) | 7.222 | <0.001 | 1.000 | 0.849* | (0.790, 0.908) |
| Fair or poor | 0.616 | (0.258) | ||||||
| SF-6D | Excellent, very good or good | 0.830 | (0.127) | 10.742 | <0.001 | 2.212 | 0.852* | (0.800, 0.905) |
| Fair or poor | 0.634 | (0.119) | ||||||
| EQ-5D | Excellent, very good, good or fair | 0.867 | (0.169) | 3.469 | 0.018 | 1.000 | 0.814* | (0.633, 0.996) |
| Poor | 0.366 | (0.353) | ||||||
| SF-6D | Excellent, very good, good or fair | 0.813 | (0.136) | 7.938 | <0.001 | 5.236 | 0.847* | (0.686, 1.000) |
| Poor | 0.507 | (0.093) | ||||||
SD denotes standard deviation. ROC denotes receiver operating characteristic. CI denotes confidence interval.
a Not assuming equality of variance as Levene test showed statistically significant differences in variances between self-reported health status groups.
b Relative efficiency statistic is referenced to 1.0 for the EQ-5D measure. A value higher than 1.0 indicates that the SF-6D is more efficient than the EQ-5D in detecting differences between women in terms of their self-reported health status.
c Area under receiver operating characteristic (ROC) curves; * p < 0.05 indicates that area under the ROC curve was statistically significantly greater than 0.5 and that measure has discriminatory power.
Efficiency of multi-attribute utility measures to detect differences in self-reported health status; women for whom both utility scores were between 0.296 and 1.0 (n = 481)
| Measure | Categorisation of self-reported health status | Utility score | Relative efficiencyb | ROC curve | ||||
| EQ-5D | Excellent | 0.964 | (0.085) | 8.682 | <0.001 | 1.000 | 0.712* | (0.655, 0.768) |
| Very good, good, fair or poor | 0.861 | (0.137) | ||||||
| SF-6D | Excellent | 0.916 | (0.091) | 10.038 | <0.001 | 1.337 | 0.792* | (0.741, 0.844) |
| Very good, good, fair or poor | 0.794 | (0.129) | ||||||
| EQ-5D | Excellent or very good | 0.931 | (0.103) | 10.029 | <0.001 | 1.000 | 0.747* | (0.699, 0.795) |
| Good, fair or poor | 0.804 | (0.143) | ||||||
| SF-6D | Excellent or very good | 0.876 | (0.106) | 13.816 | <0.001 | 1.898 | 0.835* | (0.797, 0.872) |
| Good, fair or poor | 0.725 | (0.114) | ||||||
| EQ-5D | Excellent, very good or good | 0.897 | (0.124) | 8.672 | <0.001 | 1.000 | 0.829* | (0.763, 0.896) |
| Fair or poor | 0.716 | (0.126) | ||||||
| SF-6D | Excellent, very good or good | 0.833 | (0.123) | 9.663 | <0.001 | 1.242 | 0.834* | (0.775, 0.894) |
| Fair or poor | 0.661 | (0.105) | ||||||
| EQ-5D | Excellent, very good, good or fair | 0.882 | (0.134) | 6.482 | 0.018 | 1.000 | 0.720 | (0.471, 0.970) |
| Poor | 0.678 | (0.053) | ||||||
| SF-6D | Excellent, very good, good or fair | 0.818 | (0.l30) | 10.121 | 0.006 | 2.438 | 0.770* | (0.544, 0.997) |
| Poor | 0.583 | (0.039) | ||||||
SD denotes standard deviation. ROC denotes receiver operating characteristic. CI denotes confidence interval.
a Not assuming equality of variance as Levene test showed statistically significant differences in variances between self-reported health status groups.
b Relative efficiency statistic is referenced to 1.0 for the EQ-5D measure. A value higher than 1.0 indicates that the SF-6D is more efficient than the EQ-5D in detecting differences between women in terms of their self-reported health status.
c Area under receiver operating characteristic (ROC) curves; * p < 0.05 indicates that area under the ROC curve was statistically significantly greater than 0.5 and that measure has discriminatory power.
Efficiency of multi-attribute utility measures to detect differences in postnatal depression
| Measure | Categorisation of postnatal depression risk score | Utility score | Relative efficiencyb | ROC curve | ||||
| EQ-5D | EPDS score < 13 | 0.885 | (0.147) | 4.332 | <0.001 | 1.000 | 0.696* | (0.615, 0.777) |
| EPDS score ≥ 13 | 0.738 | (0.244) | ||||||
| SF-6D | EPDS score < 13 | 0.830 | (0.129) | 7.008 | <0.001 | 2.617 | 0.767* | (0.697, 0.837) |
| EPDS score ≥ 13 | 0.696 | (0.132) | ||||||
| EQ-5D | EPDS score < 10 | 0.896 | (0.142) | 5.404 | <0.001 | 1.000 | 0.679* | (0.619, 0.739) |
| EPDS score ≥ 10 | 0.780 | (0.208) | ||||||
| SF-6D | EPDS score < 10 | 0.843 | (0.125) | 8.192 | <0.001 | 2.298 | 0.749* | (0.696, 0.802) |
| EPDS score ≥ 10 | 0.724 | (0.133) | ||||||
| EQ-5D | EPDS score < 13 | 0.891 | (0.130) | 3.699 | <0.001 | 1.000 | 0.664* | (0.579, 0.750) |
| EPDS score ≥ 13 | 0.806 | (0.152) | ||||||
| SF-6D | EPDS score < 13 | 0.833 | (0.126) | 6.509 | <0.001 | 3.096 | 0.760* | (0.685, 0.835) |
| EPDS score ≥ 13 | 0.708 | (0.123) | ||||||
| EQ-5D | EPDS score < 10 | 0.901 | (0.128) | 5.110 | <0.001 | 1.000 | 0.662* | (0.600, |
| EPDS score ≥ 10 | 0.821 | (0.141) | 0.724) | |||||
| SF-6D | EPDS score < 10 | 0.846 | (0.122) | 7.800 | <0.001 | 2.330 | 0.743* | (0.688, 0.798) |
| EPDS score ≥ 10 | 0.735 | (0.126) | ||||||
EPDS denotes Edinburgh Postnatal Depression Scale. SD denotes standard deviation. ROC denotes receiver operating characteristic. CI denotes confidence interval.
a Not assuming equality of variance as Levene test showed statistically significant differences in variances between self-reported health status groups.
b Relative efficiency statistic is referenced to 1.0 for the EQ-5D measure. A value higher than 1.0 indicates that the SF-6D is more efficient than the EQ-5D in detecting differences between women in terms of their self-reported health status.
c Area under receiver operating characteristic (ROC) curves; * p < 0.05 indicates that area under the ROC curve was statistically significantly greater than 0.5 and that measure has discriminatory power.