Eleanor M Pullenayegum1, Kuhan Perampaladas2, Kathryn Gaebel2, Brett Doble3, Feng Xie2. 1. Child Health Evaluative Sciences, Hospital for Sick Children, Toronto, ON, Canada. eleanor.pullenayegum@sickkids.ca. 2. Department of Clinical Epidemiology and Biostatistics, McMaster University, Hamilton, ON, Canada. 3. Faculty of Business and Economics, Centre for Health Economics, Monash University, Clayton, VIC, Australia.
Abstract
BACKGROUND: EQ-5D-3L scoring algorithms vary amongst countries, not only in the values of regression coefficients but also in the independent variables included in the regression model (hereafter referred to as model specification). It is unclear how much of this variation is due to differences in health state selection, the relative frequencies with which health states were valued, and model diagnostics, rather than to genuine differences in population preferences. METHODS: Using aggregate data from a recent review, we noted all model specifications that were used. For each country the country's own model was re-fitted, as were all other model specifications. This was done twice: once using all valued health states for each country, and again using a common set of 17 health states for all countries. Goodness of fit was assessed using the following model diagnostics: mean absolute error (MAE), mean squared error (MSE) and rho (the Pearson correlation coefficient between predicted and observed mean utilities), both with and without leave-one-out cross-validation. RESULTS: Thirteen countries contributed data. Even when using a common set of health states, the preferred model varied across countries. However, choice of health states did impact the preferred model specification: when using cross-validation, the preferred specification changed in five of ten countries when moving from 17 health states to all valued health states. The relative frequency with which health states were valued had little impact on the preferred model. CONCLUSIONS: Variation in choices of health states to value is responsible for some, but not all, of the observed heterogeneity in model specification. Relative frequency of health state valuation and choice of model diagnostic has a limited impact on model preference, however, use of cross-validation has a substantial impact. The use of cross-validation, implemented through omitting health states rather than respondents, is recommended as one approach to assessing model fit.
BACKGROUND: EQ-5D-3L scoring algorithms vary amongst countries, not only in the values of regression coefficients but also in the independent variables included in the regression model (hereafter referred to as model specification). It is unclear how much of this variation is due to differences in health state selection, the relative frequencies with which health states were valued, and model diagnostics, rather than to genuine differences in population preferences. METHODS: Using aggregate data from a recent review, we noted all model specifications that were used. For each country the country's own model was re-fitted, as were all other model specifications. This was done twice: once using all valued health states for each country, and again using a common set of 17 health states for all countries. Goodness of fit was assessed using the following model diagnostics: mean absolute error (MAE), mean squared error (MSE) and rho (the Pearson correlation coefficient between predicted and observed mean utilities), both with and without leave-one-out cross-validation. RESULTS: Thirteen countries contributed data. Even when using a common set of health states, the preferred model varied across countries. However, choice of health states did impact the preferred model specification: when using cross-validation, the preferred specification changed in five of ten countries when moving from 17 health states to all valued health states. The relative frequency with which health states were valued had little impact on the preferred model. CONCLUSIONS: Variation in choices of health states to value is responsible for some, but not all, of the observed heterogeneity in model specification. Relative frequency of health state valuation and choice of model diagnostic has a limited impact on model preference, however, use of cross-validation has a substantial impact. The use of cross-validation, implemented through omitting health states rather than respondents, is recommended as one approach to assessing model fit.
Keywords:
EQ-5D; Health utility; Heterogeneity; Scoring algorithm
Authors: Dominik Golicki; Michał Jakubczyk; Maciej Niewada; Witold Wrona; Jan J V Busschbach Journal: Value Health Date: 2009-09-10 Impact factor: 5.725
Authors: James W Shaw; A Simon Pickard; Shengsheng Yu; Shijie Chen; Vincent G Iannacchione; Jeffrey A Johnson; Stephen Joel Coons Journal: Value Health Date: 2009-12-04 Impact factor: 5.725
Authors: Nikki McCaffrey; Billingsley Kaambwa; David C Currow; Julie Ratcliffe Journal: Health Qual Life Outcomes Date: 2016-09-20 Impact factor: 3.186
Authors: Long Hoang Nguyen; Bach Xuan Tran; Quynh Ngoc Hoang Le; Tung Thanh Tran; Carl A Latkin Journal: Health Qual Life Outcomes Date: 2017-10-11 Impact factor: 3.186