BACKGROUND AND OBJECTIVE: Participants rated clinical marker states (CMS) to make respondents familiar with the task of preference instruments, ground their ratings in relation to other health states, and help investigators interpret patient ratings. The objective was to assess the reliability of CMS using appropriate reliability statistics. STUDY DESIGN AND SETTING: Eighty-one patients rated CMSs for mild, moderate, and severe chronic respiratory disease using the feeling thermometer (FT) and the standard gamble (SG) before and after a 3-month respiratory rehabilitation program. To assess reliability we used (a) intraclass correlation coefficients (ICC) with the variance between CMSs as signal and the variance between raters, the variance within raters, and the signal as noise; (b) scatter plots; and (c) Bland-Altman plots. RESULTS: ICCs were 0.47 for the FT and 0.37 for the SG. Scatter and Bland-Altman plots showed large between- and within-person variability; 64.2% and 11.3% of the CMSs ratings were in the correct order on both occasions on the FT and SG, respectively. CONCLUSION: Our results suggest moderate reliability of CMSs ratings for the FT and poor reliability for the SG, which may explain their lack of improving the SG's measurement properties. Investigators should use appropriate reliability statistics when addressing related issues.
BACKGROUND AND OBJECTIVE:Participants rated clinical marker states (CMS) to make respondents familiar with the task of preference instruments, ground their ratings in relation to other health states, and help investigators interpret patient ratings. The objective was to assess the reliability of CMS using appropriate reliability statistics. STUDY DESIGN AND SETTING: Eighty-one patients rated CMSs for mild, moderate, and severe chronic respiratory disease using the feeling thermometer (FT) and the standard gamble (SG) before and after a 3-month respiratory rehabilitation program. To assess reliability we used (a) intraclass correlation coefficients (ICC) with the variance between CMSs as signal and the variance between raters, the variance within raters, and the signal as noise; (b) scatter plots; and (c) Bland-Altman plots. RESULTS: ICCs were 0.47 for the FT and 0.37 for the SG. Scatter and Bland-Altman plots showed large between- and within-person variability; 64.2% and 11.3% of the CMSs ratings were in the correct order on both occasions on the FT and SG, respectively. CONCLUSION: Our results suggest moderate reliability of CMSs ratings for the FT and poor reliability for the SG, which may explain their lack of improving the SG's measurement properties. Investigators should use appropriate reliability statistics when addressing related issues.
Authors: Holger J Schünemann; Roger Goldstein; M Jeffery Mador; Douglas McKim; Elisabeth Stahl; Lauren E Griffith; Ahmed M Bayoumi; Peggy Austin; Gordon H Guyatt Journal: Qual Life Res Date: 2006-02 Impact factor: 4.147
Authors: Pablo Alonso-Coello; Shanil Ebrahim; Gordon H Guyatt; Kari A O Tikkinen; Mark H Eckman; Ignacio Neumann; Sarah D McDonald; Elie A Akl; Shannon M Bates Journal: BMC Pregnancy Childbirth Date: 2012-05-30 Impact factor: 3.007
Authors: Lara Siebeling; Gerben ter Riet; Willem M van der Wal; Ronald B Geskus; Marco Zoller; Patrick Muggensturm; Irena Joleska; Milo A Puhan Journal: BMC Pulm Med Date: 2009-05-06 Impact factor: 3.317
Authors: Terri R Fried; Peter H Van Ness; Amy L Byers; Virginia R Towle; John R O'Leary; Joel A Dubin Journal: J Gen Intern Med Date: 2007-04 Impact factor: 5.128