| Literature DB >> 18684327 |
Stefan J Cano1, Thomas T Warner, Alan J Thompson, Kailash P Bhatia, Ray Fitzpatrick, Jeremy C Hobart.
Abstract
BACKGROUND: The United States Food and Drug Administration (FDA) are currently producing guidelines for the scientific adequacy of patient reported outcome measures (PROMs) in clinical trials, which will have implications for the selection of scales used in future clinical trials. In this study, we examine how the Cervical Dystonia Impact Profile (CDIP-58), a rigorous Rasch measurement developed neurologic PROM, stands up to traditional psychometric criteria for three reasons: 1) provide traditional psychometric evidence for the CDIP-58 in line with proposed FDA guidelines; 2) enable researchers and clinicians to compare it with existing dystonia PROMs; and 3) help researchers and clinicians bridge the knowledge gap between old and new methods of reliability and validity testing.Entities:
Mesh:
Year: 2008 PMID: 18684327 PMCID: PMC2538506 DOI: 10.1186/1477-7525-6-58
Source DB: PubMed Journal: Health Qual Life Outcomes ISSN: 1477-7525 Impact factor: 3.186
Figure 1Measurement model of the CDIP-58.
Adapted from table 4 of the FDA draft guidelines for measurement properties reviewed for PRO instruments used in clinical trials
| Reliability | Test-retest | ✓ |
| Internal consistency | Whether the items in a domain are intercorrelated, as evidenced by an internal consistency statistic (e.g., coefficient alpha) | ✓ |
| Inter-interviewer reproducibility (for interviewer-administered PROs only) | Agreement between responses when the PRO is administered by two or more different interviewers | NA |
| Validity | Content-related | ✓* |
| Ability to measure the concept (also known as construct-related validity; can include tests for discriminant, convergent, and known-groups validity) | Whether relationships among items, domains, and concepts conform to what is predicted by the conceptual framework for the PRO instrument itself and its validation hypotheses. | ✓ |
| Ability to predict future outcomes (also known as predictive validity) | Whether future events or status can be predicted by changes in the PRO scores | ✗ |
| Ability to detect change | Includes calculations of effect size and standard error of measurement among others | ✓** |
| Interpretability | Smallest difference that is considered clinically important; this can be a specified difference (the minimum important difference (MID)) or, in some cases, any detectable difference. The MID is used as a benchmark to interpret mean score differences between treatment arms in a clinical trial | ✓/✗*** |
| Responder definition – used to identify responders in clinical trials for analyzing differences in the proportion of responders between treatment arms | Change in score that would be clear evidence that an individual patient experienced a treatment benefit. Can be based on experience with the measure using a distribution-based approach, a clinical or non-clinical anchor, an empirical rule, or a combination of approaches. | NA |
✓ = tested; ✗ = not tested; * Reported in Cano et al 2004 [6]; ** Reported in Cano et al 2006 [28]; *** Although not including MID in our responsiveness paper (Cano et al, 2006 [28]), we include a comparison of relative responsiveness to existing PROs used in CD research in order to increase the interpretability of CDIP-58 change scores against these measures.
Respondent characteristics
| Number | 391 |
| Sex | |
| Female | 72 |
| Age | |
| Mean (SD) | |
| Range | |
| Ethnicity | |
| White | |
| Years since CD onset | |
| Mean (SD) | |
| Range | |
| Employment status | |
| Retired | 35 |
| Employed | 29 |
| Unable to work due to CD | 20 |
| Treatment | |
| Botulinum Injections | 90 |
| Drug therapy | 53 |
| Alternative treatment | 0 |
| Surgery | 0 |
| External measures (Mean; SD) | |
| SF-36** Bodily Pain | 46 (26) |
| SF-36 Social Functioning | 55 (31) |
| SF-36 Physical Functioning | 57 (29) |
| SF-36 Mental Health | 62 (20) |
| GHQ*** Anxiety | 38 (19) |
| GHQ Depression | 14 (19) |
| HADS*** Anxiety | 43 (24) |
| HADS Depression | 33 (19) |
*All values are percentages unless specified otherwise; ** range 0-100, *** converted to 0-100 (original range 0-21)
Convergent and discriminant construct validity of the CDIP-58
| Instrument | Scale/ | ||||||||
| Validity | Head and | Pain and | Upper | Walking | Sleep | Annoyance | Mood | Psycho- | |
| CDIP-58 | Head and Neck Symptoms | ||||||||
| Pain and Discomfort | - | ||||||||
| Upper Limb Activities | 0.64 | 0.53 | - | ||||||
| Walking | 0.63 | 0.65 | - | ||||||
| Sleep | 0.50 | 0.53 | 0.54 | 0.50 | - | ||||
| Annoyance | 0.63 | 0.55 | 0.60 | 0.56 | 0.52 | - | - | ||
| Mood | 0.52 | 0.48 | 0.54 | 0.53 | 0.49 | - | |||
| Psychosocial Functioning | 0.67 | 0.51 | 0.55 | 0.53 | 0.44 | 0.69 | - | ||
| SF-36* | Bodily Pain | -0.60 | -0.72 | -0.62 | -0.59 | -0.54 | -0.51 | -0.48 | |
| Social Functioning | -0.56 | -0.56 | -0.70 | -0.62 | -0.55 | -0.63 | -0.59 | ||
| Physical Functioning | -0.45 | -0.57 | -0.53 | -0.45 | |||||
| Mental Health | -0.43 | -0.43 | -0.60 | ||||||
| HADS | Anxiety | 0.34 | 0.31 | 0.48 | |||||
| Depression | 0.41 | 0.42 | 0.50 | ||||||
| GHQ** | Anxiety | 0.30 | 0.22 | 0.30 | |||||
| Depression | 0.35 | 0.18 | 0.43 | ||||||
| Demo-graphic | Age | -0.05 | -0.03 | -0.02 | 0.01 | 0.00 | -0.04 | -0.07 | 0.00 |
| Variables | Sex | -0.04 | -0.17 | 0.02 | -0.07 | 0.00 | 0.06 | 0.04 | 0.02 |
| Education | -0.04 | -0.05 | -0.01 | -0.07 | -0.02 | -0.03 | -0.03 | -0.08 | |
*4/8 scales omitted from table as not applicable to analyses; **2/8 scales omitted from table as not applicable to analyses
aCorrelations falling outside of the predicted range; bCorrelations consistent with predictions
Data quality, scaling assumptions, targeting, reliability and validity
| Head and | Pain and | Upper | Walking | Sleep | Annoyance | Mood | Psycho- | |
| Data quality | ||||||||
| Item missing data (range %) | 1–2 | 3–4 | 2–4 | 3–4 | 2 | 2–4 | 2–4 | 1–2 |
| Computable scale scores (%) | 98 | 97 | 98 | 97 | 98 | 98 | 96 | 99 |
| Corrected item-total correlations | ||||||||
| Mean | 0.76 | 0.82 | 0.78 | 0.87 | 0.89 | 0.83 | 0.78 | 0.82 |
| Range | 0.67–0.81 | 0.70–0.87 | 0.64–0.87 | 0.82–0.91 | 0.84–0.93 | 0.79–0.89 | 0.68–0.84 | 0.70–0.90 |
| Item-other scale correlations | ||||||||
| Range | 0.37–0.72 | 0.40–0.75 | 0.43–0.72 | 0.42–0.75 | 0.39–0.52 | 0.42–0.73 | 0.39–0.82 | 0.38–0.69 |
| Scaling successes (%) | 83 | 80 | 100 | 100 | 100 | 100 | 86 | 100 |
| Targeting | ||||||||
| Mean score | 57.6 | 53.8 | 42.9 | 37.1 | 33.3 | 37.9 | 29.7 | 49.2 |
| Standard deviation | 25.6 | 27.9 | 27.6 | 31.5 | 31.5 | 26.9 | 24.9 | 29.5 |
| Score range | 0–100 | 0–100 | 0–100 | 0–100 | 0–100 | 0–100 | 0–100 | 0–100 |
| Floor/ceiling effect (%) | 1/6 | 2/7 | 7/0 | 17/4 | 27/7 | 7/4 | 13/1 | 5/4 |
| Skewness | -0.23 | -0.15 | 0.11 | 0.44 | 0.66 | 0.51 | 0.82 | 0.02 |
| Reliability (n = 377–385) | ||||||||
| Cronbach's alpha | 0.92 | 0.93 | 0.94 | 0.97 | 0.96 | 0.96 | 0.95 | 0.96 |
| TRT (ICC; n = 92–95) | 0.85 | 0.83 | 0.94 | 0.95 | 0.86 | 0.83 | 0.85 | 0.89 |