| Literature DB >> 31359210 |
Angela M Stover1,2, Lori D McLeod3, Michelle M Langer2,4, Wen-Hung Chen5, Bryce B Reeve1,6.
Abstract
BACKGROUND: This paper is part of a series comparing different psychometric approaches to evaluate patient-reported outcome (PRO) measures using the same items and dataset. We provide an overview and example application to demonstrate 1) using item response theory (IRT) to identify poor and well performing items; 2) testing if items perform differently based on demographic characteristics (differential item functioning, DIF); and 3) balancing IRT and content validity considerations to select items for short forms.Entities:
Keywords: Item response theory; Measurement; PROMIS®; Scale construction; Scale evaluation
Year: 2019 PMID: 31359210 PMCID: PMC6663947 DOI: 10.1186/s41687-019-0130-5
Source DB: PubMed Journal: J Patient Rep Outcomes ISSN: 2509-8020
Fig. 1Example of an item characteristic curve (ICC) or “trace line”. Vertical axis: probability of endorsement for each response category; Horizontal axis: theta (level of latent trait, e.g., level of depressive symptoms). Response option choices: 0: Never; 1: Rarely; 2: Sometimes; 3: Often; 4: Always
Common terms used in an IRT graded response model
| Term | Abbreviation/Symbol | Description |
|---|---|---|
| Slope parameter | • Also referred to as the discrimination parameter. • Measures the strength of the relationship between the item and the latent variable being measured. • Items with larger slopes are better able to distinguish between individuals with higher and lower levels of the latent variable being measured. | |
| Threshold parameters | • Also known as the location parameters or the difficulty/severity parameters. • Represents the points along theta at which the corresponding response categories are the most discriminating or informative. • Items with higher thresholds represent greater severity of the latent variable being measured. | |
| Theta | • Latent variable being measured (e.g., depression). | |
| Item characteristic curve | ICC | • Also referred to as a “trace line.” • Visual image showing the probability of an item response across the range of theta (latent trait). • Can reveal weak items and overlapping response categories. |
| Test characteristic curve | TCC | • Sum of the ICCs across all items. • Shows the expected total summed score on the scale for each level of theta. |
| Item information function | IIF | • Index of the precision in measurement in distinguishing between individuals with different levels of the latent variable being measured. • More information indicates greater precision and reliability. • Item information is peaked when the slope parameter is high. • Standard error of measurement is inversely related to information. |
| Test information function | TIF | • Sum of the item information functions across all items. • Indicates where along theta the scale has the greatest measurement precision. |
| Item fit | • Diagnostic statistic that examines goodness of fit of the IRT model for each item. • Examines observed and expected response proportions for each item value. • Significant result indicates item misfit. | |
| Local dependence | LD | • Statistic that examines bivariate fit to identify evidence of items that are excessively related given the common underlying construct. • Significant result indicates content redundancy between two or more items. |
| Differential item functioning | DIF | • Measurement bias in an item between two or more groups while holding the latent trait level constant. |
Fig. 2IRTPRO screen shots for graded response model and differential item functioning. a introductory screen. b first part of sequence to enable DIF detection c second part of sequence to enable DIF d DIF Output
Item parameters, fit statistics, local dependence, and DIF results for the 51 PROMIS® depression items
| # | Item Stem | Item Parameters | Item Fit | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| LD | |||||||||||
| 1 | I felt hopeless | 4.46 | 0.38 | 0.97 | 1.53 | 2.23 | 116.2 | 88 | |||
| 2 | I felt worthless | 4.17 | 0.44 | 1.00 | 1.57 | 2.14 | 102.0 | 91 | |||
| 3 | I felt depressed | 3.84 | −0.11 | 0.59 | 1.29 | 2.15 | 120 | ||||
| 4 | I felt unhappy | 3.81 | −0.39 | 0.41 | 1.27 | 2.06 | 151.5 | 120 | |||
| 5 | I felt that nothing could cheer me up | 3.73 | 0.32 | 0.94 | 1.73 | 2.34 | 99.4 | 98 | |||
| 6 | I felt like a failure | 3.73 | 0.19 | 0.72 | 1.58 | 2.05 | 137.2 | 116 | |||
| 7 | I felt helpless | 3.65 | 0.35 | 0.94 | 1.67 | 2.36 | 115.0 | 105 | |||
| 8 | I felt that I wanted to give up on everything | 3.6 | 0.56 | 1.1 | 1.6 | 2.3 | 111.3 | 95 | |||
| 9 | I felt that I had nothing to look forward to | 3.3 | 0.31 | 0.8 | 1.5 | 2.3 | 147.1 | 117 | |||
| 10 | I felt that my life was empty | 3.24 | 0.28 | 0.73 | 1.56 | 2.11 | 148.5 | 120 | |||
| 11 | I felt emotionally exhausted | 3.32 | −0.18 | 0.49 | 1.27 | 2.05 | 126.0 | 133 | |||
| 12 | I felt sad | 3.20 | −0.47 | 0.37 | 1.31 | 2.29 | 131 | ||||
| 13 | I felt I had no reason for living | 3.1 | 0.92 | 1.5 | 1.9 | 2.6 | 83.2 | 71 | 21,29,35 | ||
| 14 | I found that things in my life were overwhelming | 3.1 | −0.09 | 0.6 | 1.5 | 2.3 | 153.8 | 138 | |||
| 15 | I felt that I was not needed | 3.08 | 0.21 | 0.84 | 1.55 | 2.38 | 162.4 | 127 | |||
| 16 | I felt disappointed in myself | 3.05 | −0.35 | 0.39 | 1.31 | 2.09 | 146 | ||||
| 17 | I felt like I needed help for my depression | 3.0 | 0.54 | 1.01 | 1.7 | 2.2 | 105 | ||||
| 18 | I had trouble enjoying the things I used to enjoy | 2.9 | −0.09 | 0.60 | 1.4 | 2.2 | 165.2 | 146 | |||
| 19 | I felt discouraged about the future | 2.92 | −0.26 | 0.37 | 1.31 | 2.09 | 159.4 | 153 | |||
| 20 | I felt that I was to blame for things | 2.88 | −0.02 | 0.73 | 1.63 | 2.38 | 181.3 | 141 | |||
| 21 | I wished I were dead and away from it all | 2.8 | 1.01 | 1.5 | 2.1 | 2.6 | 71 | 13,29,35 | |||
| 22 | I felt upset for no reason | 2.83 | 0.23 | 0.93 | 1.85 | 2.93 | 135.3 | 117 | |||
| 23 | I felt that nothing was interesting | 2.83 | 0.06 | 0.87 | 1.90 | 2.64 | 122.1 | 122 | |||
| 24 | I felt I was not as good as other people | 2.8 | 0.15 | 0.80 | 1.6 | 2.3 | 161.5 | 130 | |||
| 25 | I withdrew from other people | 2.72 | 0.04 | 0.70 | 1.47 | 2.30 | 160.5 | 144 | |||
| 26 | I had trouble making decisions | 2.71 | −0.16 | 0.75 | 1.66 | 2.57 | 174.8 | 140 | 32,38 | ||
| 27 | I had trouble feeling close to people | 2.66 | −0.12 | 0.55 | 1.41 | 2.24 | 153.6 | 155 | |||
| 28 | I felt pessimistic | 2.65 | −0.30 | 0.48 | 1.38 | 2.23 | 198.8 | 155 | |||
| 29 | I felt that others would be better off if I were dead | 2.6 | 1.10 | 1.55 | 2.4 | 3.0 | 70.1 | 67 | 13,21,35 | ||
| 30 | I felt lonely | 2.54 | −0.09 | 0.60 | 1.45 | 2.21 | 186.0 | 158 | |||
| 31 | I felt unloved | 2.51 | 0.26 | 0.90 | 1.76 | 2.48 | 161.8 | 136 | |||
| 32 | I had trouble thinking clearly | 2.51 | −0.15 | 0.78 | 1.84 | 2.84 | 178.7 | 145 | 26,38 | ||
| 33 | I had mood swings | 2.45 | −0.32 | 0.56 | 1.42 | 2.32 | 181.8 | 153 | |||
| 34 | I felt like crying | 2.44 | 0.05 | 0.82 | 1.67 | 2.62 | 178.5 | 148 | 45 | 4 | |
| 35 | I thought about suicide | 2.43 | 1.34 | 1.80 | 2.37 | 2.86 | 74.9 | 54 | 13,21,29 | ||
| 36 | I felt ignored by people | 2.41 | −0.07 | 0.73 | 1.67 | 2.55 | 190.2 | 150 | |||
| 37 | I felt guilty | 2.39 | 0.07 | 0.86 | 1.76 | 2.54 | 160.8 | 139 | |||
| 38 | I had trouble keeping my mind on what I was doing | 2.4 | −0.47 | 0.45 | 1.7 | 2.7 | 187.6 | 152 | 26,32 | ||
| 39 | I felt that everything I did was an effort | 2.3 | −0.21 | 0.60 | 1.5 | 2.4 | 181.3 | 165 | |||
| 40 | My thinking was slower than usual | 2.12 | −0.22 | 0.78 | 2.09 | 3.11 | 155.6 | 139 | 46 | ||
| 41 | I felt slowed down | 2.09 | −0.48 | 0.38 | 1.47 | 2.55 | 168 | 43,44 | |||
| 42 | I felt like being alone | 1.99 | −0.83 | −0.19 | 1.09 | 2.37 | 214.8 | 172 | |||
| 43 | I got tired more easily than usual | 1.99 | −0.56 | 0.24 | 1.37 | 2.39 | 216.3 | 182 | 41,44 | ||
| 44 | I felt that I had no energy | 1.99 | −0.81 | 0.15 | 1.20 | 2.39 | 192.4 | 182 | 41,43 | ||
| 45 | I had crying spells | 1.99 | 0.83 | 1.49 | 2.34 | 3.33 | 122.0 | 109 | 34 | 4 | |
| 46 | I reacted slowly to things that were done or said | 1.9 | −0.18 | 0.93 | 2.3 | 3.3 | 160.8 | 143 | 40 | ||
| 47 | I was unable to do many of my usual activities | 1.8 | 0.18 | 1.08 | 2.2 | 3.6 | 146 | ||||
| 48 | I had little desire to eat | 1.48 | 0.29 | 1.39 | 2.70 | 3.86 | 175.3 | 151 | |||
| 49 | I disliked the way my body looked | 1.39 | −1.07 | −0.31 | 0.87 | 1.82 | 217 | 4 | |||
| 50 | I ate more than usual | 1.19 | −0.54 | 0.70 | 2.33 | 3.73 | 180 | ||||
| 51 | I lost weight without trying | 0.57 | 1.90 | 4.24 | 7.03 | 9.39 | 151.8 | 124 | |||
Note. Italicized and values are significant at p < 0.01 after Benjamini-Hochberg correction for multiplicity
LD Local dependence detected with indicated item numbers
The current calibrations are provided for didactic purposes and are not intended to replace the official PROMIS® parameters or to be used for research
Fig. 3Example of an item showing uniform b-DIF. Women (dotted lines) consistently endorsed higher levels of depression (their curves are shifted to the left of men in every response category)
Comparison of two potential short forms selected through a combination of psychometric properties and content validity considerations
| DSM-V Criterion | Item | Short Form 1 | Short Form 1 | Short Form 2 | Short Form 2 |
|---|---|---|---|---|---|
| 1. Depressed mood most of day | I felt unhappy | x | 4 | x | 4 |
| I felt depressed | x | 3 | |||
| 2. Little interest or pleasure in doing things | I had trouble enjoying the things I used to enjoy | x | 18 | ||
| I felt I had nothing to look forward to | x | 9 | |||
| 3. Symptoms: psychomotor retardation or agitation | I reacted slowly to things that were done or said | x | 46 | ||
| 4. Symptoms: insomnia/ hypersomnia | |||||
| 5. Symptoms: fatigue | I felt that everything I did was an effort | x | 39 | ||
| 6. Symptoms: worthlessness | I felt I was not as good as other people | x | 24 | ||
| I felt worthless | x | 2 | |||
| I felt like a failure | x | 6 | |||
| 7. Symptoms: excessive guilt | I felt guilty | x | 37 | ||
| 8. Symptoms: diminished ability to think/ concentrate or indecisiveness | I had trouble making decisions | x | 26 | ||
| 9. Symptoms: suicidal ideation | I felt I had no reason for living | x | 13 | ||
| I felt like I wanted to give up on everything | x | 8 | |||
| 10. Symptoms: significant weight gain or loss/appetite loss | • I had little desire to eat • I ate more than usual • I lost weight without trying | ||||
| 11. Significant distress or impairment | I felt that nothing could cheer me up | x | 5 | ||
| I felt emotionally exhausted | x | 11 | |||
| Symptoms: hopeless | I felt hopeless | x | 1 | x | 1 |
| Symptoms: helpless | I felt helpless | x | 7 | ||
| Symptoms: withdrew from others | I withdrew from others | x | 25 | ||
Model fit changes for short form selection
| Model | Cronbach’s alpha | AIC | BIC | -2log likelihood | Δ in -2log likelihood | RMSEA | M2 (df) |
|---|---|---|---|---|---|---|---|
| 51 items | 0.983 | 65,230.18 | 66,432.60 | 64,720.18 | – | 0.43 | 163,378.86 (1071)*** |
| Short Form 1 Prioritizing Content (10 items) | 0.946 | 13,762.99 | 14,234.40 | 13,562.99 | 51,157.19 | 0.01 | 1469.53 (1420) |
| Short Form 2 Prioritizing Precision (10 items) | 0.945 | 13,825.12 | 14,296.54 | 13,625.12 | 51,095.06 | 0.01 | 1513.90 (1420) |
Cronbach’s alpha = measure of internal consistency/reliability from Classical Test Theory (criterion: ≥.90).
AIC Akaike information criterion (criterion: the lower the number, the better the fit)
BIC Bayesian information criterion (criterion: the lower the number, the better the fit)
-2log likelihood = if models are nested, subtract at each step to see if step is significant
RMSEA Root mean square error of approximation (criterion: ≤ .05).
M2 = model fit.
***p < .001 (Note: a significant value for model fit indicates that the model does NOT fit well)
df Degrees of freedom.
Fig. 4Item characteristic curves and information curves for items in a Short Form 1 b Short Form 2. Solid lines: item characteristic curves; dotted lines: information curves
Fig. 5Comparing test information curves for short forms 1 and 2. a Short Form 1 b Short Form 2