| Literature DB >> 28320462 |
Gregor Liegl1, Barbara Gandek2,3, H Felix Fischer4,5, Jakob B Bjorner6,7,8, John E Ware2,3, Matthias Rose4,2, James F Fries9, Sandra Nolte4,10.
Abstract
BACKGROUND: Physical function (PF) is a core patient-reported outcome domain in clinical trials in rheumatic diseases. Frequently used PF measures have ceiling effects, leading to large sample size requirements and low sensitivity to change. In most of these instruments, the response category that indicates the highest PF level is the statement that one is able to perform a given physical activity without any limitations or difficulty. This study investigates whether using an item format with an extended response scale, allowing respondents to state that the performance of an activity is easy or very easy, increases the range of precise measurement of self-reported PF.Entities:
Keywords: Ceiling effects; Item format; Item information; Item-response theory; Measurement range; Patient-reported outcomes; Physical function; Response scale
Mesh:
Year: 2017 PMID: 28320462 PMCID: PMC5359818 DOI: 10.1186/s13075-017-1273-5
Source DB: PubMed Journal: Arthritis Res Ther ISSN: 1478-6354 Impact factor: 5.156
Experimental PROMIS PF items for five activities administered in three different item formats
| Item format | Item | Item stem | Item content | Number and wording of response options | Attribution to health |
|---|---|---|---|---|---|
| A | A1 | Are you able to … | … do two hours of physical labor? | 5 Without any difficulty | No |
| A2a | … do yard work like raking leaves, weeding or pushing a lawn mower? | ||||
| A3 | … climb several flights of stairs? | ||||
| A4a | … go for a walk of at least 15 minutes? | ||||
| A5a | … open previously opened jars? | ||||
| B | B1a | Does your health now limit you in … | … doing two hours of physical labor? | 5 Not at all | Yes |
| B2 | … doing yard work like raking leaves, weeding or pushing a lawn mower? | ||||
| B3a | … climbing several flights of stairs? | ||||
| B4 | … going for a walk of at least 15 minutes? | ||||
| B5 | … opening previously opened jars? | ||||
| C | C1 | How difficult is it for you to … | … do two hours of physical labor? | 6 Very easy | No |
| C2 | … do yard work like raking leaves, weeding or pushing a lawn mower? | ||||
| C3 | … climb several flights of stairs? | ||||
| C4 | … go for a walk of at least 15 minutes? | ||||
| C5 | … open previously opened jars? |
a Item is part of the final Patient Reported Outcomes Measurement Information System Physical Function (PROMIS PF) item bank version 1.0
Psychometric results for the experimental items presented in three different item formats
| Item | Formatb | Content | Slope | Thresholdc | Item fit: | Imax (at | Area under the curvef | Percentage floor/percentage ceilingg | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | Mean | ||||||||
| A1 | A | Do 2 hours of physical labor | 3.49 | 38.6 | 42.9 | 47.8 | 54.9 | 46.1 | 0.6523 | 3.71 (T = 42) | 92.9 | 10.4/41.6 | |
| B1a | B | 4.53 | 38.0 | 43.0 | 48.4 | 53.1 | 45.6 | 0.1133 | 5.93 (T = 49) | 132.9 | 10.0/42.7 | ||
| C1 | C | 4.01 | 37.7 | 42.0 | 46.3 | 52.7 | 59.8 | 47.7 | 0.0358 | 4.88 (T = 42) | 140.3 | 10.2/19.6 | |
| A2a | A | Do yard work | 4.09 | 36.3 | 40.1 | 44.3 | 50.7 | 42.9 | 0.1473 | 5.10 (T = 40) | 111.1 | 6.6/57.3 | |
| B2 | B | 4.79 | 35.7 | 40.8 | 46.1 | 50.6 | 43.3 | 0.0751 | 6.58 (T = 47) | 144.0 | 6.7/52.7 | ||
| C2 | C | 4.53 | 34.3 | 39.1 | 43.1 | 49.3 | 56.0 | 44.4 | 0.0300 | 6.10 (T = 40) | 167.5 | 5.3/32.1 | |
| A3 | A | Climb several flights of stairs | 3.78 | 35.2 | 40.3 | 45.2 | 52.0 | 43.2 | 0.1722 | 4.28 (T = 41) | 107.0 | 5.8/51.5 | |
| B3a | B | 4.20 | 34.2 | 40.8 | 46.7 | 51.3 | 43.3 | 0.8460 | 5.16 (T = 48) | 126.0 | 5.1/51.3 | ||
| C3 | C | 3.78 | 33.3 | 39.8 | 44.0 | 51.0 | 57.1 | 45.0 | 0.1174 | 4.31 (T = 42) | 135.0 | 6.3/25.4 | |
| A4a | A | Go for a walk of at least 15 minutes | 3.78 | 33.2 | 36.4 | 40.2 | 45.5 | 38.8 | 0.2497 | 4.45 (T = 37) | 91.3 | 3.7/73.5 | |
| B4 | B | 4.03 | 32.1 | 37.2 | 42.0 | 45.8 | 39.3 | 0.3555 | 4.93 (T = 43) | 107.0 | 3.4/71.6 | ||
| C4 | C | 3.99 | 30.3 | 35.6 | 39.5 | 44.9 | 50.8 | 40.2 | 0.0033 | 4.85 (T = 37) | 134.7 | 3.6/47.5 | |
| A5a | A | Open previously opened jars | 1.91 | 18.8 | 28.4 | 37.9 | 28.4 | 0.2434 | 1.10 (T = 28) | 36.5 | 0.9/85.8 | ||
| B5 | B | 1.90 | 12.9 | 22.8 | 32.3 | 39.6 | 26.9 | 0.5429 | 1.10 (T = 33) | 39.9 | 0.3/81.9 | ||
| C5 | C | 1.57 | 5.0 | 15.5 | 23.4 | 34.0 | 45.4 | 24.7 | 0.1877 | 0.77 (T = 20) | 33.6 | 0.3/62.4 | |
aItem is part of the final Patient Reported Outcomes Measurement Information System Physical Function (PROMIS PF) item bank version 1.0. bFormat A: “Are you able to …” (five-category response scale from “Without any difficulty” to “Unable to do”); format B: “Does your health now limit you in …” (five-category response scale from “Not at all” to “Cannot do”); format C: “How difficult is it for you to …” (six-category response scale from “Very easy” to “Impossible”). cThresholds are transformed to a T-score of 50 ± 10, where 50 = mean and 10 = standard deviation of the analytic sample; slopes are reported unchanged. d X 2 statistics (S-X 2) were evaluated after adjusting for multiple testing (p < 0.0033). eImax (at T-score) depicts the maximum of item information (upper number) of a given item at the corresponding point on the T-score continuum. fTotal area under the item information curve (IIC) on the empirically observed T-score range in the calibration sample (T-score = 11.1–73.6). gPercentage of participants who answered the item with the lowest (floor) or highest (ceiling) possible response category
Fig. 1Comparison of item information functions (IIFs) using different item formats. Format A: “Are you able to …” (five-category response scale from “Without any difficulty” to “Unable to do”); format B: “Does your health now limit you in …” (five-category response scale from “Not at all” to “Cannot do”); format C: “How difficult is it for you to …” (six-category response scale from “Very easy” to “Impossible”). Item parameters and IIFs were initially estimated using a standard normal physical function (PF) metric. PF values were subsequently transformed to a T-metric, where 50 = mean and 10 = standard deviations of the analytic sample (x-axis). Item information values on the y-axis are reported unchanged. I depicts the specific point on the T-score continuum, where a given item delivers maximum item information
Fig. 2Comparison of test information functions between different item formats. Format A: “Are you able to …” (five-category response scale from “Without any difficulty” to “Unable to do”); format B: “Does your health now limit you in …” (five-category response scale from “Not at all” to “Cannot do”); format C: “How difficult is it for you to …” (six-category response scale from “Very easy” to “Impossible”). Item parameters and information functions were initially estimated using a standard normal physical function (PF) metric. PF values were subsequently transformed to a T-metric, where 50 = mean and 10 = standard deviations of the analytic sample (x-axis). Test information values on the y-axis are reported unchanged
PROMIS PF full bank and short form scoring characteristics and agreement with simulated “true” scores
| General health groups | True PF | Full bank (124 items)b | Format Ae (5-item short form) | Format B (5-item short form) | Format C (5-item short form) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| RMSEc | Percentage floor/percentage ceilingd |
| RMSE | Percentage floor/percentage ceiling |
| RMSE | Percentage floor/percentage ceiling |
| RMSE | Percentage floor/percentage ceiling | ||
| Poor | 35.6 (6.5) | 35.7 (6.4) | 0.7 | 0.0/0.0 | 36.6 (6.3) | 3.0 | 3.9/0.2 | 36.4 (6.3) | 2.7 | 1.5/0.2 | 36.3 (6.3) | 2.7 | 0.5/0.0 |
| Fair | 41.9 (7.6) | 41.9 (7.7) | 0.8 | 0.0/0.0 | 42.5 (7.9) | 2.9 | 1.3/3.8 | 42.3 (790) | 2.6 | 0.5/4.6 | 42.3 (7.7) | 2.5 | 0.2/0.7 |
| Good | 48.9 (7.8) | 49.0 (7.9) | 1.1 | 0.0/0.1 | 49.4 (8.0) | 3.2 | 0.1/17.0 | 49.4 (8.0) | 3.1 | 0.0/21.5 | 49.1 (7.9) | 2.6 | 0.0/6.2 |
| Very good | 54.4 (7.2) | 54.5 (7.3) | 1.5 | 0.0/0.2 | 54.5 (6.9) | 3.8 | 0.0/37.9 | 54.5 (6.8) | 3.7 | 0.0/45.4 | 54.4 (7.3) | 2.9 | 0.0/16.8 |
| Excellent | 58.8 (6.5) | 58.7 (6.4) | 1.9 | 0.0/0.7 | 57.8 (5.3) | 4.3 | 0.0/59.0 | 57.8 (5.0) | 4.4 | 0.0/67.1 | 58.4 (6.3) | 3.3 | 0.0/32.6 |
| Full sample | 47.9 (11.0) | 48.0 (11.0) | 1.3 | 0.0/0.2 | 48.1 (10.4) | 3.5 | 1.1/23.6 | 48.1 (10.5) | 3.4 | 0.4/27.8 | 48.1 (10.5) | 2.8 | 0.1/11.3 |
a T-scores have a mean of 50 and standard deviation of 10 in the analytic sample. bFinal Patient Reported Outcomes Measurement Information System Physical Function (PROMIS PF) item bank version 1.0. cRSME = root mean square error between estimated T-scores and simulated “true” T-scores. dPercentage of the simulated sample who reached the lowest (“floor”) or highest (“ceiling”) possible score. eFormat A: “Are you able to …” (five-category response scale from “Without any difficulty” to “Unable to do”); Format B: “Does your health now limit you in …” (five-category response scale from “Not at all” to “Cannot do”); format C: “How difficult is it for you to…” (six-category response scale from “Very easy” to “Impossible”)
Analysis of variance (ANOVA) and relative validity (RV)
| Subgroup comparisons | General health groups considered for ANOVAa | Full bank (124 items)b | Format Ac (5-item short form) | Format B (5-item short form) | Format C (5-item short form) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Poor | Fair | Good | Very good | Excellent | F | RV | F | RVd (95% CI) | F | RV (95% CI) | F | RV (95% CI) | |
| a. Full sample | X | X | X | X | X | 16,957 | 1 .0 | 15,582 | 0.92 (0.91–0.93) | 16,139 | 0.95 (0.94–0.96) | 15,712 | 0.93 (0.92–0.94) |
| b. Average PF | X | X | X | 6960 | 1 .0 | 6246 | 0.90 (0.88–0.91) | 6473 | 0.93 (0.92–0.94) | 6349 | 0.91 (0.90–0.93) | ||
| c. Below-average PF | X | X | 3818 | 1 .0 | 3421 | 0.90 (0.87–0.92) | 3491 | 0.91 (0.89–0.94) | 3564 | 0.93 (0.91–0.96) | |||
| d. Above-average PF | X | X | 1870 | 1 .0 | 1476 | 0.79 (0.74–0.84) | 1467 | 0.78 (0.74–0.83) | 1720 | 0.92 (0.88–0.96) | |||
aSubgroups marked X were considered for calculating F values (ANOVA); n = 10,000 per subgroup. bFinal Patient Reported Outcomes Measurement Information System Physical Function (PROMIS PF) item bank version 1.0. cFormat A: “Are you able to …” (five-category response scale from “Without any difficulty” to “Unable to do”); format B: “Does your health now limit you in …” (five-category response scale from “Not at all” to “Cannot do”); format C: “How difficult is it for you to …” (six-category response scale from “Very easy” to “Impossible”). dRV calculation: (ANOVA F values derived from using a format-specific 5-item short form)/(ANOVA F values derived from using full bank scores)