| Literature DB >> 28707048 |
Linda Dirven1,2, Mogens Groenvold3,4, Martin J B Taphoorn5,6, Thierry Conroy7, Krzysztof A Tomaszewski8, Teresa Young9, Morten Aa Petersen3.
Abstract
BACKGROUND: The European Organisation of Research and Treatment of Cancer (EORTC) Quality of Life Group is developing computerized adaptive testing (CAT) versions of all EORTC Quality of Life Questionnaire (QLQ-C30) scales with the aim to enhance measurement precision. Here we present the results on the field-testing and psychometric evaluation of the item bank for cognitive functioning (CF).Entities:
Keywords: Cancer; Cognitive functioning; Computerized adaptive testing; EORTC QLQ-C30; Health-related quality of life; Item bank
Mesh:
Year: 2017 PMID: 28707048 PMCID: PMC5655578 DOI: 10.1007/s11136-017-1648-8
Source DB: PubMed Journal: Qual Life Res ISSN: 0962-9343 Impact factor: 4.147
Clinical characteristics of the 1030 participating patients
| Characteristic |
|
|---|---|
| Age in years, mean (range) | 63 (26–97) |
| Gender | |
| Men | 488 (47.4%) |
| Women | 542 (52.6%) |
| Country | |
| Denmark | 138 (13.4%) |
| France | 158 (15.3%) |
| Poland | 280 (27.2%) |
| United Kingdom | 454 (44.1%) |
| Cancer site | |
| Breast | 237 (23.0%) |
| Gastrointestinal | 144 (14.0%) |
| Gen-urinary | 171 (16.6%) |
| Gynecological | 99 (9.6%) |
| Hematological | 51 (5.0%) |
| Head and neck | 87 (8.4%) |
| Lung | 33 (3.2%) |
| Other | 208 (20.2%) |
| Cancer stage | |
| I–II | 615 (59.7%) |
| III–IV | 409 (39.7%) |
| Unknown | 6 (0.6%) |
| Current treatment | |
| Chemotherapy | 378 (36.7%) |
| Other treatment | 337 (32.7%) |
| No treatment | 314 (30.5%) |
| Unknown | 1 (0.1%) |
| Cohabitation | |
| Living with a partner | 750 (72.8%) |
| Living alone | 267 (25.9%) |
| Missing | 13 (1.3%) |
| Educational level | |
| 0–10 years | 311 (30.2%) |
| 11–13 years | 269 (26.1%) |
| 14–16 years | 221 (21.5%) |
| >16 years | 225 (21.8%) |
| Missing | 4 (0.4%) |
| Work | |
| Fulltime | 337 (32.7%) |
| Part-time | 76 (7.4%) |
| Retired | 431 (41.8%) |
| Other | 184 (17.9%) |
| Missing | 2 (0.2%) |
Parameter estimates and fit statistics for the 34 items in the final IRT model
| Item | Slope | Location | Item fit | Infit | Outfit |
|---|---|---|---|---|---|
|
| 1.55 | −1.57 | 0.813 | 1.01 | 1.03 |
|
| 1.73 | −0.99 | 0.726 | 0.97 | 0.92 |
|
| 1.59 | −1.41 | 0.640 | 1.06 | 0.97 |
|
| 2.01 | −1.52 | 0.980 | 0.98 | 0.90 |
|
| 2.25 | −1.23 | 0.392 | 1.01 | 0.89 |
|
| 1.97 | −1.53 | 0.965 | 0.99 | 0.83 |
|
| 1.78 | −0.83 | 0.609 | 0.98 | 0.94 |
|
| 1.24 | −1.56 | 0.689 | 0.98 | 0.90 |
|
| 2.25 | −1.56 | 0.714 | 1.01 | 0.88 |
|
| 2.66 | −1.36 | 0.602 | 0.99 | 0.80 |
|
| 1.84 | −1.45 | 0.106 | 1.05 | 1.00 |
|
| 1.83 | −1.11 | 0.374 | 0.96 | 0.92 |
|
| 2.68 | −1.00 | 0.880 | 0.97 | 0.86 |
|
| 3.15 | −0.91 | 0.588 | 0.99 | 0.84 |
|
| 2.52 | −1.12 | 0.139 | 1.03 | 0.96 |
|
| 2.48 | −1.29 | 0.718 | 0.93 | 0.85 |
|
| 1.63 | −2.40 | 0.973 | 1.15 | 1.20 |
|
| 3.32 | −1.14 | 0.554 | 1.07 | 0.81 |
|
| 2.50 | −1.36 | 0.389 | 1.05 | 1.02 |
|
| 2.31 | −1.30 | 0.590 | 0.97 | 0.87 |
|
| 1.66 | −1.19 | 0.260 | 0.99 | 0.90 |
|
| 2.08 | −1.55 | 0.412 | 0.93 | 0.84 |
|
| 2.48 | −1.66 | 0.998 | 1 | 0.86 |
|
| 2.62 | −1.30 | 0.964 | 0.94 | 0.73 |
|
| 1.82 | −1.55 | 0.764 | 0.93 | 0.85 |
|
| 3.74 | −1.46 | 0.837 | 0.98 | 0.90 |
|
| 2.91 | −1.49 | 0.918 | 0.94 | 0.81 |
|
| 2.15 | −1.05 | 0.334 | 0.91 | 0.89 |
|
| 2.78 | −1.47 | 0.995 | 0.95 | 0.79 |
|
| 1.80 | −1.45 | 0.933 | 0.96 | 0.87 |
|
| 1.88 | −1.76 | 0.686 | 1.05 | 0.94 |
|
| 2.22 | −1.56 | 0.562 | 1.01 | 0.94 |
|
| 1.20 | −1.76 | 0.926 | 0.96 | 0.88 |
|
| 3.01 | −1.40 | 0.363 | 0.95 | 0.74 |
Results of the DIF analysis
| Item | DIF |
|
| DIF |
|
|
|---|---|---|---|---|---|---|
| Item 1 | Country | −0.76 (Poland vs. rest) | <0.0001 | |||
| Item 3 | Age | 0.80 (≥70 vs. rest) | <0.0001 | |||
| Item 5 | No DIF | |||||
| Item 8 | Country | 1.15 (Poland vs. rest) | <0.0001 | |||
| Item 9 | No DIF | |||||
| Item 10 | Country | 1.33 (Poland vs. rest) | <0.0001 | |||
| Item 11 | No DIF | |||||
| Item 12 | Age | −1.26 (<50 vs. ≥50) | <0.0001 | |||
| Item 13 | No DIF | |||||
| Item 14 | No DIF | |||||
| Item 15 (q20) | Age | 0.93 (<70 vs. ≥70) | <0.0001 | Country | 0.93 (Poland vs. rest) | <0.0001 |
| Item 16 | No DIF | |||||
| Item 18 (q25) | Age | 1.67 (<40 vs. ≥40) | <0.0001 | |||
| Item 19 | Work | 0.69 (Retired vs. rest) | 0.0002 | |||
| Item 24 | Country | −0.74 (Poland vs. rest) | <0.0001 | |||
| Item 25 | No DIF | |||||
| Item 26 | No DIF | |||||
| Item 27 | No DIF | |||||
| Item 28 | Country | −1.56 (Poland vs. rest) | <0.0001 | |||
| Item 30 | Age | −1.38 (<40 vs. ≥40) | 0.0020 | Country | −0.85 (Denmark vs. rest) | <0.0001 |
| Item 31 | No DIF | |||||
| Item 32 | No DIF | |||||
| Item 33 | No DIF | |||||
| Item 34 | Country | −0.82 (Poland vs. rest) | <0.0001 | |||
| Item 35 | No DIF | |||||
| Item 36 | Country | 0.71 (Poland vs. rest) | 0.0006 | |||
| Item 37 | Country | 0.95 (Denmark vs. rest) | <0.0001 | |||
| Item 38 | Age | 0.75 (<50 vs. ≥50) | 0.0002 | Country | 2.57 (Poland vs. rest) | <0.0001 |
| Item 39 | No DIF | |||||
| Item 40 | No DIF | |||||
| Item 41 | Country | 1.08 (Denmark & France vs. Poland & United Kingdom) | <0.0001 | |||
| Item 42 | Country | 0.78 (Denmark & France vs. Poland & United Kingdom) | <0.0001 | |||
| Item 43 | No DIF | |||||
| Item 44 | Age | 1.49 (<40 vs. ≥40) | 0.0002 |
One beta for each group variable (e.g., country) is presented which summarizes the potential DIF, as well as the reference category that was used in each case
Fig. 1Test information function for the 34 items in the final model and information on the two original QLQ-C30 cognitive functioning (CF) items. CF scores for all response options (ranging from ‘not at all’ to ‘very much’) are presented and their level of measurement precision
Fig. 2Correlations and root mean square errors (RMSEs) of θ’s based on fixed-length CATs and the cognitive functioning score based on all 34 items. For example, scores based on three or more items correlated highly (>0.90) with the score based on all items
Fig. 3The average relative validity (RV) and relative required sample size using CAT measurement across observed and simulated data, compared to using the QLQ-C30 cognitive functioning sum scale. For example, using a CAT with two items, the data show that the validity of CAT is 1.24 times that of the QLQ-C30 cognitive functioning sum scale (RV = 1.24). Moreover, the required sample size is 37% (sample size = 0.63) smaller using this two-item CAT when compared to the QLQ-C30 cognitive functioning sum scale, while obtaining the same power