| Literature DB >> 29521952 |
J M Sparrow1,2, M T Grzeda1,2, N A Frost3, R L Johnston4, C S C Liu5,6, L Edwards1, A Loose1, J L Donovan2.
Abstract
PurposeTo develop a short, psychometrically robust and responsive cataract patient reported outcome measure suitable for use in high-volume surgical environments.MethodsA prospective study in which participants completed development versions of questionnaires exploring the quality of their eyesight using items harvested from two existing United Kingdom developed parent questionnaires. Participants were 822 patients awaiting cataract surgery recruited from 4 cataract surgical centres based in the UK. Exclusion criteria were other visually significant comorbidities and age <50 years. An iterative multi-stage process of evaluation using Rasch and factor analyses with sequential item reduction was undertaken.ResultsA definitive item set of just five items delivered performance in accordance with the requirements of the Rasch model: no threshold disordering, no misfitting items, Rasch-based reliability 0.90, person separation 2.98, Cronbach's α 0.89, good targeting of questions to patients with cataract with pre-operative item mean -0.41 logits and absence of significant floor or ceiling effects, minor deviations of item invariance, and confirmed unidimensionality. The test-re-test repeatability intra-class correlation coefficient was 0.89 with excellent responsiveness to surgery, Cohen's d -1.45 SD. Rasch calibration values are provided for Cat-PROM5 users.ConclusionsA psychometrically robust and highly responsive five-item cataract surgery patient reported outcome measure has been developed, which is suitable for use in high-volume cataract surgical services.Entities:
Mesh:
Year: 2018 PMID: 29521952 PMCID: PMC5898878 DOI: 10.1038/eye.2018.1
Source DB: PubMed Journal: Eye (Lond) ISSN: 0950-222X Impact factor: 3.775
Figure 1Flow chart for Cat-PROM5 development study. People awaiting cataract surgery without other visually significant comorbidities in either eye were recruited following informed consent.
Psychometric properties of the scale and criteria for acceptability
| Valid measurement model | To identify a pool of items which effectively measure the concept of visual difficulty. To remove items that do not fit the assumed criteria of a unidimensional measure | Applied to Rasch modelling:
Rasch-Andrich thresholds ordered as expected; Mean Square fit statistics: Outfit/Infit within 0.7–1.3 |
| Precision | The reliability indexes assessing the precision of the measure. Two indexes were of our interest: (a) Rasch-based reliability is the share of the ‘true’ variance in the total observed variance of the measure. (b) Person Separation index; the ratio of the reliable (‘true’) variation in measure to the variation stemming from the random noise. Both indices serve as Rasch equivalents to traditional reliability indices (Cronbach’s alpha.) | Rasch based reliability index: ≥0.7 acceptable reliability ≥0.8 good; ≥0.9 excellent Person separation >2.0 |
| Test–re-test reliability | Confirmation that the items of the scale return stable results assessed by administering the same questions repeatedly to the same patients in the absence of a change in clinical status. | Intraclass Correlation >0.70 Cohen’s Kappa: >0.5 moderate >0.6 good |
| Scale responsiveness | Evidence that the scale is sensitive to surgical intervention for cataract (mean change in Logits divided by the standard deviation computed for the all pre- and post-operative patients combined). | Effect size: Moderate >0.50 SD Large >0.80 SD |
| Discriminative validity | Evidence that the measure of visual difficulty is not simply a repetition of an existing clinical measure, that is, the instrument should capture information relevant to the wider experience of a person’s vision. | Low (<0.3) correlation with visual acuity (LogMAR) |
| Convergent validity | Evidence that the measure is highly correlated with other similar visual difficulty PROMs. | High (≥0.7) correlation with Catquest-9SF |
Outfit/Infit statistics <0.7 suggest item redundancy, >1.3 indicates poorly fitting items.[24]
Caution should be exercised before removing items located towards either end of the scale as these have lower correlations but may enhance precision towards the scale extremities.
Root-mean-square-error-of-approximation (RMSEA)[27, 28, 30] with threshold 0.09 stricter than usual 0.08 in view of categorical variables, sample <250 and type I error near 5% (ref. and comparative fit index (CFI).
Sociodemographic characteristics of participants
| Age median (1st Qr; 3rd Qr) | 76 (70; 81) | 76 (70; 82) | 76 (70; 82) | 76 (70; 82) |
| Gender, M:F ( | 71:129; 35.5%:64.5% | 131:183; 41.5%:57.9% | 136: 170; 44.4%:55.6% | 338:482; 40.9%:58.4% |
| Missing | 0; 0.0% | 2; 0.6% | 0; 0.0% | 2; 0.2% |
| Side R:L ( | 107:73; 53.5%:36.5% | 168:145; 53.2%:45.9% | 162:144; 52.9%:47.1% | 437:362; 53.2%:44.0% |
| Missing | 20; 10.0% | 3; 0.9% | 0; 0.0% | 23; 2.8% |
| Eye 1st:2nd ( | 154:43; 77.0%:21.5% | 229:84; 72.5%:26.6% | 169:137; 55.2% 44.8% | 552:264; 67.2%:32.1% |
| Missing | 3; 1.5% | 3; 0.9% | 0; 0.0% | 6; 0.7% |
| SES | ||||
| Q1 | 57; 28.5% | 70; 22.2% | 70; 22.9% | 197; 24.0% |
| Q2 | 41; 20.5% | 75; 23.7% | 67; 21.9% | 183; 22.3% |
| Q3 | 36; 18.0% | 71; 22.5% | 80; 26.1% | 187; 22.7% |
| Q4 | 45; 22.5% | 56; 17.7% | 51; 16.7% | 152; 18.5% |
| Q5 | 16; 8.0% | 22; 7.0% | 29; 9.5% | 67; 8.2% |
| SES missing | 5; 2.5% | 22; 7.0% | 9; 2.9% | 36; 4.4% |
| Site ( | ||||
| Bristol | 196; 98.0% | 107; 33.9% | 93; 30.4% | 396; 48.2% |
| Torbay | 4; 2.0% | 78; 24.7% | 47; 15.4% | 129; 15.7% |
| Cheltenham | — | 79; 25.0% | 81; 26.5% | 160; 16.7% |
| Brighton | — | 52; 16.5% | 85; 27.8% | 137; 19.5% |
SES—Index of Multiple Deprivation.
Psychometric performance of the Cat-PROM5 items for the pre- and post-operative ‘Cycle 2’ and for all Cycles combined. (Items ordered from low to high visual difficulty from above down)
| VSQ_Bad Eye | −0.92 (0.12) | 0.91 | 0.91 | 0.81 | 0.53 | 0.69 | −1.00 (0.07) | 1.13 | 1.14 | 0.78 |
| VSQ_Overall | −0.68 (0.10) | 1.10 | 1.06 | 0.85 | 0.63 | 0.73 | −0.66 (0.06) | 1.01 | 1.00 | 0.87 |
| VSQ_Reading | −0.02 (0.11) | 1.22 | 1.08 | 0.80 | 0.54 | 0.69 | 0.11 (0.06) | 1.10 | 1.08 | 0.81 |
| VCM1_Interfere | 0.04 (0.10) | 0.87 | 0.90 | 0.86 | 0.53 | 0.69 | 0.19 (0.06) | 0.84 | 0.88 | 0.86 |
| VSQ_Doing | 1.58 (0.14) | 0.88 | 0.84 | 0.74 | 0.57 | 0.66 | 1.36 (0.08) | 0.88 | 0.85 | 0.76 |
| Model Indices | Rasch-based reliability 0.88; Person separation: 2.66;
Cronbach’s | Rasch Person Measure ICC for 5-Item Scale 0.89 | Rasch-based reliability 0.90; Person separation: 2.98;
Cronbach’s | |||||||
‡The number of patients in the validation sample is 735 which includes all from Pilot, and those sampled and available from Cycle 1 and 2. If a patient from either of these later cycles was randomly selected to contribute their post-operative measurement but this was missing, then they were dropped from the model generation group. All patients were represented in subsequent computations. The parameters of Rasch models were estimated by Joint Maximum Likelihood methods using WINSTEPS v3.72.3 software.
Measure represents the Item Location, also called the item difficulty (it is the average of the Rasch-Andrich model thresholds).
Computed with SD of baseline measures as the denominator.
Computed with SD of total sample as the denominator N for the Rasch performance parameters relates to the sample size used to generate the Rasch model.
Figure 2Cat-PROM5 Person-Item map for all cycles showing respondent distributions for pre-operative (upper panel), post-operative (middle panel) completions, and the Item Locations (Loc) and Thresholds (probability crossover points between adjacent categories, lower panel) on the same Logit scale. In total, 1266 questionnaire completions were available. Pre- and post-operative means −0.41 and −3.61 respectively.