| Literature DB >> 31848877 |
Rebecca McKeown1, David R Ellard2, Abdul-Rasheed Rabiu3, Eleni Karasouli2, Rebecca S Kearney2.
Abstract
BACKGROUND: Ankle fractures are painful and debilitating injuries that pose a significant burden to society and healthcare systems. Patient reported outcome measures (PROMs) are commonly used outcome measures in clinical trials of interventions for ankle fracture but there is little evidence on their validity and reliability. This systematic review aims to identify and appraise evidence for the measurement properties of ankle specific PROMs used in adults with an ankle fracture using Consensus Based Standards for the Selection of Health Measurement Instrument (COSMIN) methodology.Entities:
Keywords: Ankle fracture; Measurement properties; Patient reported outcome measures; Reliability; Systematic review; Validity
Year: 2019 PMID: 31848877 PMCID: PMC6917678 DOI: 10.1186/s41687-019-0159-5
Source DB: PubMed Journal: J Patient Rep Outcomes ISSN: 2509-8020
Hypotheses set for construct validity testing
| Hypothesis Number | Hypothesis |
|---|---|
| 1 | Correlation with scores of instruments measuring a similar construct or another PROM included in the pre-specified list will be highly or moderately to highly correlated. |
| 2 | Correlation with scores of instruments measuring related but not the same constructs, for example generic disability scores or health related quality of life measures will be either moderately to highly or moderately correlated. |
| 3 | A weak to moderate correlation will be observed between PROM/s scores of instruments included here and two different subgroups of patients. These subgroups will be individuals who have had their fracture managed operatively and those who have had their fracture managed non-operatively. Here, fracture management is used as a surrogate for severity of fracture (i.e. more severe fractures usually managed operatively). Therefore, we would expect to see a weak to moderate correlation between the PROM score and the severity of the fracture. |
Fig. 1PRISMA flow diagram
PROM characteristics
| PROM | Construct(s) | Target Population | Recall period | Items and subscales | Response options | Source language (and additional language versions) |
|---|---|---|---|---|---|---|
| AAOS | Outcome for foot or ankle disability | Individuals with foot or ankle disability | Past 1 week (or since injury if less than 1 week) | 25 items, 20 items in the core scale and a shoe comfort scale consisting of 5 items | Likert scales varying in length 1–3, 1–5, 1–6 or 1–7 | English(Mexican- Spanish) |
| A-FORM | Outcome following ankle fracture | Individuals with ankle fracture | No recall period given | 15 items, no sub-scales. Two parts to the questionnaire. | Single response, multiple choice - 5 response options | English |
| OMAS | Outcome following ankle fracture | Individuals with ankle fracture | No recall period provided | 9 items, no subscales | Single response, multiple choice - 2, 3, 4 or 5 response options | Not specified (English, Turkish and Norwegian) |
Characteristics of studies
| Population | ||||||
|---|---|---|---|---|---|---|
| Article, PROM and Language | Number of participants (n) | Age mean (±, range) (yrs) | Gender (% female) | Injury Information | Follow up duration mean (±, range) | Method of Collection |
| Buker et al. (2018) OMAS, Turkish | 91 | 41.54 (±13.28, 20–60) | 30.8% | Operatively managed ankle #s | 27.92 months (±8.94, range N/S) | Initial in clinic, follow up in clinic or on telephone |
| Garratt et al. (2018) OMAS, Norwegian | Cohort 959, 299 for test-retest questionnaire | 57.5 (± N/S, 22.2–91.2) | 56.8% | Operatively managed ankle #s | Not specified, stated recruited over a 3 year period | At home via post |
| McPhail et al. (2014) A-FORM, English | Delphi panel – 8 Cohort - 41 | 36.8 (± N/S, 26.1–53.8) | 27% | Operatively managed (46.3%) and non-operatively managed (53.7%) ankle #s | 6–8 week post injury and at 12–16 weeks post injury | Either in clinic or at home via post |
| Olerud and Molander (1984) OMAS, Language N/S | 90 | N/S | N/S | Operatively managed ankle #s | N/S | N/S |
| Turhan et al. (2018) OMAS, Turkish | 100 | 42.3 (±17.7, 16–81) | 49% | Operatively (57%) and non-operatively managed (43%) ankle #s | 4.3 years (± and range N/S) | N/S |
| Zelle et al. (2017) AAOS, Spanish | 100 (83 returned 1st questionnaire, 63 returned 2nd questionnaire) | 42.98 (± N/S,18–88) | 41% | 58 ankle #s, 5 talus #s, 1 Achilles tendon rupture, 11 calcaneus #s, 6 midfoot #s. 73 operatively managed and 27 non-operatively managed | 3.97 months (±4.71 range N/S) | Initial at clinic or via post, follow up was via post. |
Key: N/S = not specified, ± = standard deviation, # = fracture; shows the characteristics of the six studies included in this review. Table 4 shows the overall methodological quality for each measurement property assessed in each of the articles using the COSMIN Risk of Bias Checklist [28]. The four articles which underwent the second review process for both risk of bias assessment and data extraction, following COSMIN guidance, are marked on the table with an asterisk
Scores for methodological quality using COSMIN risk of bias checklist
| PROM | AAOS | A-FORM | OMAS | |||
|---|---|---|---|---|---|---|
| Article | ||||||
| PROM Development | Doubtful | Inadequate | ||||
| Content Validity | ||||||
| Structural Validity | Doubtful | |||||
| Internal Consistency | Doubtful | Doubtful | Doubtful | |||
| Cross cultural validity and measurement invariance | ||||||
| Reliability | Inadequate | Inadequate | Doubtful | Inadequate | ||
| Measurement Error | Doubtful | Doubtful | ||||
| Criterion Validity | N/A | N/A | N/A | N/A | N/A | N/A |
| Construct validity | Doubtful (Convergent validity) | Doubtful (Convergent validity) | Adequate (Convergent Validity) Doubtful (Known Groups Validity) | Adequate (Convergent validity) | ||
| Responsiveness | ||||||
Scores for methodological quality using COSMIN Risk of Bias Checklist; available options are very good, adequate, doubtful, inadequate or N/A. Key: * = Articles were assessed by second reviewer for risk of bias and data extraction, N/A: Not applicable. A blank box indicates that the measurement property was not assessed in the study
Results presented in articles
| Article and PROM | Structural Validity | Internal Consistency | Cross-cultural validity | Reliability | Measurement Error | Construct Validity | Responsiveness |
|---|---|---|---|---|---|---|---|
Zelle et al. (2017) AAOS | N/R | N/R | N/R | ICC or weighted kappa not reported | N/R | AAOS-CS with SF-36-PCS AAOS-CS with SF-36-MCS AAOS-SCS with SF-36-PCS AAOS SCS with SF-36-MCS | N/R |
| McPhail et al. (2014) A-FORM | N/R | N/R | N/R | N/R | N/R | N/R | N/R |
| Buker et al. (2017) OMAS | N/R | Cronbach’s Alpha 0.76 | N/R | ICC 0.98 | N/R | OMAS with 5 FAOS Subscales: pain | N/R |
| Garratt et al. (2018) OMAS | CFI 0.99 and TLI 0.98 | Cronbach’s Alpha 0.82 | N/R | ICC 0.92 | MIC not defined | OMAS with SEFAS OMAS with SF-36-PCS OMAS with EQ-5D | N/R |
| Olerud and Molander (1984) OMAS | N/R | N/R | N/R | N/R | N/R | N/R | N/R |
| Turhan et al. (2017) OMAS | N/R | Cronbach’s Alpha 0.84 | N/R | ICC 0.98 | MIC not defined | OMAS with FAAM-ADL OMAS with FAAM-S OMAS with SF-12-PCS OMAS with SF-12-MCS | N/R |
Key: r = Pearson’s correlation, rs = Spearman’s correlation, ADL = Activities of Daily Living, QoL = Quality of life, FAAM-ADL = FAAM Activities of Daily Living Subscale, FAAM-S – FAAM Sports Subscale, AAO-CS = AAOS Core Score, AAOS-SCS = AAOS Shoe Comfort Scale, PCS=Physical component Score, MCS = Mental component Score, EQ-5D = EuroQol EQ-5D-5 L Score, ICC=Intraclass correlation coefficient, TLI = Tucker Lewis index, CFI=Confirmatory Factor Analysis, MIC = Minimally Important Change
Summary of findings table
| PROM | AAOS | A-FORM | OMAS | |||
|---|---|---|---|---|---|---|
| Overall Rating | Quality of Evidence | Overall Rating | Quality of Evidence | Overall Rating | Quality of Evidence | |
| Content validity | ? | N/A | ? | N/A | ? | N/A |
| ? | N/A | ? | N/A | ? | N/A | |
| ? | N/A | ? | N/A | ? | N/A | |
| ? | N/A | ? | N/A | ? | N/A | |
| Structural validity | ? | N/A | ? | N/A | + | High |
| Internal consistency | ? | N/A | ? | N/A | 3+ | Moderate |
| Cross-cultural validity | ? | N/A | ? | N/A | ? | N/A |
| Measurement invariance | ? | N/A | ? | N/A | ? | N/A |
| Reliability | ? | Very Low | ? | N/A | 3+ | Low |
| Measurement Error | ? | N/A | ? | N/A | ? | N/A |
| Criterion validity | N/A | N/A | N/A | N/A | N/A | N/A |
| Construct validity | 4+ | Low | ? | N/A | 16+ | High |
| Responsiveness | ? | N/A | ? | N/A | ? | N/A |
Key: + = Sufficient,? = Indeterminate, − = Insufficient, N/A = not applicable
Interpretability evidence of the PROMs
| Article and PROM | Distribution of total scores in study population | Percentage of missing total scores | Percentage of missing items | Floor and Ceiling Effects (Interpretability) |
|---|---|---|---|---|
| Zelle et al. (2017)* AAOS | Normal distribution following Shapiro-Wilks Test -no Mean (±) provided. | Missing total scores 83 of 100 in first test and 63 of 100 in re-test. | No data reported on items missing. | Not reported |
| McPhail et al. (2014)* A-FORM | Not reported for questionnaire in final format | Not reported | Not reported for questionnaire in final format | Not reported |
| Olerud and Molander (1984) OMAS | Not reported | Not reported | Not reported | Not reported |
| Garratt et al. (2018) OMAS | 75.62 (±24.07) - No information on distribution | 1.6% missing | 17.3% of respondents missed at least one item. “Jumping” most commonly missed item (6.2%). | Not reported |
| Buker et al. (2017)* OMAS | 72.58 (±23.27) - No information on distribution | Not reported | Not reported | Not reported |
| Turhan et al. (2017)* OMAS | 74.1 (±23.7) - No information on distribution | Not reported | Not reported | Floor - 0% Ceiling - 27-29% |
Key: * = * = Articles were assessed by second reviewer for risk of bias and data extraction