| Literature DB >> 31399055 |
Aidin Abedi1, C A C Prinsen2, Ishan Shah1, Zorica Buser3, Jeffrey C Wang1.
Abstract
BACKGROUND: Whiplash is a common traumatic cervical injury which is most often a consequence of rear-end motor vehicle accidents. It has been estimated that up to 50% of the whiplash patients suffer from chronic symptoms, resulting in extensive individual and societal burden. Several measurement instruments are used for initial assessment of whiplash and evaluation of response to treatment. However, a comprehensive assessment of the performance of these measures is lacking. Furthermore, there is no consensus on the most relevant outcome domains and their corresponding measurement instruments of choice. This systematic review aims to identify, describe, and critically appraise the performance properties of health-related measurement instruments in whiplash population.Entities:
Keywords: Outcome assessment; Patient-reported outcome measures; Reliability; Reproducibility of results; Responsiveness; Review; Validity; Whiplash injuries
Mesh:
Year: 2019 PMID: 31399055 PMCID: PMC6688369 DOI: 10.1186/s13643-019-1119-0
Source DB: PubMed Journal: Syst Rev ISSN: 2046-4053
Eligibility criteria
| Criterion | Explanation |
|---|---|
| Study designs | Development, adaptation, translation, or evaluation of measurement properties of measurement instruments. Reviews will be initially included for citation tracking. |
| Population | Patients with clinical manifestation of whiplash defined according to the Québec Task Force on Whiplash-Associated Disorders: “Whiplash is an acceleration-deceleration mechanism of energy transfer to the neck. It may result from rear end or side-impact motor vehicle collisions, but can also occur during diving or other mishaps. The impact may result in bony or soft-tissue injuries (whiplash injury), which in turn may lead to a variety of clinical manifestations (Whiplash-Associated Disorders) [ For content/face validity studies: > 50% of the patients with whiplash when patients are involved. For all other studies: > 50% of the whole study population with whiplash OR separate sub-group analysis of patients with whiplash performed. |
| Severity | Whiplash grade 0–IV according to the Quebec Task Force classification of whiplash-associated disorders [ |
| Type of measure | Unrestricted, including but not limited to classifications, patient-reported outcome measures, prediction rules/models, performance-based measures and imaging. |
| Construct | All health-related constructs addressed in the literature. |
| Timing | Acute (≤ 3 months) and chronic (> 3 months) phase |
| Settings | Acute care, rehabilitation, and community |
| Language | Unrestricted |
Fig. 1Decision-making algorithm for generation of recommendations. Methodology based on Prinsen et al. [16] and Mokkink et al. [30]. *As described in Table 2
Criteria for evaluation of the quality of results
| Measurement property | Rating | Criteria |
|---|---|---|
| Structural validity | + | CTT CFA: CFI or TLI or comparable measure > 0.95 OR RMSEA < 0.06 OR SRMR < 0.08a |
IRT/Rasch No violation of
no violation of
no violation of
adequate IRT: χ2 > 0.001 Rasch: infit and outfit mean squares ≥ 0.5 and ≤ 1.5 OR Z-standardized values > − 2 and < 2 | ||
| ? | CTT: not all information for ‘+’ reported IRT/Rasch: model fit not reported | |
| − | Criteria for ‘+’ not met | |
| Internal consistency | + | At least low evidencec for sufficient structural validityd AND Cronbach’s alpha(s) ≥ 0.70 for each unidimensional scale or subscalee |
| ? | Criteria for “At least low evidencec for sufficient structural validityd” not met | |
| − | At least low evidencec for sufficient structural validityd AND Cronbach’s alpha(s) < 0.70 for each unidimensional scale or subscalee | |
| Reliability | + | ICC or weighted Kappa ≥ 0.70 |
| ? | ICC or weighted Kappa not reported | |
| − | ICC or weighted Kappa < 0.70 | |
| Measurement error | + | SDC or LoA < MICd |
| ? | MIC not defined | |
| − | SDC or LoA > MICd | |
| Hypotheses testing for construct validity | + | The result is in accordance with the hypothesisf |
| ? | No hypothesis defined (by the review team) | |
| − | The result is not in accordance with the hypothesisf | |
| Cross-cultural validity/measurement invariance | + | No important differences found between group factors (such as age, gender, language) in multiple group factor analysis OR no important DIF for group factors (McFadden’s |
| ? | No multiple group factor analysis OR DIF analysis performed | |
| − | Important differences between group factors OR DIF was found | |
| Criterion validity | + | Correlation with gold standard ≥ 0.70 OR AUC ≥ 0.70 |
| ? | Not all information for ‘+’ reported | |
| − | Correlation with gold standard < 0.70 OR AUC < 0.70 | |
| Responsiveness | + | The result is in accordance with the hypothesisf OR AUC ≥ 0.70 |
| ? | No hypothesis defined (by the review team) | |
| − | The result is not in accordance with the hypothesisf OR AUC < 0.70 |
Adapted from Prinsen et al. [16] under a Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/). The criteria are updated by Prinsen et al. [16] based on, e.g., Terwee et al. [31] and Prinsen et al. [14]
AUC area under the curve, CFA confirmatory factor analysis, CFI comparative fit index, CTT classical test theory, DIF differential item functioning, ICC intraclass correlation coefficient, IRT item response theory, LoA limits of agreement, MIC minimal important change, RMSEA root mean square error of approximation, SEM standard error of measurement, SDC smallest detectable change, SRMR standardized root mean residuals, TLI Tucker–Lewis index, + sufficient, − insufficient, ? indeterminate
aTo rate the quality of the summary score, the factor structures should be equal across studies
bUnidimensionality refers to a factor analysis per subscale, while structural validity refers to a factor analysis of a (multidimensional) patient-reported outcome measure
cAs defined by grading the evidence according to the GRADE approach
dThis evidence may come from different studies
eThe criteria “Cronbach alpha < 0.95” was deleted, as this is relevant in the development phase of a PROM and not when evaluating an existing PROM
fThe results of all studies should be taken together, and it should then be decided if 75% of the results are in accordance with the hypotheses
Modified GRADE approach for evaluation of the overall quality of evidence [16]
| Quality of evidence | Definition |
|---|---|
| High | We are very confident that the true measurement property lies close to that of the estimate of the measurement property |
| Moderate | We are moderately confident in the measurement property estimate: the true measurement property is likely to be close to the estimate of the measurement property, but there is a possibility that it is substantially different |
| Low | Our confidence in the measurement property estimate is limited: the true measurement property may be substantially different from the estimate of the measurement property |
| Very low | We have very little confidence in the measurement property estimate: the true measurement property is likely to be substantially different from the estimate of the measurement property |
Reused from Prinsen et al. [14] under a Creative Commons Attribution 4.0 (http://creativecommons.org/licenses/by/4.0/)