| Literature DB >> 31767589 |
Steven McGee1, Taylor Sipos1, Thomas Allin1, Celia Chen1, Alexandra Greco1, Pavlos Bobos2,3,4, Joy MacDermid1,3,5.
Abstract
OBJECTIVES: The purpose of this systematic review is to identify and synthesise studies evaluating performance-based functional outcome measures designed to evaluate the functional abilities of patients with neck pain.Entities:
Keywords: cervical; functional; neck; outcome measures; psychometric properties
Mesh:
Year: 2019 PMID: 31767589 PMCID: PMC6886974 DOI: 10.1136/bmjopen-2019-031242
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1Selection of the studies for inclusion in the systematic review.
Summary of studies reporting psychometric properties of functional-based tests in neck disorder patients
| Study | Population | Sample size (n) | Functional tests | Intervention/test interval | Quality |
| Ljungquist | Neck pain (55%), back pain, multiple pain sites | 53 | PILE-C, PILE-L | N/A | Good (68%) |
| Ljungquist | Neck pain (50%), lumbar pain, thoracic pain, shoulder pain, multiple pain sites | 68 | PILE-C, PILE-L, 2×20 m WWB | 8 days | Very good (79%) |
| Ljungquist | Neck pain, lumbar pain, thoracic pain, shoulder pain, lower extremity pain, multiple pain sites | 235 | PILE-C, PILE-L, 2×20 m WWB | N/A | Very good (82%) |
| Ljungquist | cervical pain (25%), lumbar pain, cervical (25%) and lumbar pain, multiple pain sites | 186 | PILE-C, PILE-L, 2×20 m WWB | 6 months | Very good (79%) |
| Lomond and Cote | Chronic neck and shoulder pain (100%) | 32 | BTEWS II | 9.5 days | Very good (88%) |
| Pierrynowski | Subacute and chronic WAD II | 66 | FIT-HaNSA | 2–7 days | Very good (88%) |
| Reesink | N/A | N/A | Neck-FCE | N/A | N/A |
| Reneman | Chronic multifactorial neck pain | 18 | Neck-FCE | 2 weeks | Good (67%) |
| Trippolini | Sub acute and chronic WAD I and II | 32 | WAD FCE | 7 days | Very good (75%) |
| Trippolini | Sub acute and chronic WAD I and II | 267 | Workwell FCE | N/A | Excellent (92%) |
| Trippolini | Sub acute and chronic WAD I and II | 314 | WAD FCE | N/A | Very good (86%) |
| Van der Meer | Chronic WAD I and II | 40 | Neck FCE | N/A | Very good (86%) |
CBT, cognitive-behavioural therapy; EXP, experimental; F, female; FCE, functional capacity evaluation; FIT-HaNSA, Functional Impairment Test-Hand and Neck/Shoulder/Arm; BTEWS II, Baltimore Therapeutic Equipment work simulator II; M, male; MVA, motor vehicle accident; N/A, not applicable; NRPS, Numeric Pain Rating Scale;PILE-C, Progressive Isoinertial Lifting Evaluation-Cervical; PILE-L, Progressive Isoinertial Lifting Evaluation; PT, physical therapy; WAD, Whiplash Associated Disorder.
Quality of studies on psychometric properties of functional-based tests evaluated in neck disorder patients
| Study | Item evaluation criteria | ||||||||||||
| Q1 | Q2 | Q3 | Q4 | Q5 | Q6 | Q7 | Q8 | Q9 | Q10 | Q11 | Q12 | Total (%) | |
| Trippolini | 2 | 2 | 2 | 2 | 1 | 2 | 2 | 2 | 2 | 2 | 1 | 2 | 92% |
| Lomond and Cote | 2 | 2 | 1 | 2 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 88% |
| Pierrynowski | 2 | 2 | 1 | 2 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 88% |
| Trippolini et al | 2 | 2 | 2 | 0 | 1 | N/A | 2 | 2 | 2 | 2 | 2 | 2 | 86% |
| Van der Meer | 2 | 1 | 2 | 1 | 2 | N/A | 2 | 1 | 2 | 2 | 1 | 2 | 86% |
| Ljungquist | 2 | 2 | 2 | 0 | 0 | N/A | 2 | 2 | 2 | 2 | 2 | 2 | 82% |
| Ljungquist | 2 | 1 | 1 | 2 | 0 | 2 | 2 | 2 | 2 | 2 | 1 | 2 | 79% |
| Ljungquist | 1 | 1 | 1 | 2 | 1 | 1 | 2 | 2 | 2 | 2 | 2 | 2 | 79% |
| Trippolini | 2 | 2 | 1 | 1 | 0 | 0 | 2 | 2 | 2 | 2 | 2 | 2 | 75% |
| Ljungquist | 2 | 1 | 1 | 2 | 0 | N/A | 2 | 1 | 2 | 1 | 1 | 2 | 68% |
| Reneman | 1 | 2 | 1 | 1 | 1 | 0 | 1 | 2 | 2 | 2 | 2 | 1 | 67% |
| Reesink | – | – | – | – | – | – | – | – | – | – | – | – | N/A |
12-item evaluation tool (QACMRR) designed to assess the quality of studies determining measurement properties in outcome measures. Questions 1–12 in the tool evaluate aspects of study question, study design, measurements, analyses, and study recommendations.
*Paper is not applicable for completion of study quality tool
†KGV, known-groups validity
‡STC, sensitivity-to-change
§Rel, reliability
KGV, known-groups validity; rel, reliability; STC, sensitivity-to-change.
Psychometric properties of the functional capacity evaluation
| FCE battery | Type of properties | Statistical test | Value | Interpretation |
| Neck FCE | Test-retest | ICC | 0.39–0.96 | Poor-excellent |
| Measurement Error | Ratio of LoA | 32.0%–56.5% | ||
| Convergent Validity | Pearson or Spearman correlation | NDI total: 0.39–0.62 | Weak to moderate very weak to strong | |
| WAD FCE | Test-retest Reliability | ICC | 0.66–0.96 | moderate-excellent |
| Convergent Validity | Pearson Correlation | Pain* 0.31–0.39 | Weak | |
| Discriminative Validity (German vs Non-German) | Linear Regression Analysis | p<0.001 | Significant for All Tasks | |
| Discriminative Validity (sex) | t-test | p<0.001 | Significant for Two Tasks | |
| Workwell FCE | Convergent Validity | Pearson or Spearman Correlation | Work Capacity: 0.1–0.3 | Very Weak – weak |
| Predictive Validity | Pearson or Spearman Correlation | 0.06–0.39 | Very weak - Weak |
*Pain measured via Numeric Rating Scale.
FCE, Functional Capacity Evaluation; HADS-A, Hospital Anxiety and Depression Scale – Anxiety; HADS-D, Hospital Anxiety and Depression Scale – Depression; ICC, Intraclass correlation coefficient; LoA, Limits of Agreement; Mod, Moderate; NDI, Neck Disability Index; Neg, Negligible; SFS, Spinal Function Sort; Sig, Significant.
Summary of Fit-HaNSA’s psychometric properties in neck disorder patients
| Test | Type of property | Statistical test | Value | Interpretation |
| Fit-HaNSA | Intra-rater Reliability | ICC | 0.78 | good |
| Fit-HaNSA | Inter-rater Reliability | ICC | 0.84 | good |
| Fit-HaNSA | Measurement Error | SEM | 76 s | |
| Fit-HaNSA | Convergent Validity | Spearman Rank Correlation* | <0.4 ->0.75 | Weak – Strong |
| Fit-HaNSA | Discriminative WAD II vs Control | F-test | 62.6,<p,0.001 | Significant |
| Fit-HaNSA Functional Sub-tasks | Intra-rater reliability | ICC | 0.70–0.72 | moderate |
| Inter-reliability | ICC | 0.54–0.80 | –moderate – good | |
| Convergent Validity | Spearman Rank Correlation* | <0.4 ->0.75 | Weak - Strong | |
| Discriminative Validity WAD II vs Control | F-test | 42.0–53.3, p<0.001 | Significant |
*Correlations completed with Numeric Pain Rating Scale, Neck Disability Index, Disabilities of Arm, Shoulder, Hand and six cervical range of motion tests
Fit-HaNSA, Functional Impairment Test, Hand and Neck/Shoulder/Arm; ICC, Intraclass correlation coefficient; LOA95, 95% Limits of Agreement; MDC90, 90% Minimal Detectable Change; Mod, Moderate; SEM, SE of Measurement; WAD, Whiplash Associated Disorder.
Psychometric properties of Baltimore Therapeutic Equipment work simulator II–Power Output task
| Test | Type of property | Statistical test | Value | Interpretation |
| BTEWS II | Test-retest reliability | ICC | 0.53 | moderate |
| Spearman | 0.37 | Poor | ||
| BTEWS II | Measurement Error | SEM | 30.25 | |
| MDC90 | 70.59 | |||
| BTEWS II | Convergent Validity | Spearman | Not Reported | Weak |
| BTEWS II | Discriminative Validity (Pain vs Control) | Two-way Repeated Measures ANOVA | Not Reported | Non-significant |
*Spearman correlations completed with Numeric Rating Scale, Neck Disability Index and Shoulder Pain and Disability Index.
ANOVA, analysis of variance; ICC, intraclass correlation coefficient; MDC90, 90% minimal detectable change; SEM, SE of measurement.
Psychometric properties of performance-based tests included in physiotherapy test package
| Test | Type of property | Statistical test | Value | Interpretation |
| PILE-C | Inter-rater reliability | Mean difference |
0.24 2.46 and 1.82 | |
| PILE-C | Inter-rater reliability | Repeatability (2X SD) % of range | M=3.93; F=1.19 | |
| PILE-C | Convergent validity | Spearman correlation | CR-10: 0.55–0.65* | Moderate - Strong very weak - moderate |
| PILE-C | Discriminative: spinal pain vs control | Sensitivity and specificity | 0.93, 0.69 | Strong – Very Strong |
| PILE-C | Discriminative: spinal pain vs control | Wilcoxon sign ranked test | p=0.008 | Significant |
| PILE-C | Discriminative: high vs low pain intensity | Mann-Whitney U | p=0.003 | Significant |
| PILE-C | Discriminative: high vs low Pain behaviour | Mann-Whitney U | p=0.005 | Significant |
| PILE-C | Discriminative: high vs low perceived exertion | Mann-Whitney U | p=0.154 | Non-significant |
| PILE-C | Sensitivity to change | Effect Size | Subjects improving: 0.39–0.73 | Small – Moderate |
| PILE-L | Inter-rater reliability | Mean difference |
0.11 2.33 and 2.11 | |
| PILE-L | Intra-rater reliability | Repeatability | M=4.0; F=3.59 | |
| PILE-L | Convergent validity | Spearman correlation | CR-10: 0.11–0.45 | very weak – moderate very weak – moderate |
| PILE-L | Discriminative: spinal pain vs no spinal pain | Sensitivity and specificity | 0.85, 0.65 | Strong – Very Strong |
| PILE-L | Discriminative: spinal pain vs control | Wilcoxon sign ranked test | p=0.002 | Significant |
| PILE-L | Discriminative: high vs low pain intensity | Mann-Whitney U | p=0.001 | Significant |
| PILE-L | Discriminative: high vs low pain behaviour | Mann-Whitney U | p<0.001 | Significant |
| PILE-L | Discriminative: high vs low perceived exertion | Mann-Whitney U | p<0.001 | Significant |
| PILE-L | Sensitivity to change | Effect size | Subjects improving: 0.02–1.08 | Trivial – Large |
| 2×20 m WWB | Inter-rater reliability | Mean difference | 0.05 | |
| 2×20 m WWB | Intra-rater reliability | Repeatability | 3.2 | |
| 2×20 m WWB | Convergent validity | Spearman correlation | CR-10: 0.55–0.65Borg RPE: 0.10–0.48 | Moderate - Strong very weak – moderate |
| 2×20 m WWB | Discriminative: spinal pain vs control | Wilcoxon sign ranked test | p=0.014 | Significant |
| 2×20 m WWB | Discriminative: high vs low pain intensity | Mann Whitney U | p<0.001 | Significant |
| 2×20 m WWB | Discriminative: high vs low pain behaviour | Mann Whitney U | p<0.001 | Significant |
| 2×20 m WWB | Discriminative: high vs low perceived exertion | Mann Whitney U | p<0.001 | Significant |
| 2×20 m WWB | Sensitivity to change | Effect size | Subjects improving: 0.38–0.78 | Small – Moderate |
*CR-10: Measurement of pain construct
F, Female; KGV, Known-groups Validity; LoA, Limits of Agreement; M, Male; Mod., Moderate; Neg., Negligible;PILE-C, Progressive Iso-intertial Lifting Evaluation – Cervical; PILE-L, Progressive Iso-intertial Lifting Evaluation – Lumbar; RPE, Rating of perceived exertion.