| Literature DB >> 25880033 |
Margit Alt Murphy1, Carol Resteghini2, Peter Feys3, Ilse Lamers4.
Abstract
BACKGROUND: Although use of standardized and scientifically sound outcome measures is highly encouraged in clinical practice and research, there is still no clear recommendation on which tools should be preferred for upper extremity assessment after stroke. As the aims, objectives and methodology of the existing reviews of the upper extremity outcome measures can vary, there is a need to bring together the evidence from existing multiple reviews. The purpose of this review was to provide an overview of evidence of the psychometric properties and clinical utility of upper extremity outcome measures for use in stroke, by systematically evaluating and summarizing findings from systematic reviews.Entities:
Mesh:
Year: 2015 PMID: 25880033 PMCID: PMC4359448 DOI: 10.1186/s12883-015-0292-6
Source DB: PubMed Journal: BMC Neurol ISSN: 1471-2377 Impact factor: 2.474
Figure 1Flowchart over the search strategy and article selection process (according to the PRISMA guidelines).
Methodological quality assessment of the included systematic reviews
|
|
|
|
| |||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
| ||
| Ashford 2008 [ | Identify valid and reliable OM (real-life function) | Yes/84 | Yes | Yes | Yes | Yes | Level | Standard & criteria |
| Baker 2011 [ | Selection strategy and identification of scientifically sound UE OM suitable for robot trials | Yes/230 | Yes | - | - | Yes | Partly | Standard |
| Connell 2012 [ | Review psychometrics and clinical utility of UE OM | Yes/NR | Yes | Yes | Yes | Yes | Yes | Standard & criteria |
| Croarkin 2004 [ | Review and evaluate psychometrics of UE motor function tests | Yes/170 | Yes | Yes | - | Yes | Yes | Standard & criteria |
| Gebruers 2010 [ | Assess psychometrics and clinical applicability of accelerometry measures | Yes/25 | Yes | Yes | - | Yes | yes | No |
| Hillier 2010 [ | Develop and evaluate a process of OM selection for community settings | Yes/300 | Yes | - | 20% | No | No | Standard |
| Lemmens 2012 [ | Identify, evaluate, categorize valid and reliable activity level UE OM | Yes/747 | Yes | Yes | - | Yes | No | No |
| Platz 2005 [ | Review evidence of psychometric properties of OM for spasticity | Yes/110 | Yes | Yes | Yes | Yes | Partly | No |
| Simpson 2013 [ | Review the responsiveness of OM for UE recovery | Yes/68 | Yes | - | - | Yes | Yes | No |
| Sivan 2011 [ | Classify, evaluate UE OM used in robot-assisted trials | Yes/28 | Yes | Yes | Yes | Yes | Level | Standard & criteria |
| Tse 2013 [ | Identify, evaluate the psychometrics of participation OM | Yes/119 | Yes | - | - | Yes | Yes | Standard & criteria |
| Van Peppen 2007 [ | Develop clinical practice guideline for physiotherapy (OM, intervention, prognosis) | Yes/32 | Yes | Yes | Yes | Yes | Yes | Standard & criteria |
| Velstra 2011 [ | Review reliability, responsiveness and content validity of UE OM | Yes/44 | Yes | Yes | - | Yes | No | Not reported |
Abbreviations: OM outcome measures, UE upper extremity.
Overview of the measurement level, target population, upper extremity outcome measures included in the reviews and recommended or meeting the criteria of psychometrics as reported in the primary reviews
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|
|
|
|
| |||||
| Ashford 2008 [ | Activity (real-life functioning) | stroke, brain injury | 6 | MAL (12,14, 26, 28 items), ABILHAND, Leeds Adult Spasticity Impact Scale | 1 |
| Met 9 of 11 criteria |
| Baker 2011 [ | Body function and activity (robot-assisted trials) | stroke | 25 | ARAT, CAHAI, 10s test, AMAT, WMFT, FMA, MSS, MAS, DeSouza, RMA, STREAM, MESUPE, MI, NHPT, FAT, Sodring Motor Evaluation Test, Sollerman, MCA, MMAC, BBT, Functional Test; Patient-reported: DHI, MAL, ABILHAND, UMAQS | 3 Additional scales (2) |
| MOT and FDA standards; psychometrics provided for CAHAI, STREAM, ABILHAND |
| Connell 2012 [ | Body function and activity (clinical utility) | neurologic conditions | 11 | BBT, NHPT, ARAT, ABILHAND, MAL (14,26), RMA, MSS, Sollerman, Simplified STREAM, Fitts Reaching test | 2 |
| Clinical utility criteria of ≥8; criteria of validity, intra/inter-rater reliability, ability to detect change |
| Croarkin 2004 [ | Body function and functional limitation (no disability scales) | stroke | 9 | ARAT, CMSA, FMA, MMAC, MAS, MCA, MI, NHPT, RMA | 6 |
| Met 2 of 3 criteria: validity, inter-rater, test-retest reliability; psychometrics provided |
| Gebruers 2010 [ | Activity (accelerometry) | stroke | NA | Accelerometry | NA | NA | No specific criteria; psychometrics provided |
| Hillier 2010 [ | ICF (clinical use) | stroke | 7 | Manual muscle testing, Tardieu Scale, WMFT, Grip strength, CAHAI, Hand Active Sensation Test, NHPT | 2 |
| Standards described for reliability, validity, responsiveness, utility; psychometrics not provided |
| Lemmens 2012 [ | Activity | stroke, CP | 17 | AMAT, CAHAI, FAT, TEMPA, ARAT, JHFT, MESUPE, WMFT, ABILHAND, MAL, DHI, UBDS, Actual amount of use, Functional test, MFT, Hand Function Survey, Accelerometry | 9 | AMAT, CAHAI, FAT, UBDS, ARAT, JHFT, WMFT, DHI, MAL-26 | No specific criteria; reference provided for validity, reliability, responsiveness; psychometrics not provided |
| Platz 2005 [ | Body function (spasticity) | stroke, MS, SCI and CP with spasticity | 11 | Ashworth Scale (original, modified, velocity corrected), Muscle Tone Scale, Modified Tardieu Scale, VAS for tone, Tone assessment Scale, ROM (goniometer, estimation), Finger curl test, Tendon reflex scale | 0 | - | No specific criteria; psychometrics not provided |
| Simpson 2013 [ | Activity (responsiveness) | stroke | 14 | ABILHAND, AMAT, ARAT, Accelerometry, CAHAI, DHI, FAT, Functional Test, Hand Function Survey, JHFT, MAL, SIS, TEMPA, WMFT | 5 | ABILHAND, ARAT, MAL, SIS, WMFT | No specific criteria; MCID values provided |
| Sivan 2011 [ | ICF (robot-assisted trials) | Stroke | 17 | FMA, MSS, CMSA, Ashworth Scale, MRC, Kinematics, Grip strength, NHPT, BBT, ARAT, WMFT, CAHAI, AMAT, RMA (arm), FAT, MAS, ABILHAND | 5 |
| Criteria high/excellent/moderate for validity, reliability, responsiveness provided |
| Tse 2013 [ | Participation | stroke | 1 | SIS | 0 |
| Criteria for reliability, internal consistency, validity |
| van Peppen 2007 [ | ICF (clinical utility for physiotherapy practice) | stroke | 10 | Core set: MI, FAT; Optional: ROM, Numeric Pain Rating Scale, Nottingham Sensory Assessment, Modified Ashworth Scale, FMA, Hand volumeter, ARAT, NHPT | 2 |
| Level of evidence at least 2 (psychometric properties, clinical utility, ICF) |
| Velstra 2011 [ | ICF (reliability, responsiveness) | stroke, tetraplegia, peripheral or reumathology conditions | 8 | Ashworth Scale, ARAT, MAL, WMFT, JHFT, FMA, Muscle strength, ROM | 2 | ARAT, MAL | No specific criteria; grading very good/good for reliability, internal consistency and responsiveness provided; psychometrics not provided |
*Reviews did not provide standards, criteria or sufficient information on psychometrics needed for qualification; OM printed in italic were included into the final set (n = 13); Abbreviations: ICF International Classification of Functioning, Disability and Health, OM outcome measures, UE upper extremity, MS Multiple sclerosis, SCI spinal cord injury, CP cerebral palsy, CAHAI Chedoke Arm Hand Activity Inventory, STREAM Stroke Rehabilitation, Assessment Movement, BBT Box and Block Test, ARAT Action Research Arm Test, NHPT Nine Hole Peg Test, FMA Fugl-Meyer Assessment (motor), MI Motoricity Index, CMSA Chedoke-McMaster Stroke Assessment, MAS Motor Assessment Scale, MAL Motor Activity Log, SIS Stroke Impact Scale, WMFT Wolf Motor Function Test, FAT Frenchay Arm Test (abbreviations for all OM are provided in the end section of the paper).
Figure 2Overview of outcome measures (OM) included in the reviews more than once (gray bars) and the number of times the OM met the criteria set for psychometric properties as reported in the reviews (black bars).
Summary of psychometric properties and clinical utility of the outcome measures of impaired body function that met the standards or criteria set for the psychometric properties by the authors of the reviews
|
|
|
|
|
| |||
|---|---|---|---|---|---|---|---|
|
| [ | [ | [ | [ | [ | [ | [ |
|
| |||||||
| Content validity | + | NA | |||||
| Internal consistency | *** | + | NA | ||||
| Construct validity | + | *** | *** | + | ** | ||
| Concurrent validity | + | *** | + | + | + | ||
| Floor/ceiling effect | ** | NR | NR | ||||
| Intra-rater, test-retest reliability | + | *** | + | NR | + | *** | |
| Inter-rater reliability | + | *** | + | + | *** | + | |
| Responsiveness | ** | * | NR | *** | |||
| MCID, points | 7p | NR | NR | ||||
| MDC/SDD, points | 12p | ||||||
|
| |||||||
| Time to administer, minutes | 20 | 10-15 | 60 | varies | |||
| Administration burden | ** | * | ** | ||||
+Met the criteria set by the authors of the reviews (not graded); ***High/excellent/very good; **Moderate/good/adequate; *Low/poor; Administration burden includes: administration, scoring, interpretation and cost. Abbreviations: NR Not reported, NA Not applicable, p points, s seconds, MCID Minimal clinically important difference, MDC Minimal detectable change, SDD smallest detectable difference, empty space: not covered by the reviews; FMA Fugl-Meyer Assessment (motor), MI Motoricity Index, CMSA Chedoke-McMaster Stroke Assessment, STREAM Stroke Rehabilitation, Assessment Movement, FMA demonstrated a high level of measurement quality and clinical utility.
Summary of psychometric properties and clinical utility of the outcome measures of activity limitation that met the standards or criteria set for the psychometric properties by the authors of the reviews
|
|
|
|
|
|
|
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| [ | [ | [ | [ | [ | [ | [ | [ | [ | [ | [ | [ | [ | [ | [ |
|
| |||||||||||||||
| Content validity | + | + | NA | ||||||||||||
| Internal consistency | *** | + | *** | + | NA | ||||||||||
| Construct validity | *** | + | ** | + | *** | *** | + | ** | NR | + | *** | + | *** | ||
| Concurrent validity | + | + | + | *** | + | + | + | + | |||||||
| Floor/ceiling effect | NR | * | NR | * | * | * | NR | NR | |||||||
| Intra-rater/test-retest reliability | + | *** | + | *** | + | NR | *** | + | + | *** | *** | + | |||
| Inter-rater reliability | + | *** | + | *** | + | *** | *** | + | *** | *** | + | *** | + | *** | |
| Responsiveness | NR | ** | + | *** | ** | + | ** | * | NR | NR | |||||
| MCID, points, seconds | 6 blocks | 6p | 6.3p | 12p | 0.26-0.35 logitsa,d | NR | 32.8 s | ||||||||
| 12-17pc | 0.14-1.2pa,b,c | ||||||||||||||
| MDC/SDD/SRD | + | + | 1.3p | 32.8 s | |||||||||||
|
| |||||||||||||||
| Time to administer, minutes | 2 | 10 | 25 | 10-12 | ≥10 | 5-10 | 20-30 | 2 | |||||||
| Administration burden | *** | ** | ** | ** | ** | *** | ** | *** | |||||||
+Met the criteria set by the authors of the reviews (not graded); ***high/excellent/very good; **moderate/good/adequate; *low/poor; aequal to effect size 0.2, bclinical scale anchor, cglobal rating, dpercentage of recovery (Simpson [35]); empty space, not covered by the reviews; Administration burden includes: administration, scoring, interpretation and cost. Abbreviations: NA Not applicable, NR not reported, MCID minimal clinically important difference, MDC minimal detectable change, SDD smallest detectable difference, SRD smallest real difference, BBT Box and Block Test, ARAT Action Research Arm Test, CAHAI Chedoke Arm Hand Activity Inventory, WMFT Wolf Motor Function Test, FAT Frenchay Arm Test, MAS Motor Assessment Scale, NHPT Nine Hole Peg Test, BBT, ARAT, CAHAI, WMFT, ABILHAND demonstrated a high level of measurement quality and clinical utility.
Standards and criteria for psychometrics and clinical utility provided by the authors of the reviews
|
|
|
|---|---|
| Ashford [ | Content validity, internal consistency, construct validity, test-retest reliability, agreement, responsiveness, interpretability: adequate design, method and results (Chronbach’s α: adequate 0.7-0.9, ICC: > 0.70); minimal clinically important difference presented, floor/ceiling effect ≤ 15%, time to administer < 10 min, administration burden: easy to sum up the items |
| Baker [ | Psychometric testing have been performed |
| Connell [ | Clinical utility criteria of ≥ 8points (time to administer and interpret ≤ 30 min, cost ≤ ₤ 100, simple equipment, portability), reliability/validity (kappa, correlation coefficients, ICC/r: strong ≥ 0.80, moderate 0.6-0.8, weak 0.4-0.6), ability to detect change (measurement error, standardized response mean, standardized error of measurement, limits of agreement, minimal detectable change) |
| Croarkin [ | Significant correlations (p < 0.05) for test-retest, inter-rater reliability and validity (convergent, concurrent): level of evidence 1 = meets all 3 psychometrics criteria, level 2 = meets 2 of 3 criteria |
| Hillier [ | Sound psychometrics: content and construct validity, reliability, sensitivity to change, utility (interpretability, acceptability, relevance) |
| Simpson [ | MCID values calculated (related to effect size 0.2, anchor-based method using clinical scale, global rating, percentage of recovery) |
| Sivan [ | Test-retest reliability (ICC/kappa: high/excellent ≥ 0.75, moderate 0.40-0.74, poor <0.40); internal consistency (Chronbach’s α: high/excellent > 0.80, adequate 0.70-0.79, low < 0.70); validity (correlation coefficient: excellent r > 0.60, adequate 0.30-0.59, poor <0.30), area under the curve: excellent > 0.90, adequate 0.70-0.89, poor < 0.70); responsiveness (effect size: large > 0.8, moderate 0.5-0.79, small <0.50; other adequate responsiveness methods, MCID value); floor/ceiling effect (excellent 0%, adequate < 20%, poor > 20%), respondent burden: (time, acceptance: excellent < 15 min, adequate: longer time, lower acceptability; poor: lengthy, acceptability problem); administrative burden (excellent: scoring by hand, easy to interpret, adequate: computer scoring, obscure interpretation; poor: costly, complex scoring/interpretation) |
| Tse [ | Inter-rater, test-retest reliability (kappa/r/ICC > 0.75), internal consistency (Chronbach’s α > 0.80), content validity, construct validity (adequate method, r ≥ 0.60) |
| van Peppen [ | Valid for stroke, test-retest reliability and concurrent validity (ICC/r ≥ 0.70), responsiveness (high/low), time to administer ≤ 15 min, test-protocol available: level of evidence 1 = meets all 6 criteria, level 2 = meets 5 of 6 criteria |
| Velstra [ | Reliability (correlation coefficient, kappa, Chronbach’s α, ICC): very good or good/moderate; Responsiveness (effect size, standardized response mean): moderate or large |
Abbreviations: ICC Intraclass coefficient, MCID minimal clinically important difference.
Figure 3Publication years for the primary references used in the systematic reviews and years when the reviews were performed, presented separately for every outcome measure included into the final set of measures.
Figure 4Total number of references used in the reviews and number of references that were only used in one review (unique references) presented for outcome measures included into the final set.