| Literature DB >> 30870999 |
Fredrik Öhberg1, Tomas Bäcklund2, Nina Sundström3, Helena Grip4.
Abstract
Ordinal scales with low resolution are used to assess arm function in clinic. These scales may be improved by adding objective kinematic measures. The aim was to analyze within-subject, inter-rater and overall reliability (i.e., including within-subject and inter-rater reliability) and check the system's validity of kinematic measures from inertial sensors for two such protocols on one person. Twenty healthy volunteers repeatedly performed two tasks, finger-to-nose and drinking, during two test sessions with two different raters. Five inertial sensors, on the forearms, upper arms and xiphoid process were used. Comparisons against an optical camera system evaluated the measurement validity. Cycle time, range of motion (ROM) in shoulder and elbow were calculated. Bland⁻Altman plots and linear mixed models including the generalizability (G) coefficient evaluated the reliability of the measures. Within-subject reliability was good to excellent in both tests (G = 0.80⁻0.97) and may serve as a baseline when assessing upper extremities in future patient groups. Overall reliability was acceptable to excellent (G = 0.77⁻0.94) for all parameters except elbow axial rotation in finger-to-nose task and both elbow axial rotation and flexion/extension in drinking task, mainly due to poor inter-rater reliability in these parameters. The low to good reliability for elbow ROM probably relates to high within-subject variability. The sensors provided good to excellent measures of cycle time and shoulder ROM in non-disabled individuals and thus have the potential to improve today's assessment of arm function.Entities:
Keywords: arm function; inertial sensor; inter-rater reliability; kinematics; upper limb
Mesh:
Year: 2019 PMID: 30870999 PMCID: PMC6427602 DOI: 10.3390/s19051241
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1The picture shows sensor placement on one arm and on the xiphoid process.
Figure 2Examples of a subject’s movement pattern during the finger-to-nose and drinking tasks for one rater. The patterns are illustrated using a normalized time window that was defined by the beginning and end of each repetition, giving 10 movement curves for each test. The illustration from the drinking task is collected from the test where the glass was placed 30 cm in front of the subject. The abbreviations are F (flexion; positive angles), E (extension; negative angles), Ab (abduction; positive angles), Ad (adduction; negative angles), R in (pronation(elbow); inward humeral rotation (shoulder); positive angles) and R out (supination (elbow); outward humeral rotation (shoulder); negative angles).
Figure 3Bland–Altman analysis of the inertial and optical sensor systems. The data from one subject in one session of 10 repetitions. (a) finger-to-nose and (b) drinking task.
Figure 4Illustration of inter-rater reliability between the two raters for the outcome measures of the (a) finger-to-nose and (b) drinking task. The regression lines are estimated using QR decomposition. Data of all 10 repetitions and 20 subjects are plotted.
Figure 5Bland–Altman analysis of the two raters for all outcome measures in the (a) finger-to-nose and (b) drinking task. Data of all 10 repetitions and 20 subjects are plotted.
Linear mixed effect model for the finger-to-nose task.
| Side | Reliability | Cycle Time | Elbow FE | Elbow R | Shoulder FE | Shoulder AbAd | Shoulder R | |
|---|---|---|---|---|---|---|---|---|
| Dominant | Within-subject | 0.89 | 0.91 | 0.97 | 0.89 | 0.92 | 0.91 | |
| Inter-rater | 0.84 | 0.77 | 0.15 | 0.78 | 0.67 | 0.73 | ||
| G-coefficients | Overall | 0.94 | 0.88 | 0.26 | 0.91 | 0.83 | 0.87 | |
| Non-dominant | Within-subject | 0.91 | 0.94 | 0.96 | 0.90 | 0.92 | 0.94 | |
| Inter-rater | 0.83 | 0.62 | 0.30 | 0.74 | 0.76 | 0.82 | ||
| Overall | 0.94 | 0.77 | 0.47 | 0.88 | 0.87 | 0.92 | ||
| Dominant | Intercept | 2.35 s *** | 58.66° *** | −53.66° *** | 22.42° *** | −20.08° *** | 20.72° *** | |
| (0.12) | (4.14) | (4.70) | (2.81) | (1.72) | (1.89) | |||
| σ2rat | 0.00 | 13.71 | 0.00 | 3.99 | 0.00 | 0.29 | ||
| σ2rep | 0.00 | 0.00 | 5.90 | 0.76 | 0.62 | 0.14 | ||
| σ2subj | 0.28 | 186.75 | 113.81 | 107.42 | 48.16 | 60.17 | ||
| Model and | σ2rep_rat | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | |
| variance | σ2subj_rat | 0.03 | 37.48 | 629.23 | 17.02 | 18.76 | 16.16 | |
| components | σ2subj_rep | 0.01 | 2.72 | 1.10 | 2.20 | 0.57 | 0.92 | |
| σ2residual | 0.02 | 6.32 | 14.05 | 8.43 | 4.55 | 6.49 | ||
| Non-dominant | Intercept | 2.31 s *** | 61.35° *** | −54.47° *** | 21.87° *** | −16.26° *** | 19.94° *** | |
| (0.13) | (4.04) | (4.14) | (2.64) | (2.69) | (2.13) | |||
| σ2rat | 0.00 | 9.61 | 0.00 | 3.57 | 4.35 | 0.00 | ||
| σ2rep | 0.00 | 0.30 | 3.94 | 0.02 | 0.00 | 0.06 | ||
| σ2subj | 0.29 | 173.26 | 148.50 | 87.90 | 85.66 | 79.07 | ||
| σ2rep_rat | 0.00 | 0.00 | 0.00 | 0.14 | 0.09 | 0.00 | ||
| σ2subj_rat | 0.03 | 90.53 | 338.92 | 20.29 | 19.94 | 12.80 | ||
| σ2subj_rep | 0.00 | 1.89 | 2.41 | 1.24 | 1.60 | 1.42 | ||
| σ2residual | 0.03 | 4.74 | 14.53 | 6.52 | 3.47 | 4.69 |
*** p < 0.001, ** p < 0.01, * p < 0.05. Statistical model: Response ~ (1 | “Subject”) + (1 | “Rater”) + (1 | “Repetition”) + (1 | “Subject”:”Repetition”) + (1 | “Subject”:”Rater”) + (1 | “Repetition”:”Rater”). The dataset was stratified using the factor “arm dominance”. In the table, σ2sub, σ2rat, σ2rep, σ2rep_rat, σ2subj_rat, σ2subj_rep, and σ2residual are the variance between subjects, among raters, among repetitions, of the interaction between repetitions and raters, subjects and raters, subjects and repetitions, and residuals respectively. Values within parentheses indicate the mean square error of each parameter estimate. The reliability estimates (defined by the G-coefficient, absolute agreement) for the different responses are subdivided into within-subject, inter-rater, and overall reliability. The number of subjects = 20 and the number of raters = 2.
Linear mixed effect model for the drinking task at 30 cm distance.
| Side | Reliability | Cycle Time | Elbow FE | Elbow R | Shoulder FE | Shoulder AbAd | Shoulder R | |
|---|---|---|---|---|---|---|---|---|
| Dominant | Within-subject | 0.92 | 0.88 | 0.97 | 0.85 | 0.80 | 0.90 | |
| Inter-rater | 0.81 | 0.47 | 0.002 | 0.67 | 0.72 | 0.78 | ||
| G-coefficients | Overall | 0.93 | 0.68 | 0 | 0.86 | 0.93 | 0.90 | |
| Non-Dominant | Within-subject | 0.90 | 0.96 | 0.96 | 0.87 | 0.90 | 0.87 | |
| Inter-rater | 0.84 | 0.42 | 0.14 | 0.72 | 0.81 | 0.60 | ||
| Overall | 0.95 | 0.60 | 0.23 | 0.87 | 0.93 | 0.79 | ||
| Dominant | Intercept | 5.65 s *** | 90.76° *** | 20.09° *** | 5.68° ** | 14.69° *** | −5.00° ** | |
| (0.20) | (2.77) | (5.62) | (1.76) | (1.64) | (1.68) | |||
| σ2rat | 0.00 | 0.00 | 0.00 | 0.00 | 0.91 | 0.00 | ||
| σ2rep | 0.00 | 2.38 | 0.00 | 0.16 | 0.00 | 1.04 | ||
| σ2subj | 0.71 | 100.31 | 0.00 | 53.09 | 41.88 | 48.74 | ||
| Model and | σ2rep_rat | 0.00 | 0.88 | 0.00 | 0.00 | 0.14 | 0.00 | |
| variance | σ2subj_rat | 0.10 | 92.30 | 1260.94 | 16.06 | 4.53 | 10.01 | |
| components | σ2subj_rep | 0.01 | 0.00 | 2.08 | 1.22 | 0.57 | 0.77 | |
| σ2residual | 0.06 | 23.05 | 36.25 | 10.46 | 11.02 | 4.44 | ||
| Non-dominant | Intercept | 5.68 s *** | 97.79° *** | 24.75° *** | 2.94° | 15.84° *** | −3.15° | |
| (0.19) | (2.96) | (5.38) | (2.37) | (1.74) | (1.62) | |||
| σ2rat | 0.00 | 0.00 | 0.00 | 0.22 | 0.00 | 0.00 | ||
| σ2rep | 0.00 | 0.03 | 3.67 | 0.00 | 0.00 | 0.00 | ||
| σ2subj | 0.62 | 99.10 | 124.19 | 90.38 | 53.51 | 39.08 | ||
| σ2rep_rat | 0.00 | 0.40 | 0.00 | 0.82 | 0.09 | 0.21 | ||
| σ2subj_rat | 0.05 | 133.03 | 832.17 | 24.28 | 6.91 | 19.92 | ||
| σ2subj_rep | 0.01 | 1.36 | 9.89 | 4.21 | 0.68 | 1.33 | ||
| σ2residual | 0.07 | 7.43 | 23.90 | 11.52 | 5.94 | 7.22 |
*** p < 0.001, ** p < 0.01, * p < 0.05. Statistical model: Response ~ (1 | “Subject”) + (1 | “Rater”) + (1 | “Repetition”) + (1 | “Subject”:”Repetition”) + (1 | “Subject”:”Rater”) + (1 | “Repetition”:”Rater”). The dataset was stratified using the factor “arm dominance”. In the table, σ2sub, σ2rat, σ2rep, σ2rep_rat, σ2subj_rat, σ2subj_rep, and σ2residual are the variance between subjects, among raters, among repetitions, of the interaction between repetitions and raters, subjects and raters, subjects and repetitions, and residuals respectively. Values within parentheses indicate the mean square error of each parameter estimate. The reliability estimates (defined by the G-coefficient, absolute agreement) for the different responses are subdivided into within-subject, inter-rater, and overall reliability. The number of subjects = 20 and the number of raters = 2.
Linear mixed effect model for the drinking task at 7 cm distance.
| Side | Reliability | Cycle Time | Elbow FE | Elbow R | Shoulder FE | Shoulder AbAd | Shoulder R | |
|---|---|---|---|---|---|---|---|---|
| Dominant | Within-subject | 0.88 | 0.97 | 0.93 | 0.93 | 0.82 | 0.88 | |
| Inter-rater | 0.72 | 0.46 | 0.15 | 0.73 | 0.81 | 0.74 | ||
| G-coefficients | Overall | 0.88 | 0.64 | 0.26 | 0.87 | 0.90 | 0.89 | |
| Non-dominant | Within-subject | 0.88 | 0.97 | 0.93 | 0.90 | 0.88 | 0.86 | |
| Inter-rater | 0.75 | 0.49 | 0.25 | 0.50 | 0.59 | 0.67 | ||
| Overall | 0.90 | 0.67 | 0.40 | 0.69 | 0.77 | 0.86 | ||
| Dominant | Intercept | 5.11 *** | 56.22 *** | −12.61 * | 29.97 *** | −8.33 *** | 12.15 *** | |
| (0.18) | (4.80) | (4.96) | (2.72) | (1.98) | (2.45) | |||
| σ2rat | 0.00 | 20.72 | 0.00 | 3.32 | 0.00 | 2.33 | ||
| σ2rep | 0.01 | 0.97 | 0.48 | 0.00 | 7.53 | 0.36 | ||
| σ2subj | 0.58 | 167.39 | 128.74 | 101.04 | 57.90 | 86.98 | ||
| Model and | σ2rep_rat | 0.00 | 0.00 | 0.00 | 0.41 | 0.00 | 0.00 | |
| variance | σ2subj_rat | 0.15 | 166.03 | 717.93 | 25.69 | 9.82 | 17.05 | |
| components | σ2subj_rep | 0.01 | 1.55 | 4.79 | 1.19 | 1.22 | 2.13 | |
| σ2residual | 0.08 | 10.10 | 60.34 | 8.84 | 5.92 | 12.41 | ||
| Non-dominant | Intercept | 5.11 *** | 60.41 *** | −13.04 ** | 29.66 *** | −4.29 * | 15.13 *** | |
| (0.17) | (4.53) | (4.40) | (2.53) | (2.17) | (1.99) | |||
| σ2rat | 0.00 | 17.59 | 0.00 | 2.75 | 1.59 | 0.00 | ||
| σ2rep | 0.01 | 0.31 | 1.51 | 0.00 | 0.51 | 0.00 | ||
| σ2subj | 0.51 | 154.37 | 146.22 | 67.17 | 56.97 | 64.87 | ||
| σ2rep_rat | 0.00 | 0.00 | 0.32 | 0.26 | 0.10 | 0.00 | ||
| σ2subj_rat | 0.10 | 134.95 | 433.00 | 54.84 | 30.80 | 19.13 | ||
| σ2subj_rep | 0.00 | 1.45 | 8.57 | 1.65 | 2.91 | 0.68 | ||
| σ2residual | 0.07 | 8.45 | 32.93 | 11.19 | 9.16 | 13.18 |
*** p < 0.001, ** p < 0.01, * p < 0.05. Statistical model: Response ~ (1 | “Subject”) + (1 | “Rater”) + (1 | “Repetition”) + (1 | “Subject”:”Repetition”) + (1 | “Subject”:”Rater”) + (1 | “Repetition”:”Rater”). The dataset was stratified using the factor “arm dominance”. In the table, σ2sub, σ2rat, σ2rep, σ2rep_rat, σ2subj_rat, σ2subj_rep, and σ2residual are the variance between subjects, among raters, among repetitions, of the interaction between repetitions and raters, subjects and raters, subjects and repetitions, and residuals respectively. Values within parentheses indicate the mean square error of each parameter estimate. The reliability estimates (defined by the G-coefficient, absolute agreement) for the different responses are subdivided into within-subject, inter-rater, and overall reliability. The number of subjects = 20 and the number of raters = 2.
Linear mixed effect model for the drinking task at 50 cm distance.
| Side | Reliability | Cycle Time | Elbow FE | Elbow R | Shoulder FE | Shoulder AbAd | Shoulder R | |
|---|---|---|---|---|---|---|---|---|
| Dominant | Within-subject | 0.91 | 0.87 | 0.98 | 0.86 | 0.91 | 0.86 | |
| Inter-rater | 0.78 | 0.51 | 0.02 | 0.72 | 0.79 | 0.70 | ||
| Overall | 0.90 | 0.73 | 0.04 | 0.90 | 0.92 | 0.87 | ||
| G-coefficients | ||||||||
| Non-dominant | Within-subject | 0.89 | 0.94 | 0.97 | 0.87 | 0.85 | 0.86 | |
| Inter-rater | 0.83 | 0.25 | 0.24 | 0.67 | 0.73 | 0.76 | ||
| Overall | 0.94 | 0.39 | 0.39 | 0.85 | 0.89 | 0.92 | ||
| Dominant | Intercept | 6.32 *** | 106.47 *** | 17.23 ** | −8.03 *** | 15.69 *** | −3.65 * | |
| (0.21) | (2.58) | (5.91) | (2.36) | (1.83) | (1.75) | |||
| σ2rat | 0.00 | 3.17 | 0.00 | 0.00 | 0.00 | 0.58 | ||
| σ2rep | 0.00 | 0.02 | 0.00 | 0.13 | 0.00 | 1.25 | ||
| σ2subj | 0.77 | 74.92 | 25.21 | 100.03 | 61.76 | 46.20 | ||
| σ2rep_rat | 0.00 | 0.00 | 1.48 | 0.00 | 0.00 | 0.00 | ||
| Model and | σ2subj_rat | 0.17 | 51.27 | 1342.60 | 20.93 | 10.28 | 12.11 | |
| variance | σ2subj_rep | 0.03 | 0.00 | 4.31 | 0.00 | 0.49 | 0.38 | |
| components | σ2residual | 0.06 | 18.56 | 24.63 | 18.96 | 6.63 | 8.23 | |
| Non-dominant | Intercept | 6.28 *** | 114.38 *** | 29.04 *** | −11.99 *** | 16.16 *** | −4.74 ** | |
| (0.19) | (3.86) | (6.40) | (2.56) | (1.77) | ||||
| σ2rat | 0.00 | 20.71 | 0.00 | 1.20 | 0.00 | 0.08 | ||
| σ2rep | 0.00 | 0.23 | 3.11 | 0.00 | 0.00 | 0.30 | ||
| σ2subj | 0.62 | 37.73 | 302.47 | 97.08 | 53.01 | 48.27 | ||
| σ2rep_rat | 0.00 | 0.00 | 0.80 | 0.00 | 0.12 | 0.00 | ||
| σ2subj_rat | 0.07 | 94.76 | 935.13 | 30.81 | 11.60 | 6.90 | ||
| σ2subj_rep | 0.02 | 3.66 | 5.39 | 2.83 | 2.25 | 0.18 | ||
| σ2residual | 0.07 | 6.41 | 22.83 | 16.34 | 8.68 | 8.61 |
*** p < 0.001, ** p < 0.01, * p < 0.05. Statistical model: Response ~ (1 | “Subject”) + (1 | “Rater”) + (1 | “Repetition”) + (1 | “Subject”:”Repetition”) + (1 | “Subject”:”Rater”) + (1 | “Repetition”:”Rater”). The dataset was stratified using the factor “arm dominance”. In the table, σ2sub, σ2rat, σ2rep, σ2rep_rat, σ2subj_rat, σ2subj_rep, and σ2residual are the variance between subjects, among raters, among repetitions, of the interaction between repetitions and raters, subjects and raters, subjects and repetitions, and residuals respectively. Values within parentheses indicate the mean square error of each parameter estimate. The reliability estimates (defined by the G-coefficient, absolute agreement) for the different responses are subdivided into within-subject, inter-rater, and overall reliability. The number of subjects = 20 and the number of raters = 2.
Figure A1Illustrates the overall reliability (i.e., G-coefficient) at different distance to the glass, with different number of repetitions (assuming there is only one rater present) for the drinking task, with data from one rater. The results are subdivided into dominant arm (left) and non-dominant arm (right) for the different outcome measures. The dotted line marks the level for acceptable reliability; G = 0.7.
Figure 6Illustrates the overall reliability (i.e., G-coefficient) estimated with a different number of repetitions for the finger-to-nose and drinking tasks based on one single rater. The results are subdivided into dominant arm (left) and non-dominant arm (right) for the different outcome measures. The dotted line marks the level for acceptable reliability, G = 0.7.