| Literature DB >> 32450851 |
Mary Roduta Roberts1, Megan Cook2, Iris C I Chao2.
Abstract
BACKGROUND: A common feature of performance assessments is the use of human assessors to render judgements on student performance. From a measurement perspective, variability among assessors when assessing students may be viewed as a concern because it negatively impacts score reliability and validity. However, from a contextual perspective, variability among assessors is considered both meaningful and expected. A qualitative examination of assessor cognition when assessing student performance can assist in exploring what components are amenable to improvement through enhanced rater training, and the extent of variability when viewing assessors as contributing their individual expertise. Therefore, the purpose of this study was to explore assessor cognition as a source of score variability in a performance assessment of practice-based competencies.Entities:
Keywords: Assessor cognition; Generalizability theory; Mixed method; OSCE; Occupational therapy; Performance assessment; Think aloud; Validity
Mesh:
Year: 2020 PMID: 32450851 PMCID: PMC7249646 DOI: 10.1186/s12909-020-02077-6
Source DB: PubMed Journal: BMC Med Educ ISSN: 1472-6920 Impact factor: 2.463
Mean ratings by domain and assessor
| d1 | 5.14 (1.66) | 5.14 (0.53) | 5.50 (1.29) | 5.23 (0.72) | 5.82 (1.25) | 5.23 (0.93) | 5.36 (0.80) |
| d2 | 5.29 (1.44) | 4.86 (1.23) | 4.92 (1.71) | 4.75 (1.22) | 5.00 (0.77) | 4.92 (0.95) | 5.32 (1.14) |
| d3 | 5.29 (1.54) | 5.14 (0.95) | 5.36 (1.34) | 5.46 (0.97) | 5.80 (1.03) | 4.67 (0.89) | 4.82 (0.98) |
| d4 | 5.36 (1.39) | 4.86 (0.95) | 5.86 (0.95) | 5.38 (1.04) | 5.91 (1.30) | 4.54 (0.88) | 5.36 (0.81) |
| d5 | 5.29 (1.33) | 5.07 (1.49) | 5.71 (0.99) | 5.46 (0.97) | 5.00 (1.00) | 4.62 (0.87) | 5.00 (0.77) |
| d6 | 5.07 (1.82) | 4.43 (1.60) | 5.57 (0.85) | 5.31 (0.85) | 5.09 (0.94) | 5.00 (1.34) | 4.65 (1.11) |
| Holistic | 8.31 (0.96) | 7.89 (0.63) | 8.89 (1.05) | 7.96 (0.44) | 7.64 (0.34) | 8.12 (0.75) | 7.96 (0.49) |
| d1 | 6.43 (0.76) | 5.93 (1.32) | 5.62 (0.87) | 5.92 (1.04) | 5.92 (1.04) | 5.73 (0.90) | 5.62 (1.04) |
| d2 | 6.08 (1.03) | 6.14 (1.16) | 5.38 (0.65) | 4.92 (1.55) | 5.92 (1.32) | 5.91 (0.70) | 5.08 (1.04) |
| d3 | 6.07 (0.99) | 5.00 (1.46) | 5.33 (0.65) | 4.46 (1.05) | 5.60 (0.97) | 5.82 (0.87) | 4.69 (1.18) |
| d4 | 6.00 (0.85) | 5.14 (1.79) | 5.62 (0.87) | 4.92 (0.86) | 5.62 (1.26) | 5.82 (0.75) | 4.15 (0.99) |
| d5 | 5.43 (1.01) | 5.57 (1.45) | 5.46 (0.78) | 5.15 (1.21) | 5.54 (1.39) | 4.73 (0.65) | 4.46 (0.88) |
| d6 | 5.54 (1.05) | 4.92 (1.82) | 5.17 (0.39) | 4.46 (1.61) | 4.83 (1.53) | 5.27 (0.90) | 4.77 (1.09) |
| Holistic | 8.36 (0.93) | 8.75 (1.07) | 8.15 (0.47) | 7.82 (0.71) | 8.12 (0.85) | 7.95 (0.39) | 8.04 (0.85) |
The score of each domain ranges from 1 to 7; the holistic score ranges from 1 to 10. Standard deviations are presented in parentheses
Note: d = domain; d1 = professionalism; d2 = communication; d3 = theory, models and/or frame of reference; d4 = knowledge of client; d5 = clinical reasoning; d6 = evidence-based practice
Pearson correlations between domains by assessor
| Assessor | d2 | d3 | d4 | d5 | d6 | holistic | |
|---|---|---|---|---|---|---|---|
| 1 | 0.92 | 0.86 | 0.91 | 0.89 | 0.89 | 0.96 | |
| 2 | 0.50 | −0.04 | 0.04 | 0.37 | 0.46 | 0.51 | |
| 3 | 0.87 | 0.34 | 0.10 | 0.14 | 0.37 | 0.58 | |
| 4 | 0.47 | 0.45 | 0.31 | 0.66 | 0.68 | 0.53 | |
| 5 | 0.41 | −0.20 | 0.10 | 0.28 | 0.10 | 0.15 | |
| 6 | 0.36 | 0.50 | 0.44 | 0.75 | 0.63 | 0.76 | |
| 7 | 0.61 | 0.65 | 0.38 | 0.65 | 0.64 | 0.73 | |
| 8 | 0.88 | 0.77 | 0.65 | 0.72 | 0.52 | 0.88 | |
| 9 | 0.70 | 0.59 | 0.52 | 0.58 | 0.70 | 0.77 | |
| 10 | 0.39 | 0.69 | 0.34 | 0.58 | 0.62 | 0.57 | |
| 11 | 0.36 | −0.04 | 0.55 | 0.27 | 0.62 | 0.58 | |
| 12 | 0.25 | 0.79 | 0.35 | 0.39 | 0.11 | 0.68 | |
| 13 | 0.27 | 0.31 | 0.36 | −0.14 | 0.10 | 0.29 | |
| 14 | 0.49 | 0.64 | 0.30 | 0.48 | 0.65 | 0.77 | |
| 1 | 0.79 | 0.94 | 0.88 | 0.82 | 0.88 | ||
| 2 | −0.51 | −0.28 | 0.09 | 0.42 | 0.28 | ||
| 3 | 0.44 | 0.35 | 0.28 | 0.60 | 0.79 | ||
| 4 | 0.84 | 0.78 | 0.86 | 0.67 | 0.94 | ||
| 5 | 0.53 | 0.62 | 0.69 | 0.68 | 0.83 | ||
| 6 | 0.54 | 0.94 | 0.46 | 0.80 | 0.78 | ||
| 7 | 0.57 | 0.18 | 0.34 | 0.69 | 0.91 | ||
| 8 | 0.77 | 0.65 | 0.80 | 0.63 | 0.94 | ||
| 9 | 0.40 | 0.50 | 0.45 | 0.33 | 0.71 | ||
| 10 | 0.44 | 0.69 | 0.56 | 0.81 | 0.73 | ||
| 11 | 0.33 | 0.37 | 0.85 | 0.51 | 0.69 | ||
| 12 | 0.11 | 0.40 | 0.42 | 0.28 | 0.61 | ||
| 13 | −0.03 | −0.03 | −0.06 | 0.04 | −0.02 | ||
| 14 | 0.63 | 0.48 | 0.14 | 0.68 | 0.66 | ||
| 1 | 0.77 | 0.79 | 0.73 | 0.87 | |||
| 2 | 0.37 | 0.21 | 0.06 | 0.03 | |||
| 3 | 0.72 | 0.78 | 0.75 | 0.78 | |||
| 4 | 0.84 | 0.87 | 0.65 | 0.85 | |||
| 5 | 0.46 | −0.05 | 0.13 | 0.55 | |||
| 6 | 0.55 | 0.82 | 0.59 | 0.66 | |||
| 7 | 0.32 | 0.83 | 0.74 | 0.70 | |||
| 8 | 0.62 | 0.60 | 0.65 | 0.82 | |||
| 9 | 0.79 | 0.79 | 0.81 | 0.86 | |||
| 10 | 0.26 | 0.44 | 0.37 | 0.79 | |||
| 11 | 0.41 | 0.46 | 0.31 | 0.46 | |||
| 12 | 0.63 | 0.61 | 0.53 | 0.83 | |||
| 13 | 0.25 | 0.08 | 0.83 | 0.61 | |||
| 14 | 0.69 | 0.55 | 0.84 | 0.84 | |||
| 1 | 0.94 | 0.90 | 0.89 | ||||
| 2 | 0.23 | 0.45 | 0.36 | ||||
| 3 | 0.84 | 0.87 | 0.74 | ||||
| 4 | 0.71 | 0.32 | 0.66 | ||||
| 5 | 0.39 | 0.27 | 0.80 | ||||
| 6 | 0.46 | 0.77 | 0.81 | ||||
| 7 | 0.32 | 0.64 | 0.12 | ||||
| 8 | 0.64 | 0.35 | 0.68 | ||||
| 9 | 0.94 | 0.78 | 0.88 | ||||
| 10 | 0.51 | 0.59 | 0.55 | ||||
| 11 | 0.33 | 0.45 | 0.59 | ||||
| 12 | 0.97 | 0.73 | 0.71 | ||||
| 13 | 0.30 | 0.37 | 0.71 | ||||
| 14 | 0.68 | 0.50 | 0.44 | ||||
| 1 | 0.92 | 0.93 | |||||
| 2 | 0.53 | 0.71 | |||||
| 3 | 0.84 | 0.74 | |||||
| 4 | 0.82 | 0.92 | |||||
| 5 | 0.59 | 0.60 | |||||
| 6 | 0.67 | 0.75 | |||||
| 7 | 0.61 | 0.63 | |||||
| 8 | 0.73 | 0.82 | |||||
| 9 | 0.89 | 0.87 | |||||
| 10 | 0.81 | 0.73 | |||||
| 11 | 0.73 | 0.84 | |||||
| 12 | 0.72 | 0.71 | |||||
| 13 | 0.48 | 0.57 | |||||
| 14 | 0.29 | 0.48 | |||||
| 1 | 0.92 | ||||||
| 2 | 0.70 | ||||||
| 3 | 0.86 | ||||||
| 4 | 0.75 | ||||||
| 5 | 0.49 | ||||||
| 6 | 0.78 | ||||||
| 7 | 0.66 | ||||||
| 8 | 0.69 | ||||||
| 9 | 0.80 | ||||||
| 10 | 0.69 | ||||||
| 11 | 0.90 | ||||||
| 12 | 0.65 | ||||||
| 13 | 0.84 | ||||||
| 14 | 0.91 |
Note: d = domain; d1 = professionalism; d2 = communication; d3 = theory, models and/or frame of reference; d4 = knowledge of client; d5 = clinical reasoning; d6 = evidence-based practice
N = 92 (listwise deletion)
Pearson correlations between domains and the holistic score
| d2 | d3 | d4 | d5 | d6 | holistic | |
|---|---|---|---|---|---|---|
| d1 | 0.59 | 0.45 | 0.40 | 0.44 | 0.52 | 0.62 |
| d2 | 0.44 | 0.45 | 0.47 | 0.51 | 0.69 | |
| d3 | 0.64 | 0.54 | 0.59 | 0.62 | ||
| d4 | 0.64 | 0.61 | 0.57 | |||
| d5 | 0.68 | 0.72 | ||||
| d6 | 0.71 |
Note: d = domain; d1 = professionalism; d2 = communication; d3 = theory, models and/or frame of reference; d4 = knowledge of client; d5 = clinical reasoning; d6 = evidence-based practice
Weighted average of variance components and proportions for student performance ratings from a two-facet fully crossed p x d x r design
| 9.3 | 88.65 | 9.55 | 0.64 | 0.40 | |
| 5.0 | 8.87 | 1.77 | 0.03 | 0.03 | |
| 1.0 | 4.06 | 4.06 | 0.05 | 0.03 | |
| 46.4 | 35.09 | 0.76 | 0.16 | 0.13 | |
| 9.3 | 10.30 | 1.11 | 0.12 | 0.08 | |
| 5.0 | 6.90 | 1.38 | 0.08 | 0.05 | |
| 46.4 | 20.39 | 0.44 | 0.42 | 0.29 |
Weighted average of variance components and proportions for student performance ratings from a one-facet fully crossed p x r design
| 12.00 | 1315.64 | 105.77 | 49.35 | 0.81 | |
| 1.00 | 41.71 | 41.71 | 2.86 | 0.07 | |
| 12.00 | 88.29 | 7.07 | 7.07 | 0.13 |
Mark the square that best represents the student’s performance
| Needs work | Intermediate | Proficient | Distinguished | |
|---|---|---|---|---|
| Not presenting self professionally (attire, grooming) for interview. | Does not use appropriate professional language during interview. | Does not introduce self but otherwise conducts self professionally. | Introduces self clearly as sole occupational therapist, conveys a sense of self-confidence during interview. | |
| Unable to build rapport during interview | Uses appropriate communication style to interact but needs to be more concise/organized in relaying information or lacks active listening skills. | Uses appropriate communication style to interact but some improvements could be made in relaying information non-verbal cues and/or evidence of active listening. | Manages time for responses, discussion in timeframe given, demonstrates appropriate non-verbal cues, demonstrates active listening | |
| Unable to provide theory, model or FOR to apply to case study. | Limited ability to link theory, models or FORs to case study. | Partially able to link theory, models and FOR to clinical practice | Demonstrates ability to fully link theory, models and FOR to clinical practice | |
| Basic knowledge of the client | Able to elaborate on the client however, difficulty linking the different components | Able to elaborate on the client linking some of the different components | In-depth understanding of client. Holistic, comprehensive of the different aspects | |
| Unable to provide clinical reasoning to support assessment findings, interventions and goals | Demonstrates incomplete clinical reasoning paired with explanation of client assessment findings, interventions and goals | Demonstrates adequate clinical reasoning paired with explanation of client assessment findings, interventions and goals | Demonstrates robust clinical reasoning paired with explanation of client assessment findings, interventions and goals | |
| Unable to articulate need for evidence-based practice | Demonstrates knowledge of evidence-based practice but unable to apply approach to clinical decisions for client | Demonstrates limited knowledge of evidence-based practice related to clinical decisions for client | Demonstrates robust knowledge of evidence-based practice related to clinical decisions for client |
Considering the whole interview, how would you rate the student’s performance?
______/10