| Literature DB >> 20882335 |
M J B Govaerts1, L W T Schuwirth, C P M Van der Vleuten, A M M Muijtjens.
Abstract
Traditional psychometric approaches towards assessment tend to focus exclusively on quantitative properties of assessment outcomes. This may limit more meaningful educational approaches towards workplace-based assessment (WBA). Cognition-based models of WBA argue that assessment outcomes are determined by cognitive processes by raters which are very similar to reasoning, judgment and decision making in professional domains such as medicine. The present study explores cognitive processes that underlie judgment and decision making by raters when observing performance in the clinical workplace. It specifically focuses on how differences in rating experience influence information processing by raters. Verbal protocol analysis was used to investigate how experienced and non-experienced raters select and use observational data to arrive at judgments and decisions about trainees' performance in the clinical workplace. Differences between experienced and non-experienced raters were assessed with respect to time spent on information analysis and representation of trainee performance; performance scores; and information processing--using qualitative-based quantitative analysis of verbal data. Results showed expert-novice differences in time needed for representation of trainee performance, depending on complexity of the rating task. Experts paid more attention to situation-specific cues in the assessment context and they generated (significantly) more interpretations and fewer literal descriptions of observed behaviors. There were no significant differences in rating scores. Overall, our findings seemed to be consistent with other findings on expertise research, supporting theories underlying cognition-based models of assessment in the clinical workplace. Implications for WBA are discussed.Entities:
Mesh:
Year: 2010 PMID: 20882335 PMCID: PMC3068251 DOI: 10.1007/s10459-010-9250-7
Source DB: PubMed Journal: Adv Health Sci Educ Theory Pract ISSN: 1382-4996 Impact factor: 3.853
Fig. 11-Dimensional overall performance rating (R1)
Fig. 26-Dimensional global rating scale clinical competencies (R2)
Verbal protocol coding schemes
| Nature of statement |
| 1. Descriptions: (literal) descriptions of student behaviour (“he is smiling to the patient”; “he asks if this happened before”) |
| 2. Inferences: interpretations and abstractions of performance (“he is an authoritarian doctor”; “he is clearly a young professional”; “it seems that he takes no pleasure in being a doctor”) |
| 3. Evaluations: normative judgments, referring to implicit or explicit standards (“his physical examination skills are very poor”; “overall, his performance is satisfactory”) |
| 4. Contextual cue: remarks referring to case-specific or context-specific cues such as patient characteristics, setting of the patient encounter, context of the assessment task (“this patient is very talkative”; “this looks like a hospital setting, not general practice”; “he is being videotaped”) |
| 5. Self-monitoring: reflective remarks, nuancing (“although I am not sure if I saw this correctly”; “on hindsight I shouldn’t have…” “……. but on the other hand most senior residents do not know how to handle these problems either”); self-instruction and structuring of rating process (“first I am going to look at ….”; “when evaluating performance I always look at atmosphere and balance”); explication of standards and performance theory (“one should always start with open-ended questions”; “from a first-year resident I expect……”) |
| 6. Residual category: repetitions, remarks not directly related to the rating task (e.g. statements related to the experiment; supervisory interventions) |
| Clinical presentation |
| 1. Dermatological problem (DVD 1) |
| 2. Cardiological problem (DVD 2) |
| Verbal protocol |
| VP1: Verbal protocol at T1, initial representation of student behaviour |
| VP2: Verbal protocol at T1, while filling out the one-dimensional rating scale (overall judgment) |
| VP3: Verbal protocol at T2, after viewing DVD; overall judgment of student performance while filling out one-dimensional rating scale |
| VP4: Verbal protocol while filling out 6-dimensional rating scale |
Time needed for problem representation (T1) and performance ratings per DVD, for each group of raters
| Variable | DVD 1 (prototypical, derma case) | DVD 2 (complex, cardio case) | ||
|---|---|---|---|---|
| Experts ( | Non-experts ( | Experts ( | Non-experts ( | |
| T1 (seconds) | 112.0 (121) | 109.5 (237) | 260.0 (308) | 1390 (110) |
| R1T1 (rating at T1) | 2.0 (2)a | 2.0 (2) | 3.0 (1)a | 3.0 (1)a |
| R1T2 (rating at T2, after viewing entire DVD) | 2.0 (1)b | 2.0 (1) | 2.0 (2)b | 2.5 (1)b |
Presented are the median and the inter-quartile range (in parentheses). Experts take significantly (U = 79.00, p = .03, ES = 0.38) more time for monitoring and gathering of information than novices when observing performance on DVD 2 (cardio case). Rating scores are based on a 5-point scale (1 = poor, 5 = outstanding)
Values in the same column (DVD 1 and DVD 2 resp.) with different superscripts differ significantly (Wilcoxon Signed Ranks test, p < .05)
Percentages of statements in verbal protocols for experienced raters (Exp) and non-experienced raters (Non-Exp)
| Variable | Overall (VP1 + VP2 + VP3 + VP4) | VP1 | VP2 | VP3 | VP4 | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Exp | Non-Exp | Exp | Non-Exp | Exp | Non-Exp | Exp | Non-Exp | Exp | Non-Exp | |
|
|
|
|
|
|
|
|
|
|
| |
| Descriptives | 19.8 (13.2) | 25.3 (11.0)a | 19.8 (20.1) | 26.1 (12.8) | 10.8 (18.2) | 10.7 (18,6) | 16.9 (16.3) | 29.4 (13.3)a | 18.2 (16.5) | 20.4 (10.6) |
| Inferences | 19.0 (7.9) | 14.7 (5.1)a | 38.9 (22.9) | 37.5 (21.2) | 14.8 (25.6) | 20.0 (25,1) | 14.5 (9.8) | 6.8 (9.2)a | 13.5 (14.0) | 5.6 (4.6)a |
| Evaluations | 24.4 (10.4) | 24.9 (4.7) | 7.6 (16.0) | 5.4 (10.1) | 33.3 (15.1) | 18.2 (20,0)a | 24.9 (16.3) | 25.9 (12.6) | 35.9 (17.3) | 41.3 (17.3) |
| Contextual cues | 12.9 (7.7) | 10.4 (7.3) | 13.2 (7.7) | 6.1 (14.6)a | 8.1 (13.5) | .0 (13.7) | 18.3 (15.0) | 9.8 (11.1) | 10.0 (9.1) | 8.8 (8.6) |
| Self-monitoring | 20.4 (8.8) | 20.5 (10.7) | 15.2 (10.8) | 16.7 (10.6) | 27.9 (22.7) | 40.4 (35.9) | 20.2 (13.2) | 19.4 (10.4) | 17.2 (14.6) | 13.5 (13.4) |
Presented are the median en inter-quartile range (in parentheses)
VP1 verbal protocol at T1, initial representation of student behavior; VP2 verbal protocol at T1, while filling out the one-dimensional rating scale (overall judgment); VP3 verbal protocol at T2, after viewing entire DVD, overall judgment of student performance while filling out one-dimensional rating scale; VP4 verbal protocol while filling out 6-dimensional rating scale
aIndicates significant differences between experienced and non-experienced raters [Mann–Whitney U test, p < .05]