| Literature DB >> 29065875 |
Peter Yeates1,2, Katherine Woolf3, Emyr Benbow4,5, Ben Davies6, Mairhead Boohan7, Kevin Eva8.
Abstract
BACKGROUND: Asian medical students and doctors receive lower scores on average than their white counterparts in examinations in the UK and internationally (a phenomenon known as "differential attainment"). This could be due to examiner bias or to social, psychological or cultural influences on learning or performance. We investigated whether students' scores or feedback show influence of ethnicity-related bias; whether examiners unconsciously bring to mind (activate) stereotypes when judging Asian students' performance; whether activation depends on the stereotypicality of students' performances; and whether stereotypes influence examiner memories of performances.Entities:
Keywords: Assessment; Differential attainment; Ethnicity; Medical education; Stereotypes
Mesh:
Year: 2017 PMID: 29065875 PMCID: PMC5655938 DOI: 10.1186/s12916-017-0943-0
Source DB: PubMed Journal: BMC Med ISSN: 1741-7015 Impact factor: 8.775
Overview of study design
| Instructions | Performancesa | Lexical decision task | Recollection | Demographics | Debrief | |||
| Group | K+/C– | K–/C+ | Mixed | Mixed | ||||
| Stereotype-consistent | Asian1 | White1 | Asian2 | Asian | ||||
| Stereotype-inconsistent | White1 | Asian1 | White2 | White | ||||
aOrder of performance was counterbalanced within groups (half of each group saw K+/C–, K–/C+, mixed; the other half saw mixed, K–/C+, K+/C–)
K+ good knowledge demonstrated in performance, K– poor knowledge demonstrated
C+ good communication demonstrated in performance, C– poor communication demonstrated
Comparison of participant characteristics between groups
| Characteristic | Group A (viewed stereotype-consistent performances) | Group B (viewed stereotype-inconsistent performances) | Significance |
|---|---|---|---|
| Sex: | Frequency |
| |
| Male | 40 (51%) | 29 (37%) | 0.078 |
| Female | 39 (49%) | 50 (63%) | |
| Clinical speciality: | |||
| Anaesthetics | 7 | 2 | 0.09 (Fisher exact) |
| Diagnostic specialities | 5 | 2 | |
| Hospital medicine | 24 | 31 | |
| Surgery | 10 | 4 | |
| Emergency medicine | 3 | 0 | |
| Child health | 3 | 7 | |
| Women’s health | 2 | 7 | |
| General practice | 15 | 17 | |
| Psychiatry | 4 | 3 | |
| Non-hospital medical specialities | 1 | 0 | |
| Public health | 0 | 1 | |
| Other | 6 | 5 | |
| Medical school: | |||
| A | 5 | 13 | 0.15 |
| B | 20 | 21 | |
| C | 9 | 4 | |
| D | 10 | 12 | |
| Others | 34 | 26 | |
| Participant ethnicity: | |||
| White | 61 | 66 | 0.11 (Fisher exact) |
| Indian subcontinent | 11 | 7 | |
| Other minority ethnicity | 6 | 1 | |
| Prefer not to say | 2 | 5 | |
| Median |
| ||
| Year of qualification | 1995 | 1992 | 0.75 |
| Years examining OSCEs | 5 | 5 | 0.47 |
| OSCEs/year | 2 | 2 | 0.79 |
Comparison of scores (knowledge, communication, overall scores) and feedback (focus and valence) by performance and group
| Performance | Group A (viewed stereotype-consistent performances) | Group B (viewed stereotype-inconsistent performances) |
|---|---|---|
| Performance 1: K+/C– | Asian student | White student |
| Scores | ||
| Knowledge | 5.6 (5.3–5.9) | 5.6 (5.4–5.9) |
| Communication | 2.4 (2.2–2.6) | 2.5 (2.3–2.7) |
| Overall | 3.2 (3.0–3.5) | 3.1 (2.8–3.4) |
| Feedback | ||
| Focus | 2.7 (2.5–3.0) | 2.7 (2.4–2.9) |
| Valence | –2.6 (–2.9 to –2.4) | –2.6 (–2.9 to –2.4) |
| Performance 2: K–/C+ | White student | Asian student |
| Scores | ||
| Knowledge | 3.0 (2.8–3.3) | 3.0 (2.8–3.2) |
| Communication | 5.6 (5.5–5.8) | 5.7 (5.6–5.9) |
| Overall | 3.4 (3.1–3.6) | 3.3 (3.1–3.5) |
| Feedback | ||
| Focus | –0.3 (–0.5 to 0.0) | –0.4 (–0.7 to 0.0) |
| Valence | –1.7 (–2.0 to –1.4) | –1.8 (–2.1 to –1.5) |
| Performance 3: mixed | Asian student | White student |
| Scores | ||
| Knowledge | 3.1 (2.9–3.3) | 3.1 (2.9–3.3) |
| Communication | 3.5 (3.3–3.7) | 3.6 (3.4–3.8) |
| Overall | 2.9 (2.7–3.0) | 2.8 (2.6–3.0) |
| Feedback | ||
| Focus | 1.6 (1.3–1.9) | 1.5 (1.2–1.9) |
| Valence | –2.8 (–3.1 to –2.6) | –3.0 (–3.3 to –2.7) |
All comparisons are non-significant
Proportions of each statement type marked as true, by group, within the recognition test of memory
| Proportion of statements marked “True” (95% CIs) | ||
|---|---|---|
| Statement type: | Group A (recalling Asian student) | Group B (recalling white student) |
| Real statements | ||
| Stereotype-consistent | 69% (65–73%) | 68% (64–71%) |
|
|
|
|
| Invented statements | ||
|
|
|
|
| Stereotype-inconsistent | 34% (31–36%) | 37% (34–40%) |
Theorised comparisons are highlighted in boldface and are non-significant