| Literature DB >> 23899223 |
I C McManus1, Andrew T Elder, Jane Dacre.
Abstract
BACKGROUND: Bias of clinical examiners against some types of candidate, based on characteristics such as sex or ethnicity, would represent a threat to the validity of an examination, since sex or ethnicity are 'construct-irrelevant' characteristics. In this paper we report a novel method for assessing sex and ethnic bias in over 2000 examiners who had taken part in the PACES and nPACES (new PACES) examinations of the MRCP(UK).Entities:
Mesh:
Year: 2013 PMID: 23899223 PMCID: PMC3737060 DOI: 10.1186/1472-6920-13-103
Source DB: PubMed Journal: BMC Med Educ ISSN: 1472-6920 Impact factor: 2.463
The table shows separately for analyses of hawk-dove differences, male–female differences, White-nonWhite differences, and differences between odd and even numbered candidates (see columns), the numbers of examiners who reached statistical significance (rows) on various criteria
| | ||||||||
|---|---|---|---|---|---|---|---|---|
| | | | ||||||
| Negative effect: P < .05 corrected | 34 (1.9%) | 35 (2.3%) | 0 | 0 | 2 (0.1%) | 1 (0.1%) | 0 | 0 |
| Negative effect: P < .05 uncorrected (chance expectation = 2.5%) | 198 (11.1%) | 235 (15.7%) | 73 (4.1%) | 63 (4.2%) | 73 (4.4%) | 48 (3.6%) | 60 (3.2%) | 51 (3.2%) |
| Not significant (uncorrected, p > .05) | 1339 (74.8%) | 989 (66.0%) | 1638 (91.5%) | 1379 (92.1%) | 1491 (90.4%) | 1229 (92.2%) | 1680 (93.9%) | 1396 (93.2%) |
| Positive effect: P < .05 uncorrected (chance expectation = 2.5%) | 192 (10.7%) | 200 (13.4%) | 79 (4.4%) | 55 (3.7%) | 82 (5.0%) | 55 (4.1%) | 50 (3.0%) | 51 (3.6%) |
| Positive effect: P < .05 corrected | 27 (1.5%) | 39 (2.6%) | 0 | 0 | 1 (0.1%) | 0 | 0 | 0 |
Levels of statistical significance are divided into five groups, those who are significant at a Bonferroni corrected level of p < .05 (first and fifth rows), those who are significant at a non-Bonferroni-corrected level of p < .05 (second and fourth rows), and those who are not significant at a non-Bonferroni-corrected level of p < .05 (middle row). ‘Positive’ refers., arbitrarily, to examiners being more hawkish (i.e. giving lower overall scores), giving higher scores to male candidates, giving higher scores to White candidates, or giving higher scores to odd-numbered candidates. By chance alone one would expect 95% of candidates to be in the ‘non-significant’ group, with the remaining 5% of candidates distributed evenly between negative and positive effects.
Figure 1The individual graphs show for PACES diets 1–26 the indices for hawkishness (top left), sex bias (top right), ethnic bias (lower left), and even-number bias (lower right). Each point represents an individual examiner, plotted against the number of candidates examined, and with the significance indicated (grey, NS; orange and green p < .05 uncorrected; red and blue, p < .05 Bonferroni corrected).
Figure 2The individual graphs show for the first six diet of nPACES (27 to 32) the indices for hawkishness (top left), sex bias (top right), ethnic bias (lower left), and even-number bias (lower right). Each point represents an individual examiner, plotted against the number of candidates examined, and with the significance indicated (grey, NS; orange and green p < .05 uncorrected; red and blue, p < .05 Bonferroni corrected).
Figure 3Data from a simulated analysis of a single examiner at each station, based on the data for PACES diets 1–26. Only ethnic bias is considered, for simplicity, with the top graph showing a simple comparison of White with non-white candidates, and the lower graph showing a more complex analysis in which individual marks are compared with a basket of marks awarded by other examiners at other stations. Each point represents an individual examiner, plotted against the number of candidates examined, and with the significance indicated (grey, NS; orange and green p < .05 uncorrected; red and blue, p < .05 Bonferroni corrected).