| Literature DB >> 35039023 |
Peter Yeates1,2, Gareth McCray3, Alice Moult3, Natalie Cope3, Richard Fuller4, Robert McKinley3.
Abstract
BACKGROUND: Ensuring equivalence of examiners' judgements across different groups of examiners is a priority for large scale performance assessments in clinical education, both to enhance fairness and reassure the public. This study extends insight into an innovation called Video-based Examiner Score Comparison and Adjustment (VESCA) which uses video scoring to link otherwise unlinked groups of examiners. This linkage enables comparison of the influence of different examiner-groups within a common frame of reference and provision of adjusted "fair" scores to students. Whilst this innovation promises substantial benefit to quality assurance of distributed Objective Structured Clinical Exams (OSCEs), questions remain about how the resulting score adjustments might be influenced by the specific parameters used to operationalise VESCA. Research questions, How similar are estimates of students' score adjustments when the model is run with either: fewer comparison videos per participating examiner?; reduced numbers of participating examiners?Entities:
Keywords: Assessment; Assessor variability; Distributed assessments; Many Facet Rasch Modelling; Objective Structured Clinical Exams; Psychometrics; Test Equating
Mesh:
Year: 2022 PMID: 35039023 PMCID: PMC8764767 DOI: 10.1186/s12909-022-03115-1
Source DB: PubMed Journal: BMC Med Educ ISSN: 1472-6920 Impact factor: 2.463
Fig. 1Illustration of manipulation of linking process. This figure Illustrates the procedures used to create fewer linking videos per participating examiner (top) and fewer participating examiners (bottom) by deleting videos from the original dataset
Correlations between score adjustments derived from all linking videos (ABCD) and score adjustments derived from different permutations of fewer linking videos
| Correlation Coefficient (rho) | ||
|---|---|---|
| ABCD | 1 | - |
| 3 linking videos per participating examiner | ||
| ABC | 0.91 | 0.000 |
| ABD | 0.97 | 0.000 |
| ACD | 0.87 | 0.000 |
| BCD | 0.95 | 0.000 |
| Median | 0.93 | IQRs (0.90–0.95) |
| 2 linking videos per participating examiner | ||
| AB | 0.94 | 0.000 |
| AC | 0.80 | 0.000 |
| AD | 0.88 | 0.000 |
| BC | 0.86 | 0.000 |
| BD | 0.84 | 0.000 |
| CD | 0.65 | 0.000 |
| Median | 0.85 | IQRs (0.81–0.87) |
| 1 linking video per participating examiner | ||
| A | 0.91 | 0.000 |
| B | 0.49 | 0.000 |
| C | 0.36 | 0.000 |
| D | 0.55 | 0.000 |
| Median | 0.52 | IQRs (0.46–64) |
Spearman’s correlations (with associated p values) between score adjustments derived from different pairs of overlapping (white background) or non-overlapping (grey background) linking videos
| AB | AC | AD | BC | BD | CD | |
|---|---|---|---|---|---|---|
| AB | 1 | 0.846 | 0.922 | 0.789 | 0.774 | 0.579 |
| < 0.001 | < 0.001 | < 0.001 | < 0.001 | < 0.001 | ||
| AC | 1 | 0.778 | 0.73 | 0.412 | 0.649 | |
| < 0.001 | < 0.001 | < 0.001 | < 0.001 | |||
| AD | 1 | 0.621 | 0.762 | 0.625 | ||
| < 0.001 | < 0.001 | < 0.001 | ||||
| BC | 1 | 0.627 | 0.425 | |||
| < 0.001 | < 0.001 | |||||
| BD | 1 | 0.535 | ||||
| < 0.001 | ||||||
| CD | 1 |
Correlations between score adjustments derived from using linking scores provided by all participating (76%) of examiners and score adjustments derived from linking scores provided by different permutations of fewer participating examiners
| Correlation Coefficient | ||
|---|---|---|
| All participating examiners (76%) | 1.000 | - |
| Score adjustments derived from 70% examiner participation | ||
| Combination A | 0.99 | 0.000 |
| Combination B | 0.95 | 0.000 |
| Combination C | 0.86 | 0.000 |
| Combination D | 0.98 | 0.000 |
| Combination E | 0.97 | 0.000 |
| Median | 0.97 | IQR (0.91–0.99) |
| Score adjustments derived from 60% examiner participation | ||
| Combination A | 0.94 | 0.000 |
| Combination B | 0.98 | 0.000 |
| Combination C | 0.92 | 0.000 |
| Combination D | 0.95 | 0.000 |
| Combination E | 0.98 | 0.000 |
| Median | 0.95 | IQR (0.93–0.98) |
| Score adjustments derived from 50% examiner participation | ||
| Combination A | 0.65 | 0.000 |
| Combination B | 0.78 | 0.000 |
| Combination C | 0.83 | 0.000 |
| Combination D | 0.29 | 0.002 |
| Combination E | 0.93 | 0.000 |
| Median | 0.78 | IQR (0.47–0.88) |