| Literature DB >> 36199083 |
Patrick J McGown1, Celia A Brown2, Ann Sebastian1, Ricardo Le2, Anjali Amin1, Andrew Greenland1, Amir H Sam3.
Abstract
BACKGROUND: Standard setting for clinical examinations typically uses the borderline regression method to set the pass mark. An assumption made in using this method is that there are equal intervals between global ratings (GR) (e.g. Fail, Borderline Pass, Clear Pass, Good and Excellent). However, this assumption has never been tested in the medical literature to the best of our knowledge. We examine if the assumption of equal intervals between GR is met, and the potential implications for student outcomes.Entities:
Keywords: Assessment; Borderline pass; Borderline regression; Clinical examination; Pass mark; Standard setting
Mesh:
Year: 2022 PMID: 36199083 PMCID: PMC9536020 DOI: 10.1186/s12909-022-03753-5
Source DB: PubMed Journal: BMC Med Educ ISSN: 1472-6920 Impact factor: 3.263
Fig. 1Example of global ratings. Legend: Example of Global Ratings scale used at Imperial College School of Medicine
Fig. 2Pass mark calculation by borderline regression
Examiner perceptions of GR locations across the performance spectrum
| Global rating | Examiner Mean | 95% Confidence interval of the mean | Standard deviation |
|---|---|---|---|
| Borderline Pass | 0.33 | 0.30–0.35 | 0.15 |
| Clear Pass | 0.55 | 0.53–0.57 | 0.11 |
| Good | 0.77 | 0.76–0.79 | 0.09 |
Legend: Data from 117 examiners. Scale defined as typical ‘Fail’ candidate at 0.00 and typical ‘Excellent’ candidate at 1.00
Examiner perceptions of intervals between global scoring domains
| GR interval | Mean interval | 95% Confidence interval of the GR interval | T-statistic | |
|---|---|---|---|---|
| Fail to Borderline Pass | 0.33 | 0.30–0.35 | 5.84 | < 0.001 |
| Borderline Pass to Clear Pass | 0.22 | 0.20–0.23 | 4.33 | < 0.001 |
| Clear Pass to Good | 0.23 | 0.21–0.24 | 3.50 | < 0.001 |
| Good to Excellent | 0.23 | 0.21–0.24 | 2.57 | 0.006 |
Legend: Data from 117 examiners. Null hypothesis that each interval between GRs = 0.25
Fig. 3Example of recalculated regression line using adjusted global ratings
Effect of adjusted regression calculation on examination and student outcomes
| Change in exam pass mark (percentage points) | Change in exam-level fails | Stations with increased pass mark | Stations with increased integer pass mark | Change in station fails/total number of student-station interactions | Change in exam-level fails | |
|---|---|---|---|---|---|---|
| Exam 1 | + 0.9 | 0/310 | 36/36 | 7/36 | + 16/3720 | 0/310 |
| Exam 2 | + 0.7 | 0/289 | 10/10 | 2/10 | + 29/2890 | 0/289 |
| Exam 3 | + 1.2 | 0/287 | 24/24 | 7/24 | + 12/2296 | 0/287 |
| Exam 4 | + 1.2 | + 1/305 | 24/24 | 11/24 | + 35/2440 | 0/305 |
| Overall | + 1.0 | + 1/1191 | 94/94 | 37/94 | + 92/11346 | 0/1191 |
Legend: Data from four summative finals examinations at the institution studied
Effects of standard and adjusted regression calculations on fail rates
| Station fails using standard regression calculation | Fail rate (standard calculation) | Station fails using adjusted regression calculation | Fail rate (adjusted calculation) | z-statistic | ||
|---|---|---|---|---|---|---|
| Exam 1 | 198/3720 | 0.05 | 214/3720 | 0.06 | 1.17 | 0.243 |
| Exam 2 | 118/2890 | 0.04 | 147/2890 | 0.05 | 2.73 | 0.006 |
| Exam 3 | 167/2296 | 0.07 | 179/2296 | 0.08 | 0.96 | 0.335 |
| Exam 4 | 206/2440 | 0.08 | 241/2440 | 0.10 | 2.44 | 0.014 |
| Overall | 689/11346 | 0.06 | 781/11346 | 0.07 | 3.62 | < 0.001 |
Legend: Data from four summative finals examinations at the institution studied. Null hypothesis that station fail rate using adjusted regression calculation = station fail rate using standard calculation (two-tailed test)
Fig. 4Overall student scores in relation to pass mark and average mark
Fig. 5Increased pass mark with adjusted regression calculation but no integer change
Fig. 6Change in integer pass mark using adjusted regression calculation versus standard
Fig. 7Absolute change in exam-level fail rates by station passing requirements (adjusted regression calculation versus standard). Legend: Change in exam-level fail rates per 100 students (absolute rate change in percentage points) by station passing requirements at different thresholds, when using adjusted borderline regression calculations versus standard calculations. Data from Exam 1 (12 stations), Exam 2 (10 stations), Exam 3 (8 stations) and Exam 4 (10 stations). Datapoints shown for station passing thresholds equate to ≥4/8, ≥5/10 or ≥ 6/12 stations (≥50%), ≥7/12 stations (≥58.3%), ≥6/10 stations (≥60%), ≥5/8 stations (≥62.5%), ≥8/12 stations (≥66.6%), ≥7/10 stations (≥70%) and ≥ 6/8 or ≥ 9/12 stations (≥75%)