| Literature DB >> 26520472 |
Abstract
OBJECTIVES: The principal aim of this study is to provide an account of variation in UK undergraduate medical assessment styles and corresponding standard setting approaches with a view to highlighting the importance of a UK national licensing exam in recognizing a common standard. <br> METHODS: Using a secure online survey system, response data were collected during the period 13 - 30 January 2014 from selected specialists in medical education assessment, who served as representatives for their respective medical schools. <br> RESULTS: Assessment styles and corresponding choices of standard setting methods vary markedly across UK medical schools. While there is considerable consensus on the application of compensatory approaches, individual schools display their own nuances through use of hybrid assessment and standard setting styles, uptake of less popular standard setting techniques and divided views on norm referencing. <br> CONCLUSIONS: The extent of variation in assessment and standard setting practices across UK medical schools validates the concern that there is a lack of evidence that UK medical students achieve a common standard on graduation. A national licensing exam is therefore a viable option for benchmarking the performance of all UK undergraduate medical students.Entities:
Keywords: assessment; benchmarking; national licensing exam; standard setting; undergraduate medical education
Mesh:
Year: 2015 PMID: 26520472 PMCID: PMC4640898 DOI: 10.5116/ijme.560e.c964
Source DB: PubMed Journal: Int J Med Educ ISSN: 2042-6372
Assessment styles for Finals ordered according to popularity (n = 26)
| Assessment style* | Frequency (%) | 95% CI** |
|---|---|---|
| MCQ | 24 (92.3) | (86.5, 95.7) |
| OSCE | 25 (96.2) | (91.4, 98.3) |
| SAQ | 9 (34.6) | (27.1, 43.0) |
| MiniCeX | 3 (11.5) | (7.2, 18.0) |
| OSLER | 3 (11.5) | (7.2, 18.0) |
| Essay | 2 (7.7) | (4.3, 13.5) |
| Portfolio | 2 (7.7) | (4.3, 13.5) |
| Portfolio viva | 2 (7.7) | (4.3, 13.5) |
| DOPS | 1 (3.8) | (1.7, 8.6) |
| Long case | 1 (3.8) | (1.7, 8.6) |
| Ward simulation exercise† | 1 (3.8) | (1.7, 8.6) |
*Abbreviations in row headers are defined in the methods section of this paper under 'Use of terminology'. **95% CIs were calculated using the Wilson score method11 with a correction for finite populations.12 †This category was volunteered by a respondent under ‘Other’ and hence added retrospectively to the response options in the presentation of findings.
Choice of standard setting method(s) according to assessment style for Finals*
| Standard setting method | MCQ | OSCE | SAQ | MiniCeX | OSLER | Essay | Portfolio | Portfolio viva | DOPS | Long case | Ward simulation exercise | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Frequency (%) | ||||||||||||
| Anchor statements with common marking scheme | 1 (3.8) | 2 (7.4) | - | 2 (66.7) | 1 (33.3) | 1 (50) | 1 (50) | - | 1 (100) | - | - | 9 (11.7) |
| Anchor statements with common marking scheme and fixed pass mark | - | - | - | 1 (33.3) | - | - | - | 1 (50) | - | - | - | 2 (2.6) |
| Angoff | 15 (57.7) | 2 (7.4) | 7 (77.8) | - | - | 1 (50) | - | - | - | - | - | 25 (32.5) |
| Angoff and Hofstee | 2 (7.7) | - | - | - | - | - | - | - | - | - | - | 2 (2.6) |
| Borderline group† | - | 9 (33.3) | - | - | - | - | - | 1 (50) | - | - | - | 10 (13.0) |
| Borderline group and Hofstee | - | 1 (3.7) | - | - | - | - | - | - | - | - | - | 1 (1.3) |
| Borderline regression | - | 12 (44.4) | - | - | 1 (33.3) | - | - | - | - | 1 (100) | - | 14 (18.2) |
| Contrasting-groups | - | - | - | - | - | - | 1 (50) | - | - | - | - | 1 (1.3) |
| Ebel | 5 (19.2) | 1 (3.7) | 1 (11.1) | - | - | - | - | - | - | - | - | 7 (9.1) |
| Ebel and Rasch analysis‡ | 1 (3.8) | - | - | - | - | - | - | - | - | - | - | 1 (1.3) |
| Fixed pass mark | 1 (3.8) | - | - | - | 1 (33.3) | - | - | - | - | - | - | 2 (2.6) |
| Hofstee | 1 (3.8) | - | 1 (11.1) | - | - | - | - | - | - | - | - | 2 (2.6) |
| Rasch analysis | - | - | - | - | - | - | - | - | - | - | 1 (100) | 1 (1.3) |
| Total | 26 (100) | 27 (100) | 9 (100) | 3 (100) | 3 (100) | 2 (100) | 2 (100) | 2 (100) | 1 (100) | 1 (100) | 1 (100) | 77 (100) |
*Abbreviations in column headers are defined in the methods section of this paper under 'Use of terminology'. The symbol ‘-’ is used to denote all instances where the corresponding combination of standard setting technique and assessment style was not selected by any respondent as pertaining to Finals at their medical school. †In conjunction with the OSCE, one school specified use of the median (rather than the mean) in deriving the pass mark. ‡Here, Rasch analysis was presented by the respondent school as being used to monitor or moderate marks derived by the Ebel method.
Assessment styles for non-Finals in clinical years and for pre-clinical years
| Assessment style* | Non-Finals in clinical years (n=25) | Pre-clinical years (n=22) | ||
|---|---|---|---|---|
| Frequency (%) | 95% CI** | Frequency (%) | 95%CI** | |
| MCQ | 25 (100) | (96.6, 100) | 22 (100) | (94.0, 100) |
| OSCE | 25 (100) | (96.6, 100) | 21 (95.5) | (87.0, 98.5) |
| SSC | 18 (72.0) | (63.0, 79.5) | 13 (59.1) | (46.5, 70.6) |
| SAQ | 15 (60.0) | (50.7, 68.6) | 15 (68.2) | (55.7, 78.5) |
| Portfolio | 8 (32.0) | (24.0, 41.2) | 5 (22.7) | (14.0, 34.7) |
| Essay | 6 (24.0) | (17.0, 32.7) | 8 (36.4) | (25.4, 49.0) |
| OSPE/Anatomy spot exam | 6 (24.0) | (17.0, 32.7) | 8 (36.4) | (25.4, 49.0) |
| MiniCeX | 5 (20.0) | (13.6, 28.4) | 1 (4.5) | (1.5, 13.0) |
| Oral presentations | 5 (20.0) | (13.6, 28.4) | 2 (9.1) | (4.1, 19.0) |
| In vitro clinical competencies† | 2 (8.0) | (4.2, 14.6) | 1 (4.5) | (1.5, 13.0) |
| OSLER | 2 (8.0) | (4.2, 14.6) | 0 (0.0) | (0, 6.0) |
| Viva (other) | 1 (4.0) | (1.6, 9.4) | 0 (0.0) | (0, 6.0) |
*Abbreviations in row headers are defined in the methods section of this paper under 'Use of terminology'. **95% CIs were calculated using the Wilson score method11 with a correction for finite populations12. †This category was volunteered by a respondent under ‘Other’ and hence added retrospectively to the response options in the presentation of findings.
Choice of standard setting method(s) according to assessment style for clinical non-Finals*
| Standard setting method | MCQ | OSCE | SAQ | MiniCeX | OSLER | Essay | Portfolio | DOPS | Long case | OSPE/Anatomyspot exam | Oral presentations | SSC | Viva(Other) | Other | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Frequency (%)† | |||||||||||||||
| Anchor statements with common marking scheme (CMS) | 1 (4.5) | - | 2 (13.3) | 5 (100) | - | 2 (33.3) | 7 (87.5) | 1 (100) | 1 (100) | 1 (20) | 5 (100) | 11 (64.7) | 1 (100) | 2 (40) | 39 (32.5) |
| Anchor statements with CMS and Fixed pass mark | 1 (4.5) | 2 (7.4) | - | - | - | 1 (16.7) | - | - | - | - | - | 3 (17.6) | - | 2 (40) | 9 (7.5) |
| Angoff | 16 (72.7) | 3 (11.1) | 12 (80) | - | - | 1 (16.7) | - | - | - | 3 (60) | - | 1 (5.9) | - | - | 36 (30) |
| Borderline group | - | 7 (25.9) | - | - | - | - | - | - | - | - | - | - | - | - | 7 (5.8) |
| Borderline group and Hofstee‡ | - | 1 (3.7) | - | - | - | - | - | - | - | - | - | - | - | - | 1 (0.8) |
| Borderline regression | 1 (4.5) | 14 (51.9) | - | - | 1 (50) | - | - | - | - | - | - | 1 (5.9) | - | - | 17 (14.2) |
| Contrasting-groups | - | - | - | - | - | - | 1 (12.5) | - | - | - | - | - | - | - | 1 (0.8) |
| Ebel | - | - | - | - | - | - | - | - | - | 1 (20) | - | - | - | - | 1 (0.8) |
| Fixed pass mark | 1 (4.5) | - | - | - | 1 (50) | 2 (33.3) | - | - | - | - | - | 1 (5.9) | - | - | 5 (4.2) |
| Hofstee | 2 (9.1) | - | 1 (6.7) | - | - | - | - | - | - | - | - | - | - | 1 (20) | 4 (3.3) |
| Total | 22 (100) | 27 (100) | 15 (100) | 5 (100) | 2 (100) | 6 (100) | 8 (100) | 1 (100) | 1 (100) | 5 (100) | 5 (100) | 17 (100) | 1 (100) | 5 (100) | 120 |
*Abbreviations in column headers are defined in the methods section of this paper under 'Use of terminology'. The symbol ‘-’ is used to denote all instances where the corresponding combination of standard setting technique and assessment style was not selected by any respondent as pertaining to clinical non-Finals at their medical school. †Frequencies and percentages pertain to instances of use of the given standard setting approach for the listed assessment type and as such, may include multiple instances for a given medical school. ‡Here, it was reported that the two standard setting methods were applied separately for the first of two phases of the assessment and the higher of the resultant two pass marks assigned to this first phase of the assessment. Interestingly, students who did not pass this first phase of assessment would in turn require to sit a second phase for which the pass mark was determined using an Angoff approach.
Choice of standard setting method(s) according to assessment style for pre-clinical years*
| Standard setting method | MCQ | OSCE | SAQ | MiniCeX | OSLER | Essay | Portfolio | Portfolio viva | DOPS | Long case | Ward simulation exercise | OSPE/ Anatomy spot exam | SSC | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Frequency (%)† | ||||||||||||||
| Anchor Statements with common marking scheme (CMS) | 1 (3.3) | 2 (6.9) | 1 (7.7) | 2 (66.7) | 1 (33.3) | 1 (20) | 1 (100) | - | 1 (100) | - | - | 2 (40) | 1 (100) | 13 (13.7) |
| Anchor statements with CMS and fixed pass mark | - | - | - | 1 (33.3) | - | - | - | 1 (50) | - | - | - | - | - | 2 (2.1) |
| Angoff | 16 (53.3) | 2 (6.9) | 8 (61.5) | - | - | 1 (20) | - | - | - | - | - | 1 (20) | - | 28 (29.5) |
| Angoff and Hofstee | 2 (6.7) | - | - | - | - | - | - | - | - | - | - | - | - | 2 (2.1) |
| Borderline group | - | 11 (37.9) | - | - | - | - | - | 1 (50) | - | - | - | - | - | 12 (12.6) |
| Borderline group and Hofstee | - | 1 (3.4) | - | - | - | - | - | - | - | - | - | - | - | 1 (1.1) |
| Borderline regression | - | 12 (41.4) | - | - | 1 (33.3) | - | - | - | - | 1 (100) | - | - | - | 14 (14.7) |
| Ebel | 5 (16.7) | 1 (3.4) | 1 (7.7) | - | - | - | - | - | - | - | - | - | - | 7 (7.4) |
| Ebel and Rasch analysis | 1 (3.3) | - | - | - | - | - | - | - | - | - | - | - | - | 1 (1.1) |
| Fixed pass mark | 3 (10.0) | - | 2 (15.4) | - | 1 (33.3) | 3 (60) | - | - | - | - | - | - | - | 9 (9.5) |
| Hofstee | 2 (6.7) | - | 1 (7.7) | - | - | - | - | - | - | - | - | 2 (40) | - | 5 (5.3) |
| Rasch analysis | - | - | - | - | - | - | - | - | - | - | 1 (100) | - | - | 1 (1.1) |
| Total | 30 (100) | 29 (100) | 13 (100) | 3 (100) | 3 (100) | 5 (100) | 1 (100) | 2 (100) | 1 (100) | 1 (100) | 1 (100) | 5 (100) | 1 (100) | 95 (100) |
*Abbreviations in column headers are defined in the methods section of this paper under 'Use of terminology'. The symbol ‘-’ is used to denote all instances where the corresponding combination of standard setting technique and assessment style was not selected by any respondent as pertaining to pre-clinical years at their medical school. †Frequencies and percentages pertain to instances of use of the given standard setting approach for the listed assessment type and as such, may include multiple instances for a given medical school.