| Literature DB >> 21106066 |
Anja Rogausch1, Rainer Hofer, René Krebs.
Abstract
BACKGROUND: Many medical exams use 5 options for multiple choice questions (MCQs), although the literature suggests that 3 options are optimal. Previous studies on this topic have often been based on non-medical examinations, so we sought to analyse rarely selected, 'non-functional' distractors (NF-D) in high stakes medical examinations, and their detection by item authors as well as psychometric changes resulting from a reduction in the number of options.Entities:
Mesh:
Year: 2010 PMID: 21106066 PMCID: PMC3004925 DOI: 10.1186/1472-6920-10-85
Source DB: PubMed Journal: BMC Med Educ ISSN: 1472-6920 Impact factor: 2.463
Number of candidates and items per discipline included in the analysis
| Subject | Number of | 2005 | 2006 | 2007 | Total |
|---|---|---|---|---|---|
| Internal medicine/pharmacotherapy | Items | 55 | 50 | 45 | 150 |
| Candidates | 617 | 627 | 644 | ||
| Surgery | Items | 54 | 49 | 63 | 166 |
| Candidates | 618 | 624 | 649 | ||
| Gynaecology and paediatrics | Items | 43 | 36 | 44 | 123 |
| Candidates | 615 | 629 | 646 | ||
| Dermatology, ophthalmology, otorhinolaryngology | Items | 57 | 64 | 59 | 180 |
| Candidates | 611 | 632 | 654 | ||
| Social and preventive Medicine | Items | 41 | 38 | 39 | 118 |
| Candidates | 606 | 645 | 648 | ||
Figure 1Percentages of distractors with different selection rates (i.e. selected by specific proportions of candidates).
Figure 2The delta-median of the functional (grey bars) versus non-functional (black bars) distractors (F-D and NF-D).
Parameter changes related to different numbers of distractors (statistical approach)
| Specialty | Model | % correct (P-value)* | Discrimination* | Reliability* | Standard Reliability** |
|---|---|---|---|---|---|
| Internal Medicine, Pharmacology | |||||
| A) 3 distractors, random | 79.20 | 0.19 | 0.73 | 0.83 | |
| B) 3 distractors, right answer | 80.11 | 0.18 | 0.72 | 0.82 | |
| C) 2 distractors, random | 80.20 | 0.18 | 0.71 | 0.82 | |
| D) 2 distractors, right answer | 82.76 | 0.14 | 0.62 | 0.75 | |
| Surgery (54 items) | |||||
| A) 3 distractors, random | 77.91 | 0.13 | 0.59 | 0.73 | |
| B) 3 distractors, right answer | 78.44 | 0.13 | 0.58 | 0.72 | |
| C) 2 distractors, random | 78.78 | 0.13 | 0.58 | 0.72 | |
| D) 2 distractors, right answer | 80.75 | 0.11 | 0.54 | 0.68 | |
| Social and preventive medicine | |||||
| A) 3 distractors, random | 76.70 | 0.11 | 0.50 | 0.70 | |
| B) 3 distractors, right answer | 77.62 | 0.10 | 0.46 | 0.68 | |
| C) 2 distractors, random | 77.61 | 0.10 | 0.45 | 0.67 | |
| D) 2 distractors, right answer | 80.20 | 0.07 | 0.36 | 0.58 | |
| Pediatrics and gynaecology | |||||
| A) 3 distractors, random | 76.05 | 0.13 | 0.56 | 0.74 | |
| B) 3 distractors, right answer | 76.99 | 0.12 | 0.52 | 0.72 | |
| C) 2 distractors, random | 76.97 | 0.12 | 0.54 | 0.73 | |
| D) 2 distractors, right answer | 79.38 | 0.10 | 0.47 | 0.67 | |
| Dermatology, ophthalmology, otorhinolaryngology (57 items) | |||||
| A) 3 distractors, random | 79.92 | 0.21 | 0.76 | 0.85 | |
| B) 3 distractors, right answer | 80.64 | 0.20 | 0.75 | 0.84 | |
| C) 2 distractors, random | 80.79 | 0.20 | 0.74 | 0.83 | |
| D) 2 distractors, right answer | 82.91 | 0.16 | 0.68 | 0.78 | |
* These indicators related to the sample of positive A-type questions only, thus they differ from the respective indicators relating to the complete examination including negative best answer and extended matching questions.
** Reliability standardized for 100 items
Parameter changes related to different numbers of distractors (expert recommendation)
| Specialty | Model | % correct (P-value)* | Discrimination* | Reliability* | Standard. Reliability** |
|---|---|---|---|---|---|
| Internal Medicine, Pharmacology | |||||
| A) 3 distractors, | 79.49 | 0.19 | 0.73 | 0.83 | |
| B) 3 distractors, right answer | 81.21 | 0.16 | 0.66 | 0.78 | |
| C) 2 distractors, | 81.08 | 0.17 | 0.69 | 0.80 | |
| D) 2 distractors, right answer | 85.07 | 0.11 | 0.53 | 0.67 | |
* These indicators related to the sample of positive A-type questions only, thus they differ from the respective indicators relating to the complete examination including negative best answer and extended matching questions.
** Reliability standardised for 100 items