| Literature DB >> 30404624 |
Nicolette Fozzard1, Andrew Pearson1, Eugene du Toit1, Helen Naug1, William Wen2, Ian R Peak3.
Abstract
BACKGROUND: Multiple choice questions are commonly used in summative assessment. It is still common practice for tertiary institutions and accrediting bodies to use five-option single best answer multiple choice questions, despite a substantial body of evidence showing that multiple choice questions with only three or four options provide effective and discriminatory assessment.Entities:
Keywords: Difficulty factor; Discrimination index; Distractors; Multiple choice question; Reduce options
Mesh:
Year: 2018 PMID: 30404624 PMCID: PMC6223017 DOI: 10.1186/s12909-018-1346-4
Source DB: PubMed Journal: BMC Med Educ ISSN: 1472-6920 Impact factor: 2.463
Fig. 1Comparison of student entry scores. Entrants’ high school academic scores (OP) in Chem A (a), Chem B (b), A&P A (c), and A&P B (d). First and second offerings are indicated by black and grey bars respectively
Demographic data of students
| Chem A | Chem B | A&P A | A&P B | |||||
|---|---|---|---|---|---|---|---|---|
| 1st offering | 2nd offering | 1st offering | 2nd offering | 1st offering | 2nd offering | 1st offering | 2nd offering | |
| Number of students | 741 | 767 | 512 | 551 | 736 | 770 | 581 | 614 |
| % Female | 53 | 52 | 54 | 52 | 53 | 53 | 54 | 55 |
| Ave age (start of year) | 20.1 | 19.6 | 19.4 | 19.9 | 19.7 | 19.3 | 19.5 | 19.9 |
| % age 20+ | 27 | 25 | 25 | 28 | 25 | 23 | 25 | 27 |
| % ESLa | 22 | 22 | 21 | 24 | 23 | 22 | 21 | 26 |
| Avg OPb | 7.82 | 7.75 | 6.86 | 7.08 | 7.86 | 7.81 | 7.11 | 6.92 |
| % SESc | 15/65/11 | 14/63/14 | 15/59/11 | 16/65/11 | 15/66/11 | 13/65/13 | 14/60/12 | 17/64/11 |
| % ATSId | 2 | 2 | 1 | 2 | 2 | 2 | 1 | 2 |
aEnglish as a Second Language
bOP is school leaver qualification (scale of 1 to 25, with 1 being the highest achievement)
cSocioeconomic status; may not add up to 100% due to rounding and/or unavailable data
dStudents self-identifying as Aboriginal and Torres Strait Islander
Summary of final exam MCQ assessment
| Course | Weighting assigned to final exam | # MCQs in final exam | % of final exam assessed by MCQ |
|---|---|---|---|
| Chem A | 55% | 25 | 50 |
| Chem B | 55% | 50 | 50 |
| A&P A | 50% | 60 | 44 |
| A&P B | 50% | 60 | 50 |
Percentage of questions with specified number of effective distractors at first offering
| Coursea | Number of Effective Distractors (> 5% of respondents) | ||||
|---|---|---|---|---|---|
| Four | Three | Two | One | None | |
| Chem A ( | 4% | 32% | 24% | 24% | 16% |
| Chem B ( | 22% | 32% | 20% | 18% | 8% |
| A&P A ( | 13% | 42% | 33% | 8% | 3% |
| A&P B ( | 28% | 23% | 27% | 17% | 5% |
an denotes the total number of MCQs used in the final exam and is the denominator for percentages shown
Percentages may not add up to 100% due to rounding
Overall score in MCQ in the two offerings
| Coursea | No. of students sitting exam | Resultsb | |||
|---|---|---|---|---|---|
| 1st offering | 2nd offering | 1st offering | 2nd offering | ||
| Chem A | 741 | 767 | 19.0 ± 4.37 | 18.2 ± 4.47 | <.001 |
| Chem B | 512 | 551 | 33.8 ± 9.11 | 33.3 ± 8.25 | .336 |
| A&P A | 736 | 770 | 40.1 ± 10.77 | 41.4 ± 9.30 | .011 |
| A&P B | 581 | 614 | 39.2 ± 10.00 | 39.6 ± 10.13 | .538 |
an = total number of MCQs used in the final exam and is the denominator for percentages shown. In 1st offering, all questions are five-option, in second offering all questions are four-option
bMean mark ±1 SD; average % mark given in parentheses
Fig. 2Distribution of marks achieved in questions that were kept the same between sequential academic years. First offering had 4 distractors, second offering had 3 distractors. Mark distributions were similar across all courses. Panels: (a) Chem A; (b) Chem B; (c) A&P A; (d) A&P B
Comparison of MCQ marks for re-used questions
| Coursea | No. of students sitting exam | Resultsb | |||
|---|---|---|---|---|---|
| 1st offering | 2nd offering | 1st offering | 2nd offering | ||
| Chem A | 741 | 767 | 11.8 ± 2.6 (78.4%) | 11.6 ± 2.6 (76.6%) | .188 |
| Chem B | 512 | 551 | 17.8 ± 5.2 (65.8%) | 17.5 ± 5.3 (64.6%) | .333 |
| A&P A | 736 | 770 | 13.9 ± 3.7 (69.7%) | 14.5 ± 3.5 (72.2%)% | .007 |
| A&P B | 581 | 614 | 14.1 ± 3.8 (67.1%) | 13.7 ± 3.6 (65.4%) | .116 |
a(number of questions repeated in both offerings/total number of MCQ in the exam)
bMean mark ±1 SD; average % mark on these re-used questions given in parentheses
Question difficulty and discrimination with four and three distractors
| Course | First offering four distractors | Second offering three distractors | Average Difference⋄ | |||
|---|---|---|---|---|---|---|
| DFb | DIb | DFb | DIb | DF | DI | |
| Chem A | .78 ± .14 | .41 ± .16 | .77 ± .14 | .41 ± .18 | −.012 | .01 |
| Chem B | .66 ± .17 | .49 ± .17 | .65 ± .16 | .49 ± .14 | −.012 | −.00 |
| A&P A | .70 ± .17 | .46 ± .19 | .72 ± .17 | .42 ± .16 | .025 | −.04 |
| A&P B | .67 ± .18 | .43 ± .15 | .65 ± .17 | .41 ± .13 | −.016 | −.01 |
| All courses | .69 ± .17 | .45 ± .17 | .69 ± .17 | .44 ± .15 | −.004 | −.01 |
an denotes the number of questions used in first and second offerings, where the least effective distractor was removed at second offering
bMean ± 1 SD
⋄Average difference: the difference in DF and DI between offerings was calculated for each question (second offering minus first offering), and the average change calculated for each course; p-value shown in parentheses
Fig. 3Change in discrimination index and difficulty for individual questions after removing least effective distractor. Most questions showed minimal changes of ≤.1 units in DF and DI. Panels: (a) Chem A (n = 15); (b) Chem B (n = 27); (c) A&P A (n = 20); (d) A&P B (n = 21)
Fig. 4Relationship between item difficulty and discrimination before and after removal of least effective distractor. Numbers indicate the question number in each of the offerings (label above respective marker, or indicated by leader lines). Second order polynomial trend lines (without pre-specified intercept) added to each offering dataset. Panels: (a) Chem A; (b) Chem B; (c) A&P A; (d) A&P B