| Literature DB >> 19580681 |
Marie Tarrant1, James Ware, Ahmed M Mohammed.
Abstract
BACKGROUND: Four- or five-option multiple choice questions (MCQs) are the standard in health-science disciplines, both on certification-level examinations and on in-house developed tests. Previous research has shown, however, that few MCQs have three or four functioning distractors. The purpose of this study was to investigate non-functioning distractors in teacher-developed tests in one nursing program in an English-language university in Hong Kong.Entities:
Mesh:
Year: 2009 PMID: 19580681 PMCID: PMC2713226 DOI: 10.1186/1472-6920-9-40
Source DB: PubMed Journal: BMC Med Educ ISSN: 1472-6920 Impact factor: 2.463
Characteristics of the Tests
| Test A | Test B | Test C | Test D | Test E | Test F | Test G | Total | |
| No. of items | 96 | 72 | 86 | 50 | 50 | 60 | 100 | 514 |
| No. of examinees | 146 | 74 | 74 | 73 | 73 | 73 | 75 | 588 |
| Mean test score % (SD) | 67.7 (9.87) | 55.5 (8.52) | 69.2 (10.44) | 72.0 (10.82) | 62.6 (11.28) | 67.8 (10.02) | 65.6 (11.29) | -- |
| Range of test scores (%) | 38–89 | 33–71 | 38–90 | 46–94 | 34–88 | 35–88 | 34–89 | -- |
| KR20 Reliability | .81 | .71 | .82 | .71 | .72 | .70 | .87 | -- |
SD = standard deviation; KR-20 = Kuder-Richardson 20
Distractor Performance
| Test A | Test B | Test C | Test D | |
| No. of items | 96 | 72 | 86 | 50 |
| No. of distractors assessed | 288 | 216 | 258 | 150 |
| Distractors with: | ||||
| Frequency <5% n(%) | 88 (30.6) | 78 (36.1) | 109 (42.2) | 60 (40.0) |
| Discrimination ≥ 0 | 68 (23.6) | 75 (34.7) | 88 (34.1) | 48 (32.0) |
| Both | 40 (13.9) | 39 (18.1) | 60 (23.3) | 34 (22.7) |
| Frequency = 0% n(%) | 12 (4.2) | 24 (11.1) | 42 (16.3) | 21 (14.0) |
| Functioning distractors per test n(%) | 172 (59.7) | 102 (47.2) | 121 (46.9) | 76 (50.7) |
| Functioning distractors per item n(%) | ||||
| None | 8 (8.3) | 11 (15.3) | 13 (15.1) | 7 (14.0) |
| One | 27 (28.1) | 30 (41.7) | 35 (40.7) | 16 (32.0) |
| Two | 43 (44.8) | 26 (36.1) | 28 (32.6) | 22 (44.0) |
| Three | 18 (18.8) | 5 (6.9) | 10 (11.6) | 5 (10.0) |
| Functioning distractors per item M(SD) | 1.74 (.86) | 1.35 (.82) | 1.41 (.89) | 1.50 (.86) |
| Test E | Test F | Test G | Total | |
| No. of items | 50 | 60 | 100 | 514 |
| No. of distractors assessed | 150 | 180 | 300 | 1542 |
| Distractors with: | ||||
| Frequency <5% n(%) | 43 (28.7) | 62 (34.4) | 101(33.7) | 541 (35.1) |
| Discrimination ≥ 0 | 46 (30.7) | 69 (38.3) | 78 (26.0) | 472 (30.6) |
| Both | 26 (17.3) | 33 (18.3) | 44 (14.7) | 276 (17.9) |
| Frequency = 0% n(%) | 15 (10.0) | 21 (11.7) | 23 (7.7) | 158 (10.2) |
| Functioning distractors per test n(%) | 87 (58.0) | 82 (45.6) | 165 (55.0) | 805 (52.2) |
| Functioning distractors per item n(%) | ||||
| None | 6 (12.0) | 8 (13.3) | 10 (10.0) | 63 (12.3) |
| One | 13 (26.0) | 25 (41.7) | 33 (33.0) | 179 (34.8) |
| Two | 19 (38.0) | 24 (40.0) | 39 (39.0) | 201 (39.1) |
| Three | 12 (24.0) | 3 (5.0) | 18 (18.0) | 71 (13.8) |
| Functioning distractors per item M(SD) | 1.74 (.96) | 1.37 (.78) | 1.65 (.89) | 1.54 (.88) |
Relationship between Number of Functioning Distractors and Item Difficulty
| None | One | Two | Three | ||
| Test A | .94 | .77 | .60 | .60*** | -.51*** |
| Test B | .71 | .54 | .48 | .67 | .13 |
| Test C | .88 | .71 | .60 | .62*** | -.44*** |
| Test D | .93 | .72 | .66 | .67** | -.46*** |
| Test E | .78 | .71 | .59 | .51* | -.43** |
| Test F | .77 | .72 | .63 | .55 | -.33** |
| Test G | .89 | .70 | .57 | .62*** | -.34*** |
* p < .05
** p < .01
*** p < .001
Relationship between Number of Functioning Distractors and Point-Biserial Correlation Coefficient
| None | One | Two | Three | ||
| Test A | .09 | .19 | .25 | .28*** | 0.44** |
| Test B | .06 | .18 | .24 | .36* | 0.35** |
| Test C | .10 | .22 | .31 | .36*** | 0.53*** |
| Test D | .17 | .21 | .28 | .40* | 0.45** |
| Test E | .09 | .12 | .33 | .38*** | 0.64*** |
| Test F | .14 | .23 | .26 | .38 | 0.32* |
| Test G | .24 | .22 | .29 | .36* | 0.28** |
* p < .05
** p < .01
*** p < .001
Results of Reallocation of Poor Functioning Distracters
| Test A | Test B | Test C | Test D | Test E | Test F | Test G | Total | |
| Redistributed options n | 88 | 49 | 57 | 30 | 37 | 44 | 79 | 384 |
| Redistribution to keyed option n | 30 | 7 | 20 | 13 | 12 | 21 | 21 | 124 |
| Examinees benefiting % | 4.61 | 7.34 | 4.26 | 4.43 | 5.02 | 4.89 | 5.46 | -- |
| Examinees re-classified as pass n | 1 | 3 | 2 | 1 | 2 | 1 | 1 | 11 |
Comparison of Four-Option and Three-Option Tests Generated by Random Redistribution of Fourth Option
| Mean test score % (SD) | Range of test scores (%) | KR20 Reliability | ||||
| 4-options | 3-options | 4-options | 3-options | 4-options | 3-options | |
| Test A | 67.4 (10.04) | 69.2 (9.46) | 38–90 | 38–90 | .81 | .80 |
| Test B | 55.4 (8.84) | 56.1 (8.61) | 33–71 | 35–71 | .71 | .70 |
| Test C | 69.2 (10.44) | 70.1 (9.86) | 38–90 | 41–90 | .82 | .79 |
| Test D | 71.9 (10.82) | 72.5 (10.70) | 46–94 | 46–94 | .71 | .69 |
| Test E | 64.2 (8.35) | 64.9 (8.21) | 46–84 | 46–84 | .73 | .70 |
| Test F | 67.9 (10.02) | 69.1 (9.79) | 35–88 | 35–90 | .70 | .69 |
| Test G | 65.6 (11.29) | 66.6 (10.80) | 34–89 | 36–89 | .87 | .85 |
| Total (mean) | (65.9) | (66.9) | -- | -- | (0.76) | (0.75) |