| Literature DB >> 29121888 |
Dario Cecilio-Fernandes1, Harro Medema2, Carlos Fernando Collares3, Lambert Schuwirth3,4, Janke Cohen-Schotanus5, René A Tio5.
Abstract
BACKGROUND: Progress testing is an assessment tool used to periodically assess all students at the end-of-curriculum level. Because students cannot know everything, it is important that they recognize their lack of knowledge. For that reason, the formula-scoring method has usually been used. However, where partial knowledge needs to be taken into account, the number-right scoring method is used. Research comparing both methods has yielded conflicting results. As far as we know, in all these studies, Classical Test Theory or Generalizability Theory was used to analyze the data. In contrast to these studies, we will explore the use of the Rasch model to compare both methods.Entities:
Keywords: Assessment; Construct-irrelevant variance; Formula scoring; Multiple choice questions; Number-right scoring; Rasch model; Reliability; Validity
Mesh:
Year: 2017 PMID: 29121888 PMCID: PMC5679154 DOI: 10.1186/s12909-017-1051-8
Source DB: PubMed Journal: BMC Med Educ ISSN: 1472-6920 Impact factor: 2.463
Crossover design of tests 1 & 2 versus groups A & B with formula-scoring and number-right scoring conditions per year
| Group A | Group B | |
|---|---|---|
| Test 1 | Formula scoring (FS) | Number-right scoring (NR) |
| Test 2 | Number-right scoring (NR) | Formula scoring (FS) |
Mean, standard deviation, minimum and maximum of measurement, infit, outfit, and error for items and person
| Items | Person | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Measure |
|
| Error | Measure |
|
| Error | |||
| Test 1 | FS | Mean | 0.00 | 1.00 | 1.00 | 0.12 | 0.32 | 1.04 | 1.00 | 0.13 |
| SD | 0.55 | 0.05 | 0.10 | 0.05 | 0.25 | 0.37 | 0.41 | 0.01 | ||
| Minimum | −1.85 | 0.91 | 0.52 | 0.05 | −0.38 | 0.41 | 0.36 | 0.11 | ||
| Maximum | 1.55 | 1.22 | 1.48 | 0.38 | 1.54 | 2.51 | 2.21 | 0.21 | ||
| NR | Mean | 0.00 | 1.00 | 1.00 | 0.20 | 0.24 | 1.00 | 1.00 | 0.23 | |
| SD | 1.17 | 0.06 | 0.10 | 0.07 | 0.50 | 0.09 | 0.20 | 0.01 | ||
| Minimum | −4.15 | 0.84 | 0.79 | 0.17 | −1.32 | 0.81 | 0.72 | 0.22 | ||
| Maximum | 2.57 | 1.12 | 1.37 | 0.71 | 1.57 | 1.29 | 2.58 | 0.26 | ||
| FS | Mean | 0.00 | 1.00 | 1.03 | 0.12 | 0.33 | 1.03 | 1.03 | 0.12 | |
| Test 2 | SD | 0.59 | 0.05 | 0.14 | 0.05 | 0.25 | 0.35 | 0.44 | 0.01 | |
| Minimum | −2.68 | 0.90 | 0.83 | 0.05 | −0.31 | 0.27 | 0.34 | 0.11 | ||
| Maximum | 1.71 | 1.25 | 2.02 | 0.39 | 1.27 | 2.14 | 3.46 | 0.18 | ||
| NR | Mean | 0.00 | 1.00 | 0.99 | 0.18 | 0.24 | 1.00 | 0.99 | 0.22 | |
| SD | 0.90 | 0.06 | 0.10 | 0.03 | 0.51 | 0.10 | 0.15 | 0.01 | ||
| Minimum | −2.32 | 0.85 | 0.66 | 0.17 | −1.53 | 0.78 | 0.66 | 0.22 | ||
| Maximum | 2.42 | 1.13 | 1.19 | 0.30 | 1.75 | 1.32 | 1.53 | 0.27 | ||
FS formula-scoring group, NR number-right scoring group
Differences between formula score and number right from a Rasch perspective, influence on items
| W > R | W = R | ? = P | ? > R | ? = R | P >? | P >?;R | Total | ||
|---|---|---|---|---|---|---|---|---|---|
| Test 1 | FS | NA | NA | 3 | 5 | 0 | 6 | 2 | 16 |
| NR | 5 | 1 | NA | NA | NA | NA | NA | 6 | |
| Test 2 | FS | NA | NA | 1 | 1 | 1 | 11 | 6 | 20 |
| NR | 5 | 1 | NA | NA | NA | NA | NA | 6 | |
FS formula-scoring group, NR number-right scoring group, W Wrong, R Right,? Question Mark, P Penalty. Count, NA not applicable
Person and Item reliability coefficient per test based on the Rasch
| Test 1 | Test 2 | |||
|---|---|---|---|---|
| FS | NR | FS | NR | |
| Person reliability | 0.73 | 0.80 | 0.77 | 0.82 |
| Item reliability | 0.94 | 0.96 | 0.94 | 0.96 |
FS formula-scoring group, NRB number-right scoring group
Fig. 1Map of question difficulty and student ability for Test 1. Left hand side shows questions under the formula scoring method and the right hand side shows questions under the right number scoring method
Fig. 2Map of question difficulty and student ability for Test 2. Left hand side shows questions under the formula scoring method and the right hand side shows questions under the right number scoring method