| Literature DB >> 26900331 |
A Domnich1, D Panatto1, L Arata1, I Bevilacqua1, L Apprato1, R Gasparini1, D Amicizia1.
Abstract
INTRODUCTION: Health-related knowledge is often assessed through multiple-choice tests. Among the different types of formats, researchers may opt to use multiple-mark items, i.e. with more than one correct answer. Although multiple-mark items have long been used in the academic setting - sometimes with scant or inconclusive results - little is known about the implementation of this format in research on in-field health education and promotion.Entities:
Keywords: Multiple answer items; Multiple-mark items; Pick-N items
Year: 2015 PMID: 26900331 PMCID: PMC4753817
Source DB: PubMed Journal: J Prev Med Hyg ISSN: 1121-2233
Description of scoring algorithms used in the study.
| Scoring algorithm | Definition | Reference |
|---|---|---|
| SA1 | S = 1 if IC = 0, otherwise S = 0 | 12, 16, 18, 19 |
| SA2 | S = (CC – IC)/TO | 12 |
| SA3 | S = CC/TO | 12 |
| SA4 | S = MCO/CO – (MIO/(TO – CO)) | 12 |
| SA5 | S = CC/TO – ((TO!/IC!·(TO – IC)!)/2^TO) | 12 |
| SA6 | p = MCO/CO, if p > 0 ⇒ x = MO/TO – CO/TO, otherwise p = S; | 18 |
| SA7 | S = MCO/CO if MCO ≤ CO, otherwise S = 0 | 16 |
| SA8 | S = 1 if IC = 0, S = 0.5 if 0.5·CO ≤ MCO < CO, otherwise S = 0 | 19 |
S: Respondent's score on a multiple-mark item (max = 1); CO: Number of keyed correct options; CC: Correct choices made by a respondent (both marked correct answers and unmarked distracters); IC: Incorrect choices made by a respondent (both marked distracters and unmarked correct answers); TO: Total number of item options; MCO: Correct options marked by a respondent; MIO: Incorrect options marked by a respondent; MO: Options marked by a respondent; p: Points for MCO; x: Penalty.
Fig. 1.Students' scores, by scoring algorithm and survey subset.
Spearman's ρ correlation coefficients between the dichotomous scoring algorithm 1 (SA1) and the other seven partial scoring rules applied to the multiple-mark survey subset (all p < .001).
| Scoring algorithm | ρ | 95% CI |
|---|---|---|
| SA2 | 0.79 | 0.74-0.83 |
| SA3 | 0.73 | 0.67-0.78 |
| SA4 | 0.74 | 0.68-0.79 |
| SA5 | 0.82 | 0.78-0.85 |
| SA6 | 0.81 | 0.77-0.85 |
| SA7 | 0.78 | 0.73-0.82 |
| SA8 | 0.88 | 0.85-0.90 |
Reliability measures of the scoring algorithms (SAs), by survey subset.
| Scoring algorithm | α (95% CI) | N of items needed to reach α = 0.7 | ||
|---|---|---|---|---|
| All (N = 12) | MM (N = 9) | All (N = 12) | MM (N = 9) | |
| SA1 | 0.48 (0.38-0.56) | 0.42 (0.32-0.52) | 31 | 29 |
| SA2 | 0.60 (0.53-0.67) | 0.57 (0.49-0.64) | 19 | 16 |
| SA3 | 0.65 (0.59-0.71) | 0.66 (0.60-0.72) | 15 | 11 |
| SA4 | 0.59 (0.51-0.65) | 0.53 (0.45-0.61) | 20 | 19 |
| SA5 | 0.65 (0.58-0.71) | 0.65 (0.59-0.71) | 16 | 12 |
| SA6 | 0.66 (0.60-0.72) | 0.68 (0.63-0.74) | 15 | 10 |
| SA7 | 0.67 (0.62-0.73) | 0.71 (0.65-0.76) | 14 | - |
| SA8 | 0.60 (0.53-0.68) | 0.59 (0.52-0.66) | 19 | 15 |
Relative efficiency of the scoring algorithms, as measured by the coefficient of effective length of all items (upper right triangle) and only multiple-mark items (lower left triangle).
| Scoring algorithm | SA1 | SA2 | SA3 | SA4 | SA5 | SA6 | SA7 | SA8 |
|---|---|---|---|---|---|---|---|---|
| SA1 | – | 1.63 | 2.01 | 1.56 | 2.01 | 2.10 | 2.20 | 1.63 |
| SA2 | 1.83 | – | 1.24 | 0.96 | 1.24 | 1.29 | 1.35 | 1.00 |
| SA3 | 2.68 | 1.46 | – | 0.77 | 1.00 | 1.05 | 1.09 | 0.81 |
| SA4 | 1.56 | 0.85 | 0.58 | – | 1.29 | 1.35 | 1.41 | 1.04 |
| SA5 | 2.56 | 1.40 | 0.96 | 1.65 | – | 1.05 | 1.09 | 0.81 |
| SA6 | 2.93 | 1.60 | 1.09 | 1.88 | 1.14 | – | 1.05 | 0.77 |
| SA7 | 3.38 | 1.85 | 1.26 | 2.17 | 1.32 | 1.15 | – | 0.74 |
| SA8 | 1.99 | 1.09 | 0.74 | 1.28 | 0.77 | 0.68 | 0.59 | – |
Pairwise comparisons of Cronbach's α coefficients of all items (upper right triangle) and only multiple-mark items (lower left triangle)..
| Scoring algorithm | SA1 | SA2 | SA3 | SA4 | SA5 | SA6 | SA7 | SA8 |
|---|---|---|---|---|---|---|---|---|
| SA1 | – | < 0.001 | < 0.001 | 0.011 | < 0.001 | < 0.001 | < 0.001 | < 0.001 |
| SA2 | 0.003 | – | 0.036 | 0.99 | < 0.001 | < 0.001 | < 0.001 | 0.99 |
| SA3 | < 0.001 | < 0.001 | – | < 0.001 | 0.99 | 0.99 | 0.94 | 0.046 |
| SA4 | 0.31 | 0.23 | < 0.001 | – | < 0.001 | < 0.001 | 0.99 | |
| SA5 | < 0.001 | < 0.001 | 0.99 | < 0.001 | – | 0.99 | 0.048 | 0.001 |
| SA6 | < 0.001 | < 0.001 | 0.99 | < 0.001 | 0.12 | – | 0.51 | < 0.001 |
| SA7 | < 0.001 | < 0.001 | 0.037 | < 0.001 | < 0.001 | 0.004 | – | < 0.001 |
| SA8 | < 0.001 | 0.99 | 0.018 | 0.99 | 0.001 | < 0.001 | < 0.001 | – |
Results are reported as p-values corrected by means of Bonferroni's method.
Difficulty parameters of survey items, as measured by the different scoring algorithms, by survey subset.
| Scoring algorithm | Mean difficulty, | N of easy items | N of difficult items | |
|---|---|---|---|---|
| All (N = 12) | MM (N = 9) | |||
| SA1 | 0.33 (0.27) | 0.27 (0.25) | 0 | 4 |
| SA2 | 0.46 (0.29) | 0.44 (0.32) | 1 | 2 |
| SA3 | 0.64 (0.20) | 0.68 (0.17) | 3 | 0 |
| SA4 | 0.50 (0.26) | 0.50 (0.28) | 1 | 2 |
| SA5 | 0.54 (0.22) | 0.55 (0.22) | 1 | 0 |
| SA6 | 0.57 (0.20) | 0.59 (0.20) | 1 | 0 |
| SA7 | 0.58 (0.21) | 0.60 (0.20) | 1 | 0 |
| SA8 | 0.47 (0.25) | 0.45 (0.26) | 1 | 2 |
Discrimination parameters of survey items, as measured by the different scoring algorithms, by survey subset.
| Scoring algorithm | Mean discrimination index, | N of items with | ||
|---|---|---|---|---|
| All (N = 12) | MM (N = 9) | All (N = 12) | MM (N = 9) | |
| SA1 | 0.33 (0.21) | 0.35 (0.28) | 8 | 5 |
| SA2 | 0.36 (0.12) | 0.38 (0.16) | 11 | 8 |
| SA3 | 0.30 (0.16) | 0.30 (0.10) | 8 | 6 |
| SA4 | 0.36 (0.12) | 0.37 (0.15) | 11 | 9 |
| SA5 | 0.32 (0.15) | 0.30 (0.14) | 9 | 7 |
| SA6 | 0.32 (0.14) | 0.30 (0.11) | 9 | 8 |
| SA7 | 0.32 (0.13) | 0.31 (0.12) | 9 | 8 |
| SA8 | 0.32 (0.16) | 0.32 (0.16) | 10 | 6 |
Fig. 2.Standardized mean differences in total scores between females and males (a), native and immigrant students (b), native male and immigrant male students (c) and native female and immigrant female students (d), by scoring algorithm.
Results of ANOVA models for sex and nationality, by scoring algorithm.
| Scoring algorithm | Sex | Nation | Sex : Nation | |||
|---|---|---|---|---|---|---|
| F | p | F | p | F | p | |
| SA1 | 2.88 | 0.091 | 1.87 | 0.17 | 5.43 | 0.021 |
| SA2 | 3.42 | 0.066 | 3.90 | 0.049 | 8.05 | 0.005 |
| SA3 | 8.07 | 0.005 | 4.13 | 0.043 | 3.07 | 0.081 |
| SA4 | 4.66 | 0.032 | 5.53 | 0.019 | 6.47 | 0.012 |
| SA5 | 6.64 | 0.011 | 3.28 | 0.071 | 5.35 | 0.021 |
| SA6 | 5.79 | 0.017 | 2.53 | 0.11 | 4.41 | 0.037 |
| SA7 | 5.62 | 0.018 | 1.44 | 0.23 | 4.40 | 0.037 |
| SA8 | 3.84 | 0.051 | 1.98 | 0.16 | 5.49 | 0.020 |
Fig. 3.Standardized mean differences in pre- and post-intervention total scores, by scoring algorithm