| Literature DB >> 35991657 |
Qianqian Pan1, Zhehan Jiang2,3.
Abstract
Background: Analyzing distractor qualities of a pediatrics subject test in a national-level examination is vital in developing high-quality items for the discipline. Yet traditional approaches focus on key answers only and therefore are less informative. The number of distractors can also be parsimonized to improve the item development. Materials and methods: From a pediatrics subject test at the national level, raw responses of 44,332 examines to nineteen multiple-choice questions were analyzed, such that the distractor qualities were evaluated via traditional and advanced methods such as canonical correlation index. Additionally, a simulation study was conducted to investigate the impact of eliminating distractor numbers on reliability.Entities:
Keywords: assessment; distractor; pediatrics; psychometrics; reliability
Year: 2022 PMID: 35991657 PMCID: PMC9385996 DOI: 10.3389/fmed.2022.921719
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Figure 1Trace line plots of two items.
Traditional item analysis results.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 1 | 0.91 | 0.29 | 0.12 | 0.07 | 0.61 |
| 2 | 0.34 | 0.47 | 0.30 | 0.30 | 0.60 |
| 3 | 0.39 | 0.49 | 0.33 | 0.36 | 0.60 |
| 4 | 0.86 | 0.35 | 0.28 | 0.21 | 0.60 |
| 5 | 0.88 | 0.32 | 0.29 | 0.20 | 0.60 |
| 6 | 0.83 | 0.38 | 0.28 | 0.23 | 0.60 |
| 7 | 0.36 | 0.48 | 0.24 | 0.25 | 0.61 |
| 8 | 0.42 | 0.49 | 0.38 | 0.42 | 0.59 |
| 9 | 0.54 | 0.50 | 0.43 | 0.49 | 0.58 |
| 10 | 0.58 | 0.49 | 0.29 | 0.32 | 0.60 |
| 11 | 0.72 | 0.45 | 0.39 | 0.39 | 0.59 |
| 12 | 0.40 | 0.49 | 0.54 | 0.62 | 0.56 |
| 13 | 0.17 | 0.37 | 0.41 | 0.31 | 0.58 |
| 14 | 0.49 | 0.50 | 0.45 | 0.50 | 0.58 |
| 15 | 0.78 | 0.41 | 0.19 | 0.16 | 0.61 |
| 16 | 0.44 | 0.50 | 0.54 | 0.62 | 0.56 |
| 17 | 0.35 | 0.48 | 0.51 | 0.55 | 0.57 |
| 18 | 0.61 | 0.49 | 0.22 | 0.21 | 0.61 |
| 19 | 0.64 | 0.48 | 0.36 | 0.38 | 0.59 |
P.
Distractor analyses results.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 1 | 3 | 2 | −0.11 | 0.22 | – |
| 2 | 1 | 1 | −0.26 | 0.62 | 0.25 |
| 3 | 2 | 2 | −0.21 | 0.46 | 0.13 |
| 4 | 3 | 2 | −0.20 | 0.62 | – |
| 5 | 3 | 1 | −0.21 | 0.70 | – |
| 6 | 2 | 1 | −0.18 | 0.63 | 0.23 |
| 7 | 1 | 2 | −0.19 | 0.38 | 0.24 |
| 8 | 1 | 1 | −0.35 | 0.68 | 0.20 |
| 9 | 1 | 1 | −0.31 | 0.59 | 0.44 |
| 10 | 3 | 3 | −0.18 | 0.38 | – |
| 11 | 2 | 0 | −0.27 | 0.63 | 0.28 |
| 12 | 2 | 2 | −0.47 | 0.75 | 0.30 |
| 13 | 1 | 1 | −0.43 | 0.65 | 0.22 |
| 14 | 2 | 2 | −0.29 | 0.65 | 0.32 |
| 15 | 1 | 0 | −0.12 | 0.39 | 0.06 |
| 16 | 1 | 2 | −0.45 | 0.76 | 0.31 |
| 17 | 1 | 1 | −0.48 | 0.65 | 0.28 |
| 18 | 1 | 1 | −0.14 | 0.37 | 0.33 |
| 19 | 1 | 0 | −0.21 | 0.62 | 0.25 |
R.
Figure 2Trace line plots of all items with assessed nodes and cognitive levels.
Test reliability, test scores, and item parameter estimates after eliminating distractors.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| No elimination | 0.560 | 0.693 | 0.696 | 1.113 | −0.478 | 0.111 | |
| Method 1: Eliminated the least popular distractor | Random assignment | 0.568 | 0.662 | 0.669 | 1.198 | −1.201 | 0.190 |
| Method 2: Randomly eliminated one distractor | Random assignment | 0.591 | 0.650 | 0.656 | 1.143 | −1.379 | 0.199 |
| Method 3: Randomly eliminated one distractor | Original proportion | 0.629 | 0.623 | 0.628 | 1.094 | −3.055 | 0.231 |
|
| |||||||
| Method 1 | Method 2 | Method 3 | |||||
| Original Factor scores | 0.991 | 0.977 | 0.950 | ||||
a, b, and c represent the item discrimination, item difficulty, and guessing parameters from a unidimensional 3PL IRT model.