| Literature DB >> 30483187 |
Daniel M Bolt1, Sora Lee1, James Wollack1, Carol Eckerly2, John Sowles3.
Abstract
Asymmetric IRT models have been shown useful for capturing heterogeneity in the number of latent subprocesses underlying educational test items (Lee and Bolt, 2018a). One potentially useful practical application of such models is toward the scoring of discrete-option multiple-choice (DOMC) items. Under the DOMC format, response options are independently and randomly administered up to the (last) keyed response, and thus the scheduled number of distractor response options to which an examinee may be exposed (and consequently the overall difficulty of the item) can vary. In this paper we demonstrate the applicability of Samejima's logistic positive exponent (LPE) model to response data from an information technology certification test administered using the DOMC format, and discuss its advantages relative to a two-parameter logistic (2PL) model in addressing such effects. Application of the LPE in the context of DOMC items is shown to (1) provide reduced complexity and a superior comparative fit relative to the 2PL, and (2) yield a latent metric with reduced shrinkage at high proficiency levels. The results support the potential use of the LPE as a basis for scoring DOMC items so as to account for effects related to key location.Entities:
Keywords: Samejima's logistic positive exponent (LPE) model; computerized testing; item response theory (IRT); latent ability estimates; multiple-choice
Year: 2018 PMID: 30483187 PMCID: PMC6240662 DOI: 10.3389/fpsyg.2018.02175
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1Item characteristic curves (ICCs) for three hypothetical LPE items (a = 1, b = 0 for all items) that vary with respect to ξ.
Mean classical item difficulties (p-values) and discriminations (item-total correlations) as a function of the number of scheduled response options, single keyed, double keyed, and triple keyed items.
| 1 | 0.64 | 0.22 | ||||
| 2 | 0.49 | 0.31 | 0.47 | 0.21 | ||
| 3 | 0.37 | 0.38 | 0.32 | 0.31 | 0.34 | 0.35 |
| 4 | 0.31 | 0.40 | 0.22 | 0.34 | 0.26 | 0.38 |
| 5 | 0.19 | 0.35 | ||||
Empirical comparison of IRT models applied to IT certification data (N = 648).
| 1 | 2PL × Item | 43874.8 | 43142.7 | 732.2 | 44,607 |
| 2 | 2PL × Item × #RespOpt (a and b) | 40681.8 | 39573.7 | 1108.1 | 41,790 |
| 3 | 2PL × Item × #RespOpt (b only) | 40938.9 | 39985.2 | 953.6 | 41,893 |
| 4 | LPE × Item × #RespOpt (ξ | 40798.8 | 40119.8 | 678.9 | 41,478 |
| 5 | LPE × Item (ξ as a linear function of #RespOpt) | 40868.0 | 40185.9 | 682.1 | 41,550 |
Mean (Standard Deviation) of ξ estimates in relation to the number of scheduled response options.
| 1 | 0.619 (0.417) | ||
| 2 | 1.000 (0.485) | 1.013 (0.460) | |
| 3 | 1.487 (0.603) | 1.629 (0.610) | 1.468 (0.379) |
| 4 | 1.819 (0.703) | 2.246 (0.717) | 1.832 (0.285) |
| 5 | 2.534 (0.801) |
Figure 2Estimated item characteristic curves (ICCs) for three DOMC items, IT Certification Test (N = 648). (A) Single-Keyed Item (1–4 Possible Response Options); Item 18 (a = 0.953, b = 1.007, ξ1 = 0.441, ξ2 = 0.742, ξ3 = 1.107, ξ4 = 2.303). (B) Double-Keyed Item (2–4 Possible Response Options); Item 30 (a = 1.461, b = 0.307, ξ1 = 0.526, ξ2 = 0.850, ξ3 = 1.198). (C) Triple-Keyed Item (3–5 Possible Response Options); Item 27 (a = 0.913, b = 0.123, ξ1 = 1.176, ξ2 = 1.450; ξ3 = 1.833).
Figure 3Histograms of proficiency estimates under Model 2 (symmetric 2PL model) and Model 4 (asymmetric LPE model).