| Literature DB >> 33343450 |
Xiaojian Sun1,2, Yanlou Liu3, Tao Xin4, Naiqing Song1,2.
Abstract
Calibration errors are inevitable and should not be ignored during the estimation of item parameters. Items with calibration error can affect the measurement results of tests. One of the purposes of the current study is to investigate the impacts of the calibration errors during the estimation of item parameters on the measurement accuracy, average test length, and test efficiency for variable-length cognitive diagnostic computerized adaptive testing. The other purpose is to examine the methods for reducing the adverse effects of calibration errors. Simulation results show that (1) calibration error has negative effect on the measurement accuracy for the deterministic input, noisy "and" gate (DINA) model, and the reduced reparameterized unified model; (2) the average test lengths is shorter, and the test efficiency is overestimated for items with calibration errors; (3) the compensatory reparameterized unified model (CRUM) is less affected by the calibration errors, and the classification accuracy, average test length, and test efficiency are slightly stable in the CRUM framework; (4) methods such as improving the quality of items, using large calibration sample to calibrate the parameters of items, as well as using cross-validation method can reduce the adverse effects of calibration errors on CD-CAT.Entities:
Keywords: calibration errors; classification accuracy; cognitive diagnosis assessment; test efficiency; variable-length cognitive diagnostic computerized adaptive testing
Year: 2020 PMID: 33343450 PMCID: PMC7738350 DOI: 10.3389/fpsyg.2020.575141
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Simulation design of the current study.
| Model | DINA, RRUM, CRUM | |
| Calibration error | 0, 0.1, 0.2, 0.3 | |
| Termination rule | 0.7, 0.8 | |
| Item quality | High | DINA: |
| RRUM: | ||
| CRUM: | ||
| Low | DINA: | |
| RRUM: | ||
| CRUM: | ||
| Mix | DINA: | |
| RRUM: | ||
| CRUM: | ||
FIGURE 1The correct classification rates for all conditions.
FIGURE 2The average test length and the corresponding SDs for all conditions.
FIGURE 3The relative test efficiency with different calibration errors.
The relative test efficiency for each calibration error.
| DINA | Low | 0.1 | 2.456 | 0.016 | 2.299 | 0.029 |
| 0.2 | 18.782 | 0.200 | 19.528 | 0.259 | ||
| 0.3 | 45.504 | 0.703 | 45.149 | 0.460 | ||
| High | 0.1 | 5.149 | 0.034 | 5.117 | 0.039 | |
| 0.2 | 6.752 | 0.050 | 6.777 | 0.047 | ||
| 0.3 | 8.442 | 0.064 | 8.408 | 0.049 | ||
| Mix | 0.1 | 4.008 | 0.026 | 3.998 | 0.035 | |
| 0.2 | 5.160 | 0.055 | 5.346 | 0.053 | ||
| 0.3 | 11.294 | 0.069 | 11.251 | 0.076 | ||
| RRUM | Low | 0.1 | 1.778 | 0.015 | 1.683 | 0.013 |
| 0.2 | 6.112 | 0.061 | 6.008 | 0.072 | ||
| 0.3 | 9.882 | 0.100 | 9.617 | 0.093 | ||
| High | 0.1 | 3.386 | 0.018 | 3.444 | 0.019 | |
| 0.2 | 5.968 | 0.040 | 5.937 | 0.035 | ||
| 0.3 | 6.120 | 0.028 | 6.134 | 0.039 | ||
| Mix | 0.1 | 3.112 | 0.025 | 2.885 | 0.025 | |
| 0.2 | 5.005 | 0.036 | 5.106 | 0.034 | ||
| 0.3 | 6.802 | 0.060 | 6.766 | 0.082 | ||
| CRUM | Low | 0.1 | 1.019 | 0.002 | 1.017 | 0.002 |
| 0.2 | 1.041 | 0.004 | 1.026 | 0.004 | ||
| 0.3 | 1.076 | 0.006 | 1.054 | 0.008 | ||
| High | 0.1 | 1.007 | 0.000 | 1.005 | 0.000 | |
| 0.2 | 1.013 | 0.001 | 1.006 | 0.001 | ||
| 0.3 | 1.065 | 0.001 | 0.983 | 0.001 | ||
| Mix | 0.1 | 1.007 | 0.000 | 1.013 | 0.000 | |
| 0.2 | 0.997 | 0.001 | 0.978 | 0.001 | ||
| 0.3 | 1.098 | 0.002 | 1.059 | 0.001 | ||