| Literature DB >> 31708833 |
Jiwei Zhang1, Jing Lu2, Feng Chen3, Jian Tao2.
Abstract
In many large-scale tests, it is very common that students are nested within classes or schools and that the test designers try to measure their multidimensional latent traits (e.g., logical reasoning ability and computational ability in the mathematics test). It is particularly important to explore the influences of covariates on multiple abilities for development and improvement of educational quality monitoring mechanism. In this study, motivated by a real dataset of a large-scale English achievement test, we will address how to construct an appropriate multilevel structural models to fit the data in many of multilevel models, and what are the effects of gender and socioeconomic-status differences on English multidimensional abilities at the individual level, and how does the teachers' satisfaction and school climate affect students' English abilities at the school level. A full Gibbs sampling algorithm within the Markov chain Monte Carlo (MCMC) framework is used for model estimation. Moreover, a unique form of the deviance information criterion (DIC) is used as a model comparison index. In order to verify the accuracy of the algorithm estimation, two simulations are considered in this paper. Simulation studies show that the Gibbs sampling algorithm works well in estimating all model parameters across a broad spectrum of scenarios, which can be used to guide the real data analysis. A brief discussion and suggestions for further research are shown in the concluding remarks.Entities:
Keywords: Bayesian estimation; education assessment; multidimensional item response theory; multilevel model; teacher satisfactions
Year: 2019 PMID: 31708833 PMCID: PMC6823212 DOI: 10.3389/fpsyg.2019.02387
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Estimation of simulated item parameter estimation using Gibbs sampling algorithm in simulation study 1.
| 1 | 1 | 1 | − | 0 | 0 | − | 0 | 0 | − |
| 2 | 0 | 0 | − | 1 | 1 | − | 0 | 0 | − |
| 3 | 0.914 | 0.877 | [0.711, 1.044] | 0.686 | 0.672 | [0.551, 0.795] | −1.182 | −1.154 | [−1.327, −1.005] |
| 4 | 1.102 | 1.127 | [0.915, 1.355] | 1.468 | 1.485 | [1.250, 1.717] | 0.441 | 0.426 | [0.203, 0.629] |
| 5 | 2.055 | 2.046 | [1.674, 2.466] | 1.428 | 1.453 | [1.214, 1.678] | −1.197 | −1.367 | [−1.683, −1.101] |
| 6 | 2.291 | 2.361 | [1.876, 2.835] | 1.146 | 1.159 | [0.877, 1.406] | −2.536 | −2.524 | [−3.068, −2.187] |
| 7 | 2.131 | 2.185 | [1.834, 2.576] | 0.758 | 0.760 | [0.595, 0.930] | 1.782 | 1.759 | [1.448, 2.081] |
| 8 | 1.027 | 1.009 | [0.806, 1.214] | 1.720 | 1.736 | [1.491, 2.009] | 0.152 | 0.159 | [−0.229, 0.225] |
| 9 | 0.569 | 0.564 | [0.403, 0.713] | 1.119 | 1.152 | [0.973, 1.324] | 0.964 | 0.927 | [0.735, 1.093] |
| 10 | 0.578 | 0.550 | [0.342, 0.761] | 2.129 | 2.094 | [1.776, 2.471] | 1.462 | 1.485 | [1.215, 1.745] |
| 11 | 0.795 | 0.797 | [0.615, 0.980] | 1.445 | 1.466 | [1.261, 1.691] | 0.619 | 0.600 | [0.376, 0.787] |
| 12 | 2.279 | 2.389 | [1.191, 2.867] | 1.148 | 1.132 | [0.875, 1.412] | −2.020 | −2.028 | [−2.388, −1.696] |
| 13 | 0.714 | 0.616 | [0.391, 0.864] | 2.225 | 2.210 | [1.867, 2.532] | 0.602 | 0.577 | [0.293, 0.826] |
| 14 | 2.200 | 2.216 | [1.797, 2.651] | 1.465 | 1.471 | [1.217, 1.721] | 0.127 | 0.091 | [−0.219, 0.381] |
| 15 | 1.565 | 1.589 | [1.349, 1.847] | 0.728 | 0.711 | [0.558, 0.867] | −0.587 | −0.605 | [−0.817, −0.419] |
| 16 | 2.419 | 2.439 | [2.076, 2.866] | 2.408 | 2.380 | [2.015, 2.796] | −0.218 | −0.225 | [−0.635, 0.094] |
| 17 | 1.561 | 1.595 | [1.342, 1.869] | 1.398 | 1.388 | [1.182, 1.621] | 0.830 | 0.789 | [0.533, 1.022] |
| 18 | 2.457 | 2.470 | [1.981, 2.900] | 2.111 | 2.152 | [1.792, 2.547] | 1.558 | 1.560 | [1.182, 1.926] |
| 19 | 0.714 | 0.686 | [0.545, 0.843] | 0.918 | 0.883 | [0.743, 1.030] | 1.504 | 1.487 | [1.320, 1.670] |
| 20 | 2.447 | 2.482 | [2.023, 2.942] | 1.704 | 1.754 | [1.490, 2.018] | 0.126 | 0.110 | [−0.221, 0.421] |
| 21 | 1.588 | 1.562 | [1.217, 1.905] | 2.170 | 2.177 | [1.825, 2.534] | −0.760 | −0.789 | [−1.123, −0.521] |
| 22 | 1.724 | 1.721 | [1.456, 2.037] | 1.590 | 1.571 | [1.320, 1.800] | 0.769 | 0.671 | [0.397, 0.912] |
| 23 | 2.273 | 2.244 | [1.909, 2.616] | 0.948 | 0.917 | [0.738, 1.119] | 0.265 | 0.105 | [−0.156, 0.343] |
| 24 | 1.228 | 1.198 | [0.902, 1.505] | 2.782 | 2.755 | [2.353, 3.128] | −1.398 | −1.429 | [−1.834, −1.115] |
| 25 | 0.687 | 0.674 | [0.456, 0.923] | 2.261 | 2.275 | [1.925, 2.651] | 1.802 | 1.778 | [1.429, 2.111] |
| 26 | 1.665 | 1.666 | [1.427, 1.928] | 0.572 | 0.568 | [0.443, 0.709] | 0.033 | 0.021 | [−0.172, 0.208] |
| 27 | 2.383 | 2.400 | [1.904, 2.823] | 1.871 | 2.021 | [1.626, 2.359] | 1.307 | 1.285 | [0.915, 1.620] |
| 28 | 1.778 | 1.772 | [1.443, 2.111] | 2.326 | 2.305 | [1.957, 2.641] | −0.871 | −0.875 | [−1.193, −0.581] |
| 29 | 1.522 | 1.541 | [1.175, 1.975] | 2.909 | 2.934 | [2.460, 3.505] | 0.241 | 0.232 | [−0.175, 0.588] |
| 30 | 1.173 | 1.178 | [1.940, 1.434] | 1.703 | 1.710 | [1.458, 1.977] | 0.397 | 0.363 | [0.104, 0.577] |
indicates the constraints for model identification. True denotes the true value of parameter. EAP denotes the expected a priori estimation. HPDI denotes the 95% highest posterior density intervals.
Parameter estimates of the fixed effect, Level-2 variance-covariance and Level-3 variance-covariance in simulation 1.
| γ001 | 1.000 | 0.982 | [0.928, 1.225] | γ002 | −0.350 | −0.377 | [−0.659, −0.115] |
| γ011 | 0.300 | 0.326 | [0.129, 0.510] | γ012 | 0.300 | 0.281 | [−0.046, 0.524] |
| γ101 | 0.500 | 0.521 | [0.244, 0.807] | γ102 | 0.500 | 0.522 | [0.296, 0.824] |
| γ111 | 0.350 | 0.325 | [0.134, 0.501] | γ112 | −1.000 | −0.986 | [−1.234, −0.736] |
| 0.300 | 0.323 | [0.269, 0.387] | |||||
| σ | 0.075 | 0.093 | [0.053, 0.136] | ||||
| σ | 0.075 | 0.093 | [0.053, 0.136] | ||||
| 0.500 | 0.529 | [0.438, 0.648] | |||||
| τ001 | 0.100 | 0.115 | [0.016, 0.380] | τ002 | 0.100 | 0.073 | [−0.058, 0.369] |
| τ011 | 0 | 0.013 | [−0.229, 0.140] | τ012 | 0 | 0.017 | [−0.143, 0.192] |
| τ101 | 0 | 0.013 | [−0.229, 0.140] | τ102 | 0 | 0.017 | [−0.143, 0.192] |
| τ111 | 0.100 | 0.074 | [−0.068, 0.436] | τ112 | 0.100 | 0.119 | [−0.093, 0.298] |
Evaluating the accuracy of item parameter estimation.
| 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 3 | 0.914 | −0.037 | 0.114 | 0.686 | −0.014 | 0.090 | −1.182 | 0.028 | 0.144 |
| 4 | 1.102 | 0.025 | 0.098 | 1.468 | 0.017 | 0.125 | 0.441 | −0.015 | 0.093 |
| 5 | 2.055 | −0.010 | 0.073 | 1.428 | 0.025 | 0.047 | −1.197 | −0.170 | 0.137 |
| 6 | 2.291 | 0.070 | 0.153 | 1.146 | 0.013 | 0.084 | −2.536 | 0.012 | 0.126 |
| 7 | 2.131 | 0.054 | 0.119 | 0.758 | 0.002 | 0.035 | 1.782 | −0.023 | 0.149 |
| 8 | 1.027 | −0.018 | 0.159 | 1.720 | 0.016 | 0.140 | 0.152 | 0.007 | 0.094 |
| 9 | 0.569 | −0.005 | 0.136 | 1.119 | 0.033 | 0.102 | 0.964 | −0.037 | 0.072 |
| 10 | 0.578 | −0.019 | 0.180 | 2.129 | −0.035 | 0.185 | 1.462 | 0.023 | 0.103 |
| 11 | 0.795 | 0.002 | 0.088 | 1.445 | 0.021 | 0.137 | 0.619 | −0.019 | 0.081 |
| 12 | 2.279 | 0.110 | 0.153 | 1.148 | −0.016 | 0.098 | −2.020 | −0.008 | 0.053 |
| 13 | 0.714 | −0.098 | 0.142 | 2.225 | −0.015 | 0.053 | 0.602 | −0.025 | 0.091 |
| 14 | 2.200 | 0.016 | 0.093 | 1.465 | 0.006 | 0.039 | 0.127 | 0.036 | 0.127 |
| 15 | 1.565 | 0.024 | 0.120 | 0.728 | −0.017 | 0.092 | −0.587 | −0.018 | 0.116 |
| 16 | 2.419 | 0.020 | 0.162 | 2.408 | −0.028 | 0.164 | −0.218 | −0.007 | 0.092 |
| 17 | 1.561 | 0.034 | 0.105 | 1.398 | −0.010 | 0.072 | 0.830 | −0.041 | 0.115 |
| 18 | 2.457 | 0.013 | 0.091 | 2.111 | 0.041 | 0.109 | 1.558 | 0.002 | 0.150 |
| 19 | 0.714 | −0.028 | 0.155 | 0.918 | −0.035 | 0.156 | 1.504 | −0.017 | 0.197 |
| 20 | 2.447 | 0.035 | 0.198 | 1.704 | 0.050 | 0.143 | 0.126 | −0.016 | 0.156 |
| 21 | 1.588 | −0.026 | 0.185 | 2.170 | 0.007 | 0.124 | −0.760 | 0.029 | 0.256 |
| 22 | 1.724 | −0.003 | 0.147 | 1.590 | −0.019 | 0.128 | 0.769 | −0.098 | 0.153 |
| 23 | 2.273 | −0.029 | 0.084 | 0.948 | −0.031 | 0.060 | 0.265 | −0.160 | 0.179 |
| 24 | 1.228 | −0.030 | 0.189 | 2.782 | −0.027 | 0.194 | −1.398 | −0.031 | 0.132 |
| 25 | 0.687 | −0.013 | 0.075 | 2.261 | 0.014 | 0.107 | 1.802 | 0.024 | 0.193 |
| 26 | 1.665 | 0.001 | 0.120 | 0.572 | −0.004 | 0.068 | 0.033 | −0.012 | 0.090 |
| 27 | 2.383 | 0.017 | 0.148 | 1.871 | 0.015 | 0.095 | 1.307 | 0.022 | 0.158 |
| 28 | 1.778 | −0.008 | 0.113 | 2.326 | −0.021 | 0.140 | −0.871 | −0.004 | 0.083 |
| 29 | 1.522 | 0.019 | 0.096 | 2.909 | 0.025 | 0.163 | 0.241 | 0.009 | 0.127 |
| 30 | 1.173 | 0.005 | 0.181 | 1.703 | 0.007 | 0.098 | 0.397 | −0.034 | 0.221 |
indicates the constraints for model identification. RMSE denotes the root mean squared error.
Evaluating the accuracy of the two-dimensional fixed effects and variance-covariance components.
| γ001 | 1.000 | −0.018 | 0.082 | γ002 | −0.350 | −0.027 | 0.169 |
| γ011 | 0.300 | 0.026 | 0.156 | γ012 | 0.300 | −0.019 | 0.096 |
| γ101 | 0.500 | 0.021 | 0.148 | γ102 | 0.500 | 0.022 | 0.147 |
| γ111 | 0.350 | −0.025 | 0.173 | γ112 | −1.000 | 0.014 | 0.121 |
| 0.300 | 0.023 | 0.098 | |||||
| σ | 0.075 | 0.018 | 0.163 | ||||
| σ | 0.075 | 0.018 | 0.163 | ||||
| 0.500 | 0.029 | 0.117 | |||||
| τ001 | 0.100 | 0.015 | 0.164 | τ002 | 0.100 | −0.029 | 0.143 |
| τ011 | 0 | 0.013 | 0.182 | τ012 | 0 | 0.017 | 0.187 |
| τ101 | 0 | 0.013 | 0.182 | τ102 | 0 | 0.017 | 0.187 |
| τ111 | 0.100 | −0.026 | 0.139 | τ112 | 0.100 | 0.019 | 0.167 |
Evaluating the accuracy of the structure parameters in the simulation 2.
| 40 | −0.089 | 0.031 | 0.046 | 0.438 | 0.064 | 0.038 | |
| 1000 | 100 | 0.073 | 0.191 | 0.078 | 0.195 | −0.037 | 0.203 |
| 200 | 0.094 | 0.174 | −0.063 | 0.160 | 0.081 | 0.198 | |
| 40 | 0.056 | 0.206 | 0.117 | 0.319 | 0.105 | 0.207 | |
| 2000 | 100 | 0.028 | 0.167 | 0.064 | 0.177 | −0.069 | 0.189 |
| 200 | −0.041 | 0.152 | −0.037 | 0.154 | 0.021 | 0.156 | |
| 40 | 0.039 | 0.231 | 0.055 | 0.213 | 0.032 | 0.195 | |
| 3000 | 100 | −0.035 | 0.189 | 0.082 | 0.246 | −0.058 | 0.145 |
| 200 | 0.017 | 0.159 | 0.041 | 0.147 | 0.045 | 0.132 | |
The VC stands for the abbreviation of variance-covariance.
Estimated DIC values for the three models fitted to the English test data.
| Model 1 | 134,470 | 1,010,030 | 1,144,500 |
| Model 2 | 79,065 | 891,425 | 970,490 |
| Model 3 | 81,607 | 895,073 | 976,680 |
Parameter estimation of the multilevel multidimensional IRT model for vocabulary cognitive ability.
| γ001 | 0.760 | 0.186 | [0.391, 1.137] |
| γ011( | 0.502 | 0.143 | [0.223, 0.788] |
| γ021( | 0.225 | 0.149 | [−0.068, 0.520] |
| γ101( | 0.642 | 0.128 | [0.390, 0.893] |
| γ201( | 0.339 | 0.160 | [0.025, 0.657] |
| 0.537 | 0.124 | [0.227, 1.200] | |
| 0.004 | 0.126 | [−0.228, 0.241] | |
| −0.006 | 0.164 | [−0.344, 0.383] | |
| 0.247 | 0.134 | [0.112, 0.541] | |
| −0.064 | 0.112 | [−0.292, 0.110] | |
| 0.030 | 0.191 | [0.015, 0.043] | |
ST, teacher satisfaction; CT, climate; SES, socioeconomic-status; GD, gender. EAP denotes the expected a posteriori estimation. SD denotes the standard deviation. HPDI is the 95% highest posterior density interval.
Parameter estimation of the multilevel multidimensional IRT model for table computing ability.
| γ004 | 0.255 | 0.130 | [−0.003, 0.514] |
| γ014( | 0.039 | 0.104 | [−0.165, 0.246] |
| γ024( | 0.295 | 0.101 | [0.099, 0.498] |
| γ104( | 0.596 | 0.126 | [0.351, 0.849] |
| γ204( | −0.266 | 0.120 | [−0.506, -0.026] |
| 0.447 | 0.144 | [0.201, 0.970] | |
| 0.082 | 0.084 | [−0.043, 0.269] | |
| −0.041 | 0.100 | [−0.223, 0.098] | |
| 0.226 | 0.106 | [0.101, 0.485] | |
| −0.014 | 0.069 | [−0.160, 0.114] | |
| 0.022 | 0.102 | [0.015, 0.035] | |
ST, teacher satisfaction; CT, climate; SES, socioeconomic-status; GD, gender. EAP denotes the expected a posteriori estimation. SD denotes the standard deviation. HPDI is the 95% highest posterior density interval.
Figure 1Parameters of estimation a, a, a, and a for subscale 1 (items 1–40), subscale 2 (items 41–64), subscale 3 (items 65–104), and subscale 4 (items 105–124).
Parameter estimation of the multilevel multidimensional IRT model for diagnosing ability of grammar structure.
| γ001 | 0.760 | 0.186 | [0.391, 1.137] |
| γ011( | 0.502 | 0.143 | [0.223, 0.788] |
| γ021( | 0.225 | 0.149 | [−0.068, 0.520] |
| γ101( | 0.642 | 0.128 | [0.390, 0.893] |
| γ201( | 0.339 | 0.160 | [0.025, 0.657] |
| 0.537 | 0.124 | [0.227, 1.200] | |
| 0.004 | 0.126 | [−0.228, 0.241] | |
| −0.006 | 0.164 | [−0.344, 0.383] | |
| 0.247 | 0.134 | [0.112, 0.541] | |
| −0.064 | 0.112 | [−0.292, 0.110] | |
| 0.030 | 0.191 | [0.015, 0.043] | |
ST, teacher satisfaction; CT, climate; SES, socioeconomic-status; GD, gender. EAP denotes the expected a posteriori estimation. SD denotes the standard deviation. HPDI is the 95% highest posterior density interval.
Parameter estimation of the multilevel multidimensional IRT model for reading comprehension ability.
| γ003 | 0.919 | 0.187 | [0.548, 1.293] |
| γ013( | 0.332 | 0.148 | [0.041, 0.624] |
| γ023( | 0.081 | 0.168 | [−0.249, 0.417] |
| γ103( | 0.542 | 0.118 | [0.308, 0.780] |
| γ203( | 0.232 | 0.155 | [−0.070, 0.544] |
| 0.535 | 0.111 | [0.223, 1.220] | |
| 0.040 | 0.198 | [−0.156, 0.275] | |
| −0.024 | 0.153 | [−0.342, 0.264] | |
| 0.207 | 0.133 | [0.091, 0.456] | |
| 0.004 | 0.089 | [−0.170, 0.182] | |
| 0.037 | 0.177 | [0.027, 0.052] | |
ST, teacher satisfaction; CT, climate; SES, socioeconomic-status; GD, gender. EAP denotes the expected a posteriori estimation. SD denotes the standard deviation. HPDI is the 95% highest posterior density interval.