| Literature DB >> 32265810 |
Xiaomin Li1, Wen-Chung Wang2, Qin Xie3.
Abstract
In recent decades, cognitive diagnostic models (CDMs) have been intensively researched and applied to various educational and psychological tests. However, because existing CDMs fail to consider rater effects, the application of CDMs to constructed-response (CR) items that involve human raters is seriously limited. Given the popularity of CR items, it is desirable to develop new CDMs that are capable of describing and estimating rater effects on CR items. In this study, we developed such new CDMs within the frameworks of facets models and hierarchical rater models, using the log-linear cognitive diagnosis model as a template. The parameters of the new models were estimated with the Markov chain Monte Carlo methods implemented in the freeware JAGS. Simulations were conducted to evaluate the parameter recovery of the new models. Results showed that the parameters were recovered fairly well and the more data there were, the better the recovery. Implications and applications of the new models were illustrated with an empirical study that adopted a fine-grained checklist to assess English academic essays.Entities:
Keywords: cognitive diagnostic models; facets models; hierarchical rater models; item response theory; rater effect
Year: 2020 PMID: 32265810 PMCID: PMC7105832 DOI: 10.3389/fpsyg.2020.00525
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Q-matrix for the ten items in the simulations.
| Item | Attribute 1 | Attribute 2 | Attribute 3 | Attribute 4 | Attribute 5 |
| 1 | 1 | 0 | 0 | 0 | 0 |
| 2 | 0 | 1 | 0 | 0 | 0 |
| 3 | 0 | 0 | 1 | 0 | 0 |
| 4 | 0 | 0 | 0 | 1 | 0 |
| 5 | 0 | 0 | 0 | 0 | 1 |
| 6 | 1 | 1 | 0 | 0 | 0 |
| 7 | 0 | 1 | 1 | 0 | 0 |
| 8 | 0 | 0 | 1 | 1 | 0 |
| 9 | 0 | 0 | 0 | 1 | 1 |
| 10 | 1 | 0 | 0 | 0 | 1 |
Generating values, bias, root mean square error (RMSE), and profile recovery rates (%) in simulation study I (Facets-CDM).
| Complete design | Balanced design | Unbalanced design | Random design | ||||||||||||||
| Facets-CDM | Standard CDM | Facets-CDM | Standard CDM | Facets-CDM | Standard CDM | Facets-CDM | Standard CDM | ||||||||||
| Par. | Gen | Bias | RMSE | Bias | RMSE | Bias | RMSE | Bias | RMSE | Bias | RMSE | Bias | RMSE | Bias | RMSE | Bias | RMSE |
| λ1,0 | –2.00 | –0.13 | 0.21 | 0.30 | 0.30 | –0.21 | 0.28 | 0.08 | 0.11 | –0.12 | 0.22 | 0.16 | 0.19 | –0.12 | 0.22 | 0.17 | 0.19 |
| λ2,0 | –1.40 | –0.12 | 0.19 | 0.23 | 0.23 | –0.19 | 0.25 | 0.04 | 0.07 | –0.21 | 0.25 | –0.02 | 0.06 | –0.13 | 0.18 | 0.11 | 0.13 |
| λ3,0 | –1.79 | –0.13 | 0.20 | 0.27 | 0.27 | –0.18 | 0.25 | 0.09 | 0.16 | –0.18 | 0.24 | 0.09 | 0.13 | –0.20 | 0.25 | 0.10 | 0.14 |
| λ4,0 | –1.37 | –0.15 | 0.21 | 0.20 | 0.20 | –0.19 | 0.26 | 0.02 | 0.09 | –0.17 | 0.23 | 0.08 | 0.09 | –0.21 | 0.28 | 0.03 | 0.08 |
| λ5,0 | –1.85 | –0.12 | 0.21 | 0.28 | 0.28 | –0.28 | 0.31 | 0.01 | 0.09 | –0.16 | 0.21 | 0.10 | 0.15 | –0.16 | 0.19 | 0.17 | 0.20 |
| λ6,0 | –2.42 | –0.14 | 0.23 | 0.33 | 0.33 | –0.26 | 0.33 | –0.06 | 0.16 | –0.11 | 0.18 | –0.05 | 0.18 | –0.30 | 0.36 | –0.05 | 0.13 |
| λ7,0 | –1.57 | –0.10 | 0.20 | 0.26 | 0.27 | –0.12 | 0.22 | 0.01 | 0.16 | –0.09 | 0.29 | –0.03 | 0.16 | –0.11 | 0.24 | 0.07 | 0.10 |
| λ8,0 | –1.95 | –0.10 | 0.21 | 0.31 | 0.32 | –0.17 | 0.24 | 0.00 | 0.14 | –0.18 | 0.23 | –0.02 | 0.17 | –0.19 | 0.24 | –0.02 | 0.18 |
| λ9,0 | –2.07 | –0.11 | 0.20 | 0.32 | 0.33 | –0.22 | 0.31 | –0.06 | 0.23 | –0.09 | 0.18 | 0.03 | 0.13 | –0.12 | 0.27 | 0.07 | 0.21 |
| λ10,0 | –1.69 | –0.10 | 0.22 | 0.28 | 0.29 | –0.18 | 0.28 | –0.05 | 0.12 | –0.08 | 0.29 | 0.02 | 0.16 | –0.09 | 0.17 | 0.09 | 0.12 |
| λ1,1 | 3.72 | –0.01 | 0.09 | –0.62 | 0.62 | –0.03 | 0.20 | –0.63 | 0.65 | –0.13 | 0.18 | –0.84 | 0.84 | –0.03 | 0.14 | –0.60 | 0.62 |
| λ2,1 | 2.82 | –0.06 | 0.10 | –0.55 | 0.55 | –0.15 | 0.21 | –0.56 | 0.58 | –0.19 | 0.24 | –0.58 | 0.59 | –0.18 | 0.20 | –0.65 | 0.65 |
| λ3,1 | 3.41 | 0.03 | 0.08 | –0.55 | 0.56 | –0.01 | 0.18 | –0.58 | 0.60 | –0.05 | 0.17 | –0.61 | 0.62 | –0.05 | 0.11 | –0.67 | 0.68 |
| λ4,1 | 2.91 | 0.01 | 0.08 | –0.50 | 0.50 | –0.06 | 0.10 | –0.42 | 0.43 | –0.09 | 0.14 | –0.58 | 0.59 | –0.04 | 0.18 | –0.50 | 0.52 |
| λ5,1 | 3.38 | 0.00 | 0.10 | –0.56 | 0.57 | –0.05 | 0.10 | –0.53 | 0.54 | –0.03 | 0.12 | –0.59 | 0.60 | –0.08 | 0.14 | –0.63 | 0.65 |
| λ6,1 | 1.26 | 0.04 | 0.10 | –0.12 | 0.14 | –0.02 | 0.18 | 0.02 | 0.16 | –0.13 | 0.20 | 0.04 | 0.15 | 0.14 | 0.29 | 0.13 | 0.27 |
| λ7,1 | 1.04 | –0.04 | 0.07 | –0.20 | 0.20 | –0.03 | 0.26 | –0.01 | 0.21 | –0.15 | 0.28 | –0.04 | 0.19 | –0.09 | 0.20 | –0.10 | 0.17 |
| λ8,1 | 1.18 | –0.05 | 0.09 | –0.20 | 0.21 | –0.03 | 0.15 | –0.04 | 0.13 | –0.04 | 0.22 | 0.00 | 0.25 | –0.03 | 0.24 | 0.05 | 0.26 |
| λ9,1 | 1.00 | –0.03 | 0.09 | –0.16 | 0.17 | –0.01 | 0.20 | 0.05 | 0.23 | –0.09 | 0.23 | 0.08 | 0.26 | –0.08 | 0.27 | 0.00 | 0.23 |
| λ10,1 | 0.96 | 0.00 | 0.07 | –0.14 | 0.15 | –0.01 | 0.15 | 0.06 | 0.14 | –0.03 | 0.17 | 0.07 | 0.17 | –0.04 | 0.10 | –0.02 | 0.11 |
| λ6,2 | 2.16 | –0.01 | 0.24 | –0.42 | 0.48 | 0.07 | 0.32 | –0.72 | 0.76 | 0.20 | 0.51 | –0.79 | 0.82 | 0.00 | 0.48 | –0.70 | 0.81 |
| λ7,2 | 2.14 | 0.07 | 0.19 | –0.29 | 0.32 | –0.13 | 0.39 | –0.78 | 0.81 | 0.14 | 0.39 | –0.58 | 0.66 | 0.24 | 0.40 | –0.47 | 0.54 |
| λ8,2 | 1.98 | 0.09 | 0.20 | –0.28 | 0.31 | –0.03 | 0.22 | –0.56 | 0.59 | 0.13 | 0.43 | –0.51 | 0.61 | 0.26 | 0.39 | –0.55 | 0.63 |
| λ9,2 | 2.05 | 0.08 | 0.15 | –0.33 | 0.35 | 0.12 | 0.52 | –0.44 | 0.63 | 0.20 | 0.34 | –0.67 | 0.76 | 0.27 | 0.49 | –0.38 | 0.51 |
| λ10,2 | 2.12 | –0.02 | 0.21 | –0.39 | 0.43 | 0.10 | 0.27 | –0.52 | 0.59 | 0.19 | 0.28 | –0.63 | 0.64 | 0.05 | 0.24 | –0.55 | 0.59 |
| η1 | 0.57 | –0.01 | 0.15 | –0.01 | 0.18 | 0.09 | 0.17 | –0.02 | 0.11 | ||||||||
| η2 | 0.59 | 0.01 | 0.13 | 0.05 | 0.17 | 0.11 | 0.18 | –0.04 | 0.15 | ||||||||
| η3 | 0.70 | 0.04 | 0.16 | 0.04 | 0.17 | 0.12 | 0.18 | –0.02 | 0.13 | ||||||||
| η4 | 1.83 | 0.00 | 0.17 | 0.05 | 0.15 | 0.10 | 0.18 | –0.03 | 0.13 | ||||||||
| η5 | –0.50 | –0.04 | 0.16 | –0.03 | 0.14 | 0.10 | 0.20 | –0.07 | 0.18 | ||||||||
| η6 | –0.56 | 0.03 | 0.16 | –0.06 | 0.17 | 0.08 | 0.17 | –0.09 | 0.15 | ||||||||
| η7 | –0.10 | 0.01 | 0.18 | 0.03 | 0.23 | 0.09 | 0.18 | –0.04 | 0.17 | ||||||||
| η8 | –1.05 | 0.01 | 0.17 | 0.07 | 0.17 | 0.08 | 0.17 | –0.03 | 0.17 | ||||||||
| η9 | 0.55 | 0.04 | 0.15 | 0.04 | 0.13 | 0.10 | 0.19 | –0.04 | 0.14 | ||||||||
| η10 | –2.03 | 0.02 | 0.19 | 0.10 | 0.24 | 0.16 | 0.21 | 0.04 | 0.14 | ||||||||
| Minimum | 96.41 | 94.27 | 68.80 | 59.83 | 67.44 | 58.61 | 68.42 | 62.40 | |||||||||
| Maximum | 99.15 | 97.00 | 75.67 | 67.85 | 73.22 | 65.24 | 77.21 | 69.83 | |||||||||
| Mean | 97.58 | 95.78 | 71.02 | 63.50 | 70.14 | 62.16 | 72.62 | 66.20 | |||||||||
| 0.66 | 0.64 | 2.14 | 2.77 | 1.45 | 1.83 | 2.89 | 2.29 | ||||||||||
Number of ratees under the incomplete designs in simulation study I (Facets-CDM).
| Rater | ||||||||||
| 50 | 50 | 50 | ||||||||
| 50 | 50 | 50 | ||||||||
| 50 | 50 | 50 | ||||||||
| 50 | 50 | 50 | ||||||||
| 50 | 50 | 50 | ||||||||
| 50 | 50 | 50 | ||||||||
| 50 | 50 | 50 | ||||||||
| 50 | 50 | 50 | ||||||||
| 50 | 50 | 50 | ||||||||
| 50 | 50 | 50 | ||||||||
| Total | 150 | 150 | 150 | 150 | 150 | 150 | 150 | 150 | 150 | 150 |
| 50 | 50 | 50 | ||||||||
| 68 | 68 | 68 | ||||||||
| 44 | 44 | 44 | ||||||||
| 58 | 58 | 58 | ||||||||
| 35 | 35 | 35 | ||||||||
| 51 | 51 | 51 | ||||||||
| 50 | 50 | 50 | ||||||||
| 55 | 55 | 55 | ||||||||
| 40 | 40 | 40 | ||||||||
| 49 | 49 | 49 | ||||||||
| Total | 139 | 167 | 162 | 170 | 137 | 144 | 136 | 156 | 145 | 144 |
| Total | 134 | 155 | 157 | 141 | 168 | 158 | 152 | 130 | 153 | 152 |
Q-matrix of the 52 criteria in the empirical example.
| Attribute | Attribute | ||||||||||||
| Item | 1 | 2 | 3 | 4 | 5 | 6 | Item | 1 | 2 | 3 | 4 | 5 | 6 |
| 1 | 1 | 1 | 1 | 1 | 1 | 0 | 27 | 0 | 0 | 1 | 0 | 0 | 0 |
| 2 | 1 | 0 | 0 | 0 | 0 | 0 | 28 | 0 | 0 | 1 | 0 | 0 | 0 |
| 3 | 1 | 0 | 0 | 0 | 0 | 0 | 29 | 0 | 0 | 1 | 0 | 0 | 0 |
| 4 | 0 | 1 | 1 | 1 | 0 | 0 | 30 | 0 | 0 | 1 | 0 | 0 | 0 |
| 5 | 0 | 1 | 1 | 1 | 0 | 0 | 31 | 0 | 0 | 1 | 0 | 0 | 1 |
| 6 | 1 | 0 | 1 | 1 | 1 | 0 | 32 | 0 | 0 | 1 | 0 | 0 | 1 |
| 7 | 1 | 1 | 1 | 1 | 0 | 0 | 33 | 0 | 0 | 1 | 0 | 0 | 0 |
| 8 | 1 | 1 | 0 | 0 | 0 | 0 | 34 | 0 | 0 | 1 | 0 | 0 | 0 |
| 9 | 1 | 1 | 0 | 0 | 0 | 0 | 35 | 0 | 0 | 1 | 0 | 0 | 0 |
| 10 | 1 | 1 | 0 | 0 | 0 | 0 | 36 | 0 | 0 | 1 | 1 | 0 | 0 |
| 11 | 1 | 0 | 0 | 1 | 0 | 0 | 37 | 0 | 0 | 1 | 0 | 0 | 0 |
| 12 | 1 | 1 | 0 | 1 | 0 | 0 | 38 | 0 | 0 | 1 | 0 | 0 | 0 |
| 13 | 1 | 0 | 0 | 0 | 0 | 0 | 39 | 0 | 0 | 1 | 1 | 0 | 0 |
| 14 | 0 | 1 | 0 | 0 | 1 | 0 | 40 | 0 | 0 | 1 | 1 | 0 | 0 |
| 15 | 0 | 1 | 0 | 0 | 0 | 0 | 41 | 0 | 0 | 0 | 1 | 0 | 0 |
| 16 | 0 | 1 | 0 | 0 | 0 | 0 | 42 | 0 | 0 | 0 | 1 | 0 | 0 |
| 17 | 0 | 1 | 0 | 0 | 0 | 0 | 43 | 0 | 0 | 0 | 1 | 0 | 0 |
| 18 | 0 | 1 | 0 | 0 | 0 | 0 | 44 | 0 | 0 | 0 | 1 | 0 | 0 |
| 19 | 0 | 1 | 0 | 0 | 0 | 0 | 45 | 0 | 0 | 1 | 1 | 0 | 0 |
| 20 | 0 | 1 | 0 | 0 | 0 | 0 | 46 | 0 | 0 | 0 | 1 | 0 | 1 |
| 21 | 0 | 1 | 1 | 1 | 0 | 0 | 47 | 0 | 0 | 0 | 1 | 0 | 1 |
| 22 | 0 | 1 | 1 | 1 | 0 | 0 | 48 | 0 | 0 | 1 | 0 | 0 | 1 |
| 23 | 0 | 1 | 1 | 1 | 0 | 0 | 49 | 0 | 0 | 0 | 0 | 0 | 1 |
| 24 | 0 | 1 | 1 | 1 | 0 | 0 | 50 | 0 | 0 | 0 | 0 | 0 | 1 |
| 25 | 0 | 1 | 0 | 0 | 0 | 0 | 51 | 0 | 0 | 1 | 1 | 1 | 0 |
| 26 | 0 | 0 | 1 | 0 | 0 | 0 | 52 | 0 | 0 | 1 | 1 | 1 | 0 |
Means and standard deviations for raters’ scorings across all indicators in the empirical example.
| Rater | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
| Mean | 0.41 | 0.74 | 0.68 | 0.68 | 0.57 | 0.58 | 0.55 | 0.84 | 0.59 |
| 0.28 | 0.26 | 0.29 | 0.26 | 0.28 | 0.31 | 0.27 | 0.18 | 0.33 |
Model fit statistics of the three models in the empirical example.
| Model | AIC | BIC | |
| DINA | 0.36 | 17004 | 17625 |
| Facets DINA | 0.44 | 16690 | 17331 |
| HRM DINA | 0.56 | 10490 | 11167 |
Estimates for the guessing and slip parameters yielded by the three models in the empirical example.
| Item | Guessing | Slip | ||||
| DINA | Facets-DINA | HRM-DINA | DINA | Facets-DINA | HRM-DINA | |
| 1 | 0.49 | 0.49 | 0.48 | 0.03 | 0.02 | 0.00 |
| 2 | 0.47 | 0.47 | 0.42 | 0.02 | 0.03 | 0.00 |
| 3 | 0.49 | 0.49 | 0.49 | 0.09 | 0.08 | 0.12 |
| 4 | 0.46 | 0.46 | 0.46 | 0.05 | 0.06 | 0.00 |
| 5 | 0.24 | 0.30 | 0.12 | 0.34 | 0.36 | 0.43 |
| 6 | 0.50 | 0.49 | 0.49 | 0.05 | 0.00 | 0.00 |
| 7 | 0.16 | 0.17 | 0.00 | 0.52 | 0.50 | 0.70 |
| 8 | 0.09 | 0.11 | 0.00 | 0.18 | 0.20 | 0.23 |
| 9 | 0.38 | 0.41 | 0.28 | 0.01 | 0.04 | 0.00 |
| 10 | 0.31 | 0.31 | 0.14 | 0.23 | 0.21 | 0.30 |
| 11 | 0.18 | 0.17 | 0.01 | 0.26 | 0.27 | 0.31 |
| 12 | 0.46 | 0.46 | 0.38 | 0.07 | 0.07 | 0.01 |
| 13 | 0.48 | 0.48 | 0.43 | 0.08 | 0.08 | 0.06 |
| 14 | 0.49 | 0.49 | 0.49 | 0.05 | 0.00 | 0.00 |
| 15 | 0.15 | 0.14 | 0.00 | 0.45 | 0.44 | 0.56 |
| 16 | 0.48 | 0.48 | 0.48 | 0.08 | 0.08 | 0.06 |
| 17 | 0.47 | 0.47 | 0.46 | 0.15 | 0.15 | 0.17 |
| 18 | 0.26 | 0.41 | 0.23 | 0.23 | 0.25 | 0.31 |
| 19 | 0.46 | 0.45 | 0.44 | 0.15 | 0.13 | 0.16 |
| 20 | 0.38 | 0.24 | 0.31 | 0.10 | 0.07 | 0.07 |
| 21 | 0.25 | 0.16 | 0.09 | 0.31 | 0.22 | 0.13 |
| 22 | 0.49 | 0.48 | 0.48 | 0.12 | 0.06 | 0.01 |
| 23 | 0.50 | 0.49 | 0.49 | 0.01 | 0.01 | 0.00 |
| 24 | 0.48 | 0.48 | 0.48 | 0.11 | 0.13 | 0.06 |
| 25 | 0.45 | 0.46 | 0.45 | 0.07 | 0.08 | 0.05 |
| 26 | 0.07 | 0.12 | 0.00 | 0.79 | 0.81 | 1.00 |
| 27 | 0.31 | 0.33 | 0.02 | 0.66 | 0.67 | 0.96 |
| 28 | 0.40 | 0.47 | 0.40 | 0.15 | 0.18 | 0.19 |
| 29 | 0.47 | 0.47 | 0.46 | 0.04 | 0.05 | 0.03 |
| 30 | 0.44 | 0.37 | 0.29 | 0.31 | 0.26 | 0.39 |
| 31 | 0.39 | 0.36 | 0.19 | 0.45 | 0.42 | 0.59 |
| 32 | 0.30 | 0.42 | 0.25 | 0.24 | 0.26 | 0.31 |
| 33 | 0.18 | 0.10 | 0.02 | 0.37 | 0.32 | 0.41 |
| 34 | 0.08 | 0.15 | 0.01 | 0.54 | 0.56 | 0.77 |
| 35 | 0.42 | 0.47 | 0.33 | 0.21 | 0.24 | 0.19 |
| 36 | 0.23 | 0.34 | 0.06 | 0.25 | 0.26 | 0.27 |
| 37 | 0.45 | 0.47 | 0.42 | 0.11 | 0.12 | 0.09 |
| 38 | 0.13 | 0.17 | 0.01 | 0.56 | 0.59 | 0.70 |
| 39 | 0.48 | 0.48 | 0.48 | 0.05 | 0.04 | 0.00 |
| 40 | 0.00 | 0.01 | 0.00 | 0.93 | 0.95 | 1.00 |
| 41 | 0.01 | 0.07 | 0.00 | 0.54 | 0.58 | 0.88 |
| 42 | 0.37 | 0.35 | 0.39 | 0.26 | 0.24 | 0.41 |
| 43 | 0.23 | 0.33 | 0.12 | 0.28 | 0.30 | 0.34 |
| 44 | 0.21 | 0.33 | 0.05 | 0.33 | 0.38 | 0.49 |
| 45 | 0.19 | 0.21 | 0.02 | 0.07 | 0.18 | 0.02 |
| 46 | 0.28 | 0.27 | 0.09 | 0.02 | 0.11 | 0.00 |
| 47 | 0.49 | 0.49 | 0.48 | 0.04 | 0.07 | 0.01 |
| 48 | 0.49 | 0.49 | 0.49 | 0.01 | 0.02 | 0.00 |
| 49 | 0.49 | 0.49 | 0.49 | 0.00 | 0.02 | 0.00 |
| 50 | 0.04 | 0.24 | 0.00 | 0.43 | 0.69 | 0.95 |
| 51 | 0.33 | 0.47 | 0.24 | 0.13 | 0.31 | 0.37 |
| 52 | 0.03 | 0.30 | 0.00 | 0.40 | 0.66 | 0.92 |
Rater severity and variability yielded from the HRM DINA model in the empirical example.
| Rater | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
| Severity | 0.40 | 0.02 | 0.01 | 0.02 | 0.01 | 0.02 | 0.24 | 0.04 | 0.02 |
| SE | 0.02 | 0.05 | 0.06 | 0.06 | 0.05 | 0.06 | 0.00 | 0.07 | 0.02 |
| Variability | 0.37 | 0.84 | 0.69 | 0.70 | 0.53 | 0.52 | 0.76 | 1.28 | 0.49 |
| SE | 0.01 | 0.05 | 0.04 | 0.04 | 0.03 | 0.03 | 0.04 | 0.10 | 0.02 |
Fair scores and observed scores for selected cases in the real data.
| Student index | Estimated profile | Rater | Observed scores | Fair scores | Difference |
| 21 | 1,1,1,1,1,0 | 1 | 23 | 40 | −17 |
| 23 | 0,1,1,1,1,1 | 1 | 13 | 36 | −23 |
| 30 | 0,0,0,1,0,0 | 1 | 15 | 22 | −7 |
| 69 | 1,0,1,1,0,1 | 8 | 44 | 38 | 6 |
| 230 | 1,1,1,1,0,1 | 8 | 42 | 33 | 9 |