| Literature DB >> 29867664 |
Dubravka Svetina1, Yanan Feng1, Justin Paulsen1, Montserrat Valdivia1, Arturo Valdivia2, Shenghai Dai3.
Abstract
The rise in popularity and use of cognitive diagnostic models (CDMs) in educational research are partly motivated by the models' ability to provide diagnostic information regarding students' strengths and weaknesses in a variety of content areas. An important step to ensure appropriate interpretations from CDMs is to investigate differential item functioning (DIF). To this end, the current simulation study examined the performance of three methods to detect DIF in CDMs, with particular emphasis on the impact of Q-matrix misspecification on methods' performance. Results illustrated that logistic regression and Mantel-Haenszel had better control of Type I error than the Wald test; however, high power rates were found using logistic regression and Wald methods, only. In addition to the tradeoff between Type I error control and acceptable power, our results suggested that Q-matrix complexity and item structures yield different results for different methods, presenting a more complex picture of the methods' performance. Finally, implications and future directions are discussed.Entities:
Keywords: Q-matrix misspecification; cognitive diagnostic models; differential item functioning; test bias; validity
Year: 2018 PMID: 29867664 PMCID: PMC5958216 DOI: 10.3389/fpsyg.2018.00696
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Sample Q-matrix for five attributes across four items.
| Attributes/skills (K) | ||||||
|---|---|---|---|---|---|---|
| Skill 1 | Skill 2 | Skill 3 | Skill 4 | Skill 5 | ||
| Items | Item 1 | 0 | 0 | 0 | 1 | 0 |
| Item 2 | 1 | 0 | 1 | 0 | 0 | |
| Item 3 | 0 | 1 | 1 | 1 | 0 | |
| Item 4 | 0 | 0 | 1 | 0 | 1 | |
Summary of DIF conditions∗.
| DIF type | DIF size | Δ | Δs |
|---|---|---|---|
| Uniform DIF | Moderate | +0.075 | +0.075 |
| –0.075 | –0.075 | ||
| Large | +0.10 | +0.10 | |
| –0.10 | –0.10 | ||
| Nonuniform DIF | Moderate | +0.075 | –0.075 |
| +0.075 | 0 | ||
| –0.075 | +0.075 | ||
| –0.075 | 0 | ||
| 0 | –0.075 | ||
| 0 | +0.075 | ||
| Large | +0.10 | –0.10 | |
| +0.10 | 0 | ||
| –0.10 | +0.10 | ||
| –0.10 | 0 | ||
| 0 | –0.10 | ||
| 0 | +0.10 |
Average Type I error rates for conditions with attribute correlation of 0.50 and large DIF.
| Group misspecification | Position misspecification | Mantel–Haenszel | Logistic | Wald | |||
|---|---|---|---|---|---|---|---|
| 2 or Fewer K | 3 or More K | 2 or Fewer K | 3 or More K | 2 or Fewer K | 3 or More K | ||
| Neither | 0.05 | 0.04 | 0.06 | 0.04 | 0.14 | 0.12 | |
| Both groups | At random | 0.05 | 0.05 | 0.08 | 0.05 | 0.16 | 0.11 |
| 2 or Fewer | 0.05 | 0.04 | 0.07 | 0.05 | 0.12 | 0.09 | |
| 3 or More | 0.06 | 0.05 | 0.07 | 0.05 | 0.12 | 0.10 | |
| Focal only | At random | 0.40 | 0.19 | 0.40 | 0.19 | 0.47 | 0.28 |
| 2 or Fewer | 0.46 | 0.05 | 0.45 | 0.06 | 0.59 | 0.14 | |
| 3 or More | 0.06 | 0.37 | 0.07 | 0.37 | 0.17 | 0.48 | |
| Neither | 0.05 | 0.04 | 0.07 | 0.05 | 0.14 | 0.11 | |
| Both groups | At random | 0.05 | 0.04 | 0.07 | 0.05 | 0.12 | 0.11 |
| 2 or Fewer | 0.05 | 0.04 | 0.06 | 0.05 | 0.09 | 0.09 | |
| 3 or More | 0.05 | 0.05 | 0.06 | 0.05 | 0.10 | 0.09 | |
| Focal only | At random | 0.40 | 0.19 | 0.41 | 0.19 | 0.45 | 0.26 |
| 2 or Fewer | 0.44 | 0.05 | 0.46 | 0.06 | 0.57 | 0.14 | |
| 3 or More | 0.05 | 0.37 | 0.08 | 0.37 | 0.16 | 0.47 | |
Average power rates for conditions with attribute correlation of 0.50 and large DIF.
| Group misspecification | Position misspecification | Mantel–Haenszel | Logistic | Wald | |||
|---|---|---|---|---|---|---|---|
| 2 or Fewer K | 3 or More K | 2 or Fewer K | 3 or More K | 2 or Fewer K | 3 or More K | ||
| Neither | 0.44 | 0.52 | 0.80 | 0.93 | 0.96 | 0.89 | |
| Both groups | At random | 0.71 | 0.52 | 0.90 | 0.93 | 0.87 | 0.74 |
| 2 or Fewer | 0.65 | 0.53 | 0.84 | 0.92 | 0.86 | 0.70 | |
| 3 or More | 0.45 | 0.10 | 0.81 | 0.87 | 0.96 | 0.81 | |
| Focal only | At random | 0.74 | 0.47 | 0.86 | 0.86 | 0.99 | 0.80 |
| 2 or Fewer | 0.66 | 0.43 | 0.81 | 0.87 | 0.96 | 0.75 | |
| 3 or More | 0.42 | 0.69 | 0.78 | 0.95 | 0.95 | 0.96 | |
| Neither | 0.71 | 0.71 | 0.75 | 0.78 | 0.79 | 0.69 | |
| Both groups | At random | 0.70 | 0.73 | 0.75 | 0.79 | 0.70 | 0.60 |
| 2 or Fewer | 0.68 | 0.73 | 0.74 | 0.82 | 0.70 | 0.53 | |
| 3 or More | 0.69 | 0.72 | 0.73 | 0.82 | 0.80 | 0.64 | |
| Focal only | At random | 0.78 | 0.69 | 0.82 | 0.74 | 0.89 | 0.68 |
| 2 or Fewer | 0.79 | 0.71 | 0.82 | 0.81 | 0.84 | 0.62 | |
| 3 or More | 0.70 | 0.74 | 0.75 | 0.80 | 0.80 | 0.91 | |