Yoon Soo Park, Young-Sun Lee, Kuan Xing.
Abstract
This study investigates the impact of item parameter drift (IPD) on parameter and ability estimation when the underlying measurement model fits a mixture distribution, thereby violating the item invariance property of unidimensional item response theory (IRT) models. An empirical study was conducted to demonstrate the occurrence of both IPD and an underlying mixture distribution using real-world data. Twenty-one trended anchor items from the 1999, 2003, and 2007 administrations of the Trends in International Mathematics and Science Study (TIMSS) were analyzed using unidimensional and mixture IRT models. TIMSS treats trended anchor items as invariant over testing administrations and uses pre-calibrated item parameters based on unidimensional IRT. However, empirical results showed evidence of two latent subgroups with IPD. Results also showed changes in the distribution of examinee ability between latent classes over the three administrations. A simulation study was conducted to examine the impact of IPD on the estimation of ability and item parameters when data have underlying mixture distributions. Simulations used data generated from a mixture IRT model and estimated using unidimensional IRT. Results showed that data reflecting IPD under a mixture IRT model led to IPD in the unidimensional IRT model. Changes in the distribution of examinee ability also affected item parameters. Moreover, drift in item discrimination and in the distribution of examinee ability affected estimates of examinee ability. These findings demonstrate the need for caution, and for evaluating IPD within a mixture IRT framework, to understand its effects on item parameters and examinee ability.
Keywords: TIMSS; differential item functioning; item parameter drift; item response theory; mixture IRT
Year: 2016 PMID: 26941699 PMCID: PMC4764735 DOI: 10.3389/fpsyg.2016.00255
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
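Item parameter estimates of the 21 trended anchor items in 1999, 2003, and 2007 TIMSS: Unidimensional 3PL IRT model.
| Item | Discrimination (a) 1999 | a 2003 | a 2007 | Difficulty (b) 1999 | b 2003 | b 2007 | Guessing (c) 1999 | c 2003 | c 2007 |
|---|---|---|---|---|---|---|---|---|---|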
| 1 | 0.65 (0.01) | 0.47 (0.01) | 0.62 (0.01) | −1.35 (0.04) | −2.52 (0.05) | −2.03 (0.04) | 0.19 (0.00) | 0.22 (0.01) | 0.22 (0.01) |
| 2 | 1.26 (0.02) | 0.92 (0.02) | 0.98 (0.02) | −0.61 (0.02) | −0.81 (0.02) | −0.95 (0.02) | 0.15 (0.00) | 0.16 (0.00) | 0.16 (0.00) |
| 3 | 0.61 (0.01) | 0.49 (0.01) | 0.52 (0.01) | −0.58 (0.03) | −0.31 (0.04) | −0.26 (0.04) | 0.19 (0.00) | 0.26 (0.01) | 0.21 (0.01) |
| 4 | 0.91 (0.02) | 0.90 (0.02) | 1.00 (0.02) | 1.56 (0.03) | 1.51 (0.03) | 1.25 (0.02) | 0.14 (0.00) | 0.18 (0.00) | 0.19 (0.00) |
| 5 | 2.13 (0.04) | 1.46 (0.02) | 2.25 (0.03) | 1.18 (0.02) | 1.19 (0.02) | 1.17 (0.02) | 0.03 (0.00) | 0.03 (0.00) | 0.04 (0.00) |
| 6 | 0.53 (0.01) | 0.42 (0.01) | 0.59 (0.01) | −1.65 (0.05) | −1.15 (0.10) | −0.68 (0.05) | 0.22 (0.01) | 0.29 (0.01) | 0.26 (0.01) |
| 7 | 1.27 (0.02) | 0.93 (0.02) | 1.67 (0.03) | 2.26 (0.05) | 2.68 (0.04) | 2.10 (0.03) | 0.09 (0.00) | 0.09 (0.00) | 0.11 (0.00) |
| 8 | 1.78 (0.03) | 1.35 (0.02) | 1.74 (0.03) | 0.16 (0.01) | 0.26 (0.01) | 0.44 (0.01) | 0.08 (0.00) | 0.09 (0.00) | 0.09 (0.00) |
| 9 | 1.60 (0.03) | 1.09 (0.02) | 1.94 (0.04) | −0.98 (0.02) | −1.54 (0.03) | −0.87 (0.02) | 0.20 (0.00) | 0.15 (0.00) | 0.18 (0.00) |
| 10 | 0.93 (0.02) | 0.62 (0.01) | 0.77 (0.01) | −1.93 (0.04) | −2.52 (0.05) | −2.29 (0.04) | 0.24 (0.01) | 0.19 (0.00) | 0.23 (0.00) |
| 11 | 1.48 (0.03) | 1.01 (0.02) | 1.44 (0.03) | −0.97 (0.02) | −1.68 (0.03) | −1.08 (0.02) | 0.21 (0.00) | 0.16 (0.00) | 0.19 (0.00) |
| 12 | 0.78 (0.01) | 0.64 (0.01) | 1.21 (0.04) | 1.64 (0.03) | 2.12 (0.04) | 1.52 (0.03) | 0.10 (0.00) | 0.15 (0.00) | 0.19 (0.00) |
| 13 | 1.11 (0.02) | 0.99 (0.02) | 1.37 (0.04) | 1.33 (0.03) | 1.51 (0.02) | 1.18 (0.02) | 0.10 (0.00) | 0.07 (0.00) | 0.13 (0.00) |
| 14 | 0.96 (0.02) | 0.80 (0.01) | 0.75 (0.02) | 1.18 (0.03) | 0.95 (0.02) | 0.62 (0.03) | 0.26 (0.00) | 0.27 (0.00) | 0.23 (0.01) |
| 15 | 0.74 (0.01) | 0.66 (0.01) | 0.42 (0.01) | 1.07 (0.03) | 1.16 (0.02) | −2.93 (0.06) | 0.15 (0.00) | 0.14 (0.00) | 0.26 (0.00) |
| 16 | 1.84 (0.04) | 0.94 (0.01) | 1.14 (0.02) | −1.12 (0.03) | −2.33 (0.04) | −2.30 (0.04) | 0.30 (0.01) | 0.22 (0.00) | 0.23 (0.00) |
| 17 | 1.18 (0.02) | 1.04 (0.02) | 1.02 (0.02) | 1.97 (0.04) | 1.80 (0.03) | 2.08 (0.03) | 0.02 (0.00) | 0.02 (0.00) | 0.04 (0.00) |
| 18 | 1.72 (0.03) | 1.40 (0.02) | 1.77 (0.03) | 1.69 (0.03) | 1.92 (0.03) | 1.52 (0.02) | 0.01 (0.00) | 0.02 (0.00) | 0.03 (0.00) |
| 19 | 2.06 (0.04) | 1.49 (0.02) | 1.93 (0.03) | 1.87 (0.04) | 2.18 (0.04) | 1.82 (0.03) | 0.01 (0.00) | 0.01 (0.00) | 0.02 (0.00) |
| 20 | 1.24 (0.02) | 1.18 (0.02) | 1.81 (0.03) | 1.62 (0.03) | 1.77 (0.03) | 1.42 (0.02) | 0.06 (0.00) | 0.05 (0.00) | 0.07 (0.00) |
| 21 | 2.77 (0.05) | 1.39 (0.02) | 1.94 (0.03) | 0.75 (0.02) | 0.43 (0.01) | 0.30 (0.01) | 0.26 (0.00) | 0.20 (0.00) | 0.11 (0.00) |
Estimated using MCMC with 10,000 samples and a burn-in of 4,000 samples; values in parentheses are MC errors.
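To make the table easier to read, the following is a minimal sketch of the standard 3PL item response function evaluated at the Item 1 estimates. It assumes the nine value columns are a, b, and c, each for 1999, 2003, and 2007, and it assumes a logistic metric with scaling constant `D = 1.0`; the source does not state the metric, and the function name `p_3pl` is illustrative only.

```python
import math

def p_3pl(theta, a, b, c, D=1.0):
    """Probability of a correct response under the 3PL model.

    D is a scaling constant. D=1.0 is an assumption here, because the
    source does not say which metric the estimates are reported on.
    """
    return c + (1.0 - c) / (1.0 + math.exp(-D * a * (theta - b)))

# Item 1 estimates (a, b, c) as read from the table, by administration.
item1 = {
    1999: (0.65, -1.35, 0.19),
    2003: (0.47, -2.52, 0.22),
    2007: (0.62, -2.03, 0.22),
}

# Probability of a correct response for an examinee at theta = 0.
for year, (a, b, c) in item1.items():
    print(year, round(p_3pl(0.0, a, b, c), 3))
```

At θ = b the function returns c + (1 − c)/2, which is a quick sanity check on any implementation.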
Item parameter estimates of the 21 trended anchor items in 1999, 2003, and 2007 TIMSS: Two-class mixture 3PL IRT model.
| Class | Item | Discrimination (a) 1999 | a 2003 | a 2007 | Difficulty (b) 1999 | b 2003 | b 2007 | Guessing (c) 1999 | c 2003 | c 2007 |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | 1.44 (0.05) | 0.46 (0.01) | 0.80 (0.07) | −0.43 (0.13) | −3.16 (0.09) | −2.55 (0.12) | 0.23 (0.00) | 0.26 (0.00) | 0.26 (0.00) |
| 1 | 2 | 1.68 (0.04) | 1.61 (0.11) | 0.30 (0.01) | −1.08 (0.13) | −2.48 (0.12) | −1.37 (0.12) | 0.23 (0.00) | 0.24 (0.00) | 0.27 (0.00) |
| 1 | 3 | 1.39 (0.06) | 0.42 (0.02) | 1.61 (0.06) | −0.71 (0.12) | −2.15 (0.10) | −0.45 (0.06) | 0.24 (0.00) | 0.27 (0.01) | 0.24 (0.00) |
| 1 | 4 | 1.24 (0.05) | 2.01 (0.10) | 1.26 (0.05) | −0.25 (0.10) | −0.06 (0.02) | −0.49 (0.07) | 0.24 (0.00) | 0.19 (0.01) | 0.25 (0.00) |
| 1 | 5 | 1.30 (0.04) | 1.58 (0.07) | 0.73 (0.05) | 0.08 (0.11) | 0.50 (0.07) | −0.82 (0.10) | 0.24 (0.00) | 0.22 (0.01) | 0.26 (0.00) |
| 1 | 6 | 1.38 (0.05) | 1.40 (0.07) | 2.37 (0.06) | −0.42 (0.13) | −0.47 (0.04) | −1.43 (0.06) | 0.23 (0.00) | 0.25 (0.01) | 0.23 (0.00) |
| 1 | 7 | 1.57 (0.04) | 1.84 (0.07) | 1.59 (0.06) | 0.75 (0.11) | 2.32 (0.09) | 0.33 (0.07) | 0.22 (0.00) | 0.19 (0.00) | 0.22 (0.00) |
| 1 | 8 | 1.53 (0.06) | 1.31 (0.06) | 1.36 (0.06) | −1.03 (0.18) | −1.44 (0.08) | −1.48 (0.09) | 0.24 (0.00) | 0.24 (0.00) | 0.24 (0.00) |
| 1 | 9 | 1.71 (0.04) | 0.99 (0.04) | 1.57 (0.06) | −1.96 (0.11) | −2.62 (0.11) | −0.74 (0.10) | 0.24 (0.00) | 0.25 (0.00) | 0.23 (0.00) |
| 1 | 10 | 1.78 (0.04) | 0.78 (0.05) | 1.12 (0.07) | −0.51 (0.15) | −2.48 (0.12) | −1.13 (0.10) | 0.23 (0.00) | 0.25 (0.01) | 0.25 (0.00) |
| 1 | 11 | 1.68 (0.06) | 1.91 (0.12) | 1.76 (0.07) | −0.15 (0.18) | −2.62 (0.10) | −1.48 (0.11) | 0.22 (0.00) | 0.23 (0.00) | 0.24 (0.00) |
| 1 | 12 | 1.47 (0.05) | 1.85 (0.07) | 1.56 (0.05) | 0.98 (0.11) | 1.77 (0.10) | 1.45 (0.10) | 0.22 (0.00) | 0.24 (0.01) | 0.22 (0.00) |
| 1 | 13 | 1.34 (0.05) | 1.22 (0.06) | 1.82 (0.06) | −0.09 (0.11) | 1.42 (0.12) | 0.72 (0.11) | 0.24 (0.00) | 0.21 (0.01) | 0.20 (0.00) |
| 1 | 14 | 1.43 (0.04) | 1.25 (0.07) | 1.28 (0.06) | −0.43 (0.10) | −0.12 (0.07) | −0.37 (0.10) | 0.24 (0.00) | 0.23 (0.01) | 0.24 (0.00) |
| 1 | 15 | 1.15 (0.04) | 0.52 (0.04) | 1.17 (0.06) | −0.48 (0.08) | −0.09 (0.10) | −1.37 (0.10) | 0.24 (0.00) | 0.24 (0.01) | 0.25 (0.00) |
| 1 | 16 | 1.97 (0.05) | 2.03 (0.12) | 1.44 (0.06) | −0.74 (0.12) | −1.67 (0.10) | −0.74 (0.15) | 0.23 (0.00) | 0.25 (0.01) | 0.24 (0.00) |
| 1 | 17 | 1.65 (0.04) | 0.79 (0.06) | 1.79 (0.04) | 1.27 (0.11) | 0.25 (0.07) | −0.46 (0.06) | 0.22 (0.00) | 0.22 (0.01) | 0.21 (0.00) |
| 1 | 18 | 1.59 (0.05) | 1.38 (0.07) | 1.17 (0.06) | 0.97 (0.11) | 0.78 (0.06) | 0.30 (0.10) | 0.21 (0.00) | 0.23 (0.01) | 0.25 (0.00) |
| 1 | 19 | 1.50 (0.07) | 2.02 (0.07) | 1.12 (0.08) | −0.05 (0.19) | 0.67 (0.04) | 1.12 (0.09) | 0.23 (0.00) | 0.19 (0.00) | 0.25 (0.00) |
| 1 | 20 | 1.32 (0.04) | 2.06 (0.09) | 2.13 (0.05) | 0.37 (0.13) | 0.92 (0.05) | −0.38 (0.07) | 0.24 (0.00) | 0.21 (0.01) | 0.22 (0.00) |
| 1 | 21 | 1.68 (0.05) | 1.44 (0.06) | 1.26 (0.05) | −0.92 (0.16) | −0.91 (0.05) | −0.84 (0.07) | 0.23 (0.00) | 0.24 (0.01) | 0.24 (0.00) |
| 2 | 1 | 0.67 (0.01) | 1.10 (0.03) | 0.71 (0.02) | −1.29 (0.03) | −1.06 (0.03) | −1.72 (0.04) | 0.19 (0.00) | 0.21 (0.01) | 0.23 (0.00) |
| 2 | 2 | 1.37 (0.05) | 1.46 (0.02) | 1.60 (0.05) | −0.54 (0.02) | −0.20 (0.02) | −0.65 (0.02) | 0.16 (0.00) | 0.16 (0.00) | 0.20 (0.00) |
| 2 | 3 | 0.66 (0.02) | 0.97 (0.02) | 0.55 (0.01) | −0.55 (0.03) | 0.14 (0.04) | −0.21 (0.03) | 0.19 (0.00) | 0.29 (0.01) | 0.20 (0.00) |
| 2 | 4 | 0.95 (0.02) | 1.41 (0.03) | 0.92 (0.02) | 1.67 (0.02) | 1.15 (0.03) | 1.55 (0.03) | 0.15 (0.00) | 0.17 (0.00) | 0.19 (0.00) |
| 2 | 5 | 2.27 (0.06) | 3.52 (0.06) | 2.64 (0.05) | 1.22 (0.01) | 0.77 (0.01) | 1.39 (0.01) | 0.03 (0.00) | 0.03 (0.00) | 0.05 (0.00) |
| 2 | 6 | 0.58 (0.02) | 0.86 (0.02) | 0.47 (0.01) | −1.58 (0.05) | −0.76 (0.03) | −0.74 (0.05) | 0.22 (0.01) | 0.22 (0.01) | 0.24 (0.01) |
| 2 | 7 | 1.35 (0.04) | 2.80 (0.08) | 1.80 (0.05) | 2.30 (0.03) | 1.25 (0.01) | 2.54 (0.03) | 0.10 (0.00) | 0.10 (0.00) | 0.12 (0.00) |
| 2 | 8 | 1.79 (0.02) | 2.39 (0.03) | 1.80 (0.04) | 0.24 (0.02) | 0.42 (0.01) | 0.64 (0.01) | 0.08 (0.00) | 0.10 (0.00) | 0.10 (0.00) |
| 2 | 9 | 1.59 (0.03) | 2.51 (0.06) | 2.14 (0.06) | −0.97 (0.02) | −0.56 (0.02) | −0.84 (0.02) | 0.19 (0.00) | 0.19 (0.01) | 0.18 (0.00) |
| 2 | 10 | 0.93 (0.02) | 1.44 (0.03) | 0.86 (0.02) | −1.93 (0.04) | −1.05 (0.03) | −2.18 (0.04) | 0.23 (0.00) | 0.22 (0.01) | 0.24 (0.00) |
| 2 | 11 | 1.54 (0.03) | 1.74 (0.02) | 1.69 (0.06) | −0.92 (0.03) | −0.71 (0.02) | −0.96 (0.03) | 0.22 (0.01) | 0.17 (0.01) | 0.21 (0.01) |
| 2 | 12 | 0.82 (0.02) | 1.58 (0.05) | 1.37 (0.05) | 1.70 (0.03) | 0.96 (0.02) | 1.44 (0.03) | 0.11 (0.00) | 0.15 (0.00) | 0.19 (0.00) |
| 2 | 13 | 1.14 (0.03) | 2.54 (0.04) | 1.46 (0.04) | 1.36 (0.02) | 0.82 (0.01) | 1.16 (0.02) | 0.09 (0.00) | 0.08 (0.00) | 0.12 (0.00) |
| 2 | 14 | 0.97 (0.03) | 1.61 (0.04) | 0.78 (0.03) | 1.20 (0.02) | 0.60 (0.02) | 0.55 (0.03) | 0.25 (0.00) | 0.27 (0.00) | 0.21 (0.00) |
| 2 | 15 | 0.76 (0.02) | 1.47 (0.04) | 0.46 (0.01) | 1.14 (0.02) | 0.85 (0.02) | −2.83 (0.07) | 0.15 (0.00) | 0.17 (0.00) | 0.27 (0.00) |
| 2 | 16 | 1.83 (0.06) | 2.23 (0.05) | 1.49 (0.04) | −1.14 (0.04) | −1.00 (0.02) | −2.18 (0.04) | 0.29 (0.01) | 0.25 (0.01) | 0.24 (0.01) |
| 2 | 17 | 1.23 (0.03) | 2.03 (0.03) | 1.24 (0.04) | 1.99 (0.03) | 1.35 (0.02) | 1.98 (0.04) | 0.02 (0.00) | 0.03 (0.00) | 0.04 (0.00) |
| 2 | 18 | 1.89 (0.06) | 3.15 (0.07) | 1.95 (0.05) | 1.70 (0.02) | 1.31 (0.02) | 1.73 (0.02) | 0.01 (0.00) | 0.02 (0.00) | 0.03 (0.00) |
| 2 | 19 | 2.26 (0.07) | 3.31 (0.06) | 2.31 (0.06) | 1.88 (0.02) | 1.49 (0.02) | 2.01 (0.02) | 0.01 (0.00) | 0.01 (0.00) | 0.02 (0.00) |
| 2 | 20 | 1.30 (0.03) | 2.91 (0.07) | 1.77 (0.04) | 1.65 (0.02) | 1.02 (0.01) | 1.74 (0.02) | 0.06 (0.00) | 0.06 (0.00) | 0.07 (0.00) |
| 2 | 21 | 2.92 (0.05) | 2.99 (0.06) | 2.56 (0.06) | 0.85 (0.02) | 0.46 (0.01) | 0.43 (0.01) | 0.26 (0.00) | 0.22 (0.00) | 0.13 (0.00) |
Estimated using MCMC with 10,000 samples and a burn-in of 4,000 samples; values in parentheses are MC errors.
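As an illustration of how class-specific item parameters combine, the sketch below computes a class-weighted correct-response probability for Item 1 in 1999. It is a simplification under stated assumptions: the class weights 0.04 and 0.96 are taken from the 1999 class sizes reported in this document, the same θ is plugged into both classes (a full mixture IRT model lets each class have its own ability distribution), and the names `p_3pl` and `p_mixture` are illustrative.

```python
import math

def p_3pl(theta, a, b, c):
    """3PL correct-response probability on the logistic metric (assumed)."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

def p_mixture(theta, class_params, class_weights):
    """Class-weighted probability of a correct response at a fixed theta."""
    return sum(w * p_3pl(theta, *params)
               for params, w in zip(class_params, class_weights))

# Item 1, 1999: (a, b, c) for Class 1 and Class 2, as read from the table.
params_1999 = [(1.44, -0.43, 0.23), (0.67, -1.29, 0.19)]
# Class sizes for 1999 as reported in the class-size table.
weights_1999 = [0.04, 0.96]

print(round(p_mixture(0.0, params_1999, weights_1999), 3))
```

Because the weighted value always lies between the two class-specific probabilities, a unidimensional fit to such data can only approximate both classes at once, which is one way to picture how class-level drift surfaces as apparent drift in a single-class model.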
Figure 1. Plots of item parameters by measurement model: TIMSS 1999, 2003, and 2007.
Figure 2. Plots of changes in item parameters by item: TIMSS 1999, 2003, and 2007.
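Fit statistics for the 21 trended anchor items in 1999, 2003, and 2007 TIMSS.
| Year | Unidimensional 3PL: −2LL | Unidimensional 3PL: AIC | Unidimensional 3PL: BIC | Two-class mixture 3PL: −2LL | Two-class mixture 3PL: AIC | Two-class mixture 3PL: BIC |
|---|---|---|---|---|---|---|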
| 1999 | 44,560 | 44,686 | 45,047 | 44,150 | 44,402 | 45,125 |
| 2003 | 89,510 | 89,636 | 90,040 | 87,980 | 88,232 | 89,039 |
| 2007 | 41,290 | 41,416 | 41,773 | 40,310 | 40,562 | 41,275 |
Estimated using MCMC with 10,000 samples and a burn-in of 4,000 samples. The −2LL is the posterior mean of the MCMC deviance (Li et al.).
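The fit table can be checked arithmetically. The sketch below assumes that, within each model, the first value column is −2LL and the second is AIC, and that the parameter counts are 63 for the unidimensional 3PL (21 items × 3 parameters) and 126 for the two-class mixture. These counts are inferred from the reported numbers, not stated in the source.

```python
def aic(deviance, n_params):
    """Akaike information criterion from a deviance (-2LL) value."""
    return deviance + 2 * n_params

# (-2LL, reported AIC) pairs read from the fit table, by year.
unidimensional = {1999: (44560, 44686), 2003: (89510, 89636), 2007: (41290, 41416)}
mixture = {1999: (44150, 44402), 2003: (87980, 88232), 2007: (40310, 40562)}

P_UNI = 21 * 3      # assumed: 21 items x (a, b, c)
P_MIX = 2 * 21 * 3  # assumed: two classes, each with its own item parameters

for year in unidimensional:
    dev_u, aic_u = unidimensional[year]
    dev_m, aic_m = mixture[year]
    print(year, aic(dev_u, P_UNI) == aic_u, aic(dev_m, P_MIX) == aic_m)
```

Under these assumptions every reported AIC is reproduced exactly, and the mixture model has the lower AIC in all three administrations.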
Class size for the 21 trended anchor items in 1999, 2003, and 2007 TIMSS using two-class mixture 3PL IRT model.
| | 1999 | 2003 | 2007 |
|---|---|---|---|
| Class 1 | 0.04 (0.01) | 0.19 (0.01) | 0.10 (0.01) |
| Class 2 | 0.96 (0.01) | 0.81 (0.01) | 0.90 (0.01) |
| | 2.27 (0.07) | 0.74 (0.01) | 1.76 (0.05) |
Values in parentheses are MC errors.
Conditions for the simulation study: generating values.
| Condition | Item | Time 1, Group 1: a | Time 1, Group 1: b | Time 1, Group 2: a | Time 1, Group 2: b | Time 2, Group 1: a | Time 2, Group 1: b | Time 2, Group 2: a | Time 2, Group 2: b |
|---|---|---|---|---|---|---|---|---|---|
| 1 (Increase in Item Difficulty for Group 1 at Time 2) | 1 | 1.44 | −0.43 | 0.67 | −1.29 | 1.44 | 0.57 | 0.67 | −1.29 |
| 1 | 2 | 1.68 | −1.08 | 1.37 | −0.54 | 1.68 | −0.08 | 1.37 | −0.54 |
| 1 | 3 | 1.39 | −0.71 | 0.66 | −0.55 | 1.39 | 0.29 | 0.66 | −0.55 |
| 1 | 4 | 1.24 | −0.25 | 0.95 | 1.67 | 1.24 | 0.75 | 0.95 | 1.67 |
| 1 | 5 | 1.30 | 0.08 | 2.27 | 1.22 | 1.30 | 1.08 | 2.27 | 1.22 |
| 1 | 6 | 1.38 | −0.42 | 0.58 | −1.58 | 1.38 | 0.58 | 0.58 | −1.58 |
| 1 | 7 | 1.57 | 0.75 | 1.35 | 2.30 | 1.57 | 1.75 | 1.35 | 2.30 |
| 1 | 8 | 1.53 | −1.03 | 1.79 | 0.24 | 1.53 | −0.03 | 1.79 | 0.24 |
| 1 | 9 | 1.71 | −1.96 | 1.59 | −0.97 | 1.71 | −0.96 | 1.59 | −0.97 |
| 1 | 10 | 1.78 | −0.51 | 0.93 | −1.93 | 1.78 | 0.49 | 0.93 | −1.93 |
| 1 | 11 | 1.68 | −0.15 | 1.54 | −0.92 | 1.68 | 0.85 | 1.54 | −0.92 |
| 1 | 12 | 1.47 | 0.98 | 0.82 | 1.70 | 1.47 | 1.98 | 0.82 | 1.70 |
| 1 | 13 | 1.34 | −0.09 | 1.14 | 1.36 | 1.34 | 0.91 | 1.14 | 1.36 |
| 1 | 14 | 1.43 | −0.43 | 0.97 | 1.20 | 1.43 | 0.57 | 0.97 | 1.20 |
| 1 | 15 | 1.15 | −0.48 | 0.76 | 1.14 | 1.15 | 0.52 | 0.76 | 1.14 |
| 1 | 16 | 1.97 | −0.74 | 1.83 | −1.14 | 1.97 | 0.26 | 1.83 | −1.14 |
| 1 | 17 | 1.65 | 1.27 | 1.23 | 1.99 | 1.65 | 2.27 | 1.23 | 1.99 |
| 1 | 18 | 1.59 | 0.97 | 1.89 | 1.70 | 1.59 | 1.97 | 1.89 | 1.70 |
| 1 | 19 | 1.50 | −0.05 | 2.26 | 1.88 | 1.50 | 0.95 | 2.26 | 1.88 |
| 1 | 20 | 1.32 | 0.37 | 1.30 | 1.65 | 1.32 | 1.37 | 1.30 | 1.65 |
| 2 (Increase in Item Discrimination for Group 1 at Time 2) | 1 | 1.44 | −0.43 | 0.67 | −1.29 | 2.44 | −0.43 | 0.67 | −1.29 |
| 2 | 2 | 1.68 | −1.08 | 1.37 | −0.54 | 2.68 | −1.08 | 1.37 | −0.54 |
| 2 | 3 | 1.39 | −0.71 | 0.66 | −0.55 | 2.39 | −0.71 | 0.66 | −0.55 |
| 2 | 4 | 1.24 | −0.25 | 0.95 | 1.67 | 2.24 | −0.25 | 0.95 | 1.67 |
| 2 | 5 | 1.30 | 0.08 | 2.27 | 1.22 | 2.30 | 0.08 | 2.27 | 1.22 |
| 2 | 6 | 1.38 | −0.42 | 0.58 | −1.58 | 2.38 | −0.42 | 0.58 | −1.58 |
| 2 | 7 | 1.57 | 0.75 | 1.35 | 2.30 | 2.57 | 0.75 | 1.35 | 2.30 |
| 2 | 8 | 1.53 | −1.03 | 1.79 | 0.24 | 2.53 | −1.03 | 1.79 | 0.24 |
| 2 | 9 | 1.71 | −1.96 | 1.59 | −0.97 | 2.71 | −1.96 | 1.59 | −0.97 |
| 2 | 10 | 1.78 | −0.51 | 0.93 | −1.93 | 2.78 | −0.51 | 0.93 | −1.93 |
| 2 | 11 | 1.68 | −0.15 | 1.54 | −0.92 | 2.68 | −0.15 | 1.54 | −0.92 |
| 2 | 12 | 1.47 | 0.98 | 0.82 | 1.70 | 2.47 | 0.98 | 0.82 | 1.70 |
| 2 | 13 | 1.34 | −0.09 | 1.14 | 1.36 | 2.34 | −0.09 | 1.14 | 1.36 |
| 2 | 14 | 1.43 | −0.43 | 0.97 | 1.20 | 2.43 | −0.43 | 0.97 | 1.20 |
| 2 | 15 | 1.15 | −0.48 | 0.76 | 1.14 | 2.15 | −0.48 | 0.76 | 1.14 |
| 2 | 16 | 1.97 | −0.74 | 1.83 | −1.14 | 2.97 | −0.74 | 1.83 | −1.14 |
| 2 | 17 | 1.65 | 1.27 | 1.23 | 1.99 | 2.65 | 1.27 | 1.23 | 1.99 |
| 2 | 18 | 1.59 | 0.97 | 1.89 | 1.70 | 2.59 | 0.97 | 1.89 | 1.70 |
| 2 | 19 | 1.50 | −0.05 | 2.26 | 1.88 | 2.50 | −0.05 | 2.26 | 1.88 |
| 2 | 20 | 1.32 | 0.37 | 1.30 | 1.65 | 2.32 | 0.37 | 1.30 | 1.65 |
| 3 (Increase in both Item Difficulty and Discrimination for Group 1 at Time 2) | 1 | 1.44 | −0.43 | 0.67 | −1.29 | 2.44 | 0.57 | 0.67 | −1.29 |
| 3 | 2 | 1.68 | −1.08 | 1.37 | −0.54 | 2.68 | −0.08 | 1.37 | −0.54 |
| 3 | 3 | 1.39 | −0.71 | 0.66 | −0.55 | 2.39 | 0.29 | 0.66 | −0.55 |
| 3 | 4 | 1.24 | −0.25 | 0.95 | 1.67 | 2.24 | 0.75 | 0.95 | 1.67 |
| 3 | 5 | 1.30 | 0.08 | 2.27 | 1.22 | 2.30 | 1.08 | 2.27 | 1.22 |
| 3 | 6 | 1.38 | −0.42 | 0.58 | −1.58 | 2.38 | 0.58 | 0.58 | −1.58 |
| 3 | 7 | 1.57 | 0.75 | 1.35 | 2.30 | 2.57 | 1.75 | 1.35 | 2.30 |
| 3 | 8 | 1.53 | −1.03 | 1.79 | 0.24 | 2.53 | −0.03 | 1.79 | 0.24 |
| 3 | 9 | 1.71 | −1.96 | 1.59 | −0.97 | 2.71 | −0.96 | 1.59 | −0.97 |
| 3 | 10 | 1.78 | −0.51 | 0.93 | −1.93 | 2.78 | 0.49 | 0.93 | −1.93 |
| 3 | 11 | 1.68 | −0.15 | 1.54 | −0.92 | 2.68 | 0.85 | 1.54 | −0.92 |
| 3 | 12 | 1.47 | 0.98 | 0.82 | 1.70 | 2.47 | 1.98 | 0.82 | 1.70 |
| 3 | 13 | 1.34 | −0.09 | 1.14 | 1.36 | 2.34 | 0.91 | 1.14 | 1.36 |
| 3 | 14 | 1.43 | −0.43 | 0.97 | 1.20 | 2.43 | 0.57 | 0.97 | 1.20 |
| 3 | 15 | 1.15 | −0.48 | 0.76 | 1.14 | 2.15 | 0.52 | 0.76 | 1.14 |
| 3 | 16 | 1.97 | −0.74 | 1.83 | −1.14 | 2.97 | 0.26 | 1.83 | −1.14 |
| 3 | 17 | 1.65 | 1.27 | 1.23 | 1.99 | 2.65 | 2.27 | 1.23 | 1.99 |
| 3 | 18 | 1.59 | 0.97 | 1.89 | 1.70 | 2.59 | 1.97 | 1.89 | 1.70 |
| 3 | 19 | 1.50 | −0.05 | 2.26 | 1.88 | 2.50 | 0.95 | 2.26 | 1.88 |
| 3 | 20 | 1.32 | 0.37 | 1.30 | 1.65 | 2.32 | 1.37 | 1.30 | 1.65 |
1. Generating values for Group 1 and Group 2 for Time 1 were obtained from estimated item parameters using the mixture 3PL IRT model of TIMSS 1999 data (items 1 to 20).
2. All simulations in Conditions 1–3 drew ability from a normal distribution with mean 0 and variance 1, θ ~ N(0, 1).
3. In addition to the conditions listed above, two additional conditions were examined:
Condition 4: Change in ability distribution for one group at Time 2
- Time 1: Same generating values used at Time 1 of Condition 1
- Time 2: Same generating values used at Time 1 of Condition 1, with θ ~ N(1, 0.5) for Group 1
Condition 5: Change in ability distribution for one group at Time 2 and increased discrimination
- Time 1: Same generating values used at Time 1 of Condition 1
- Time 2: Same generating values used at Time 2 of Condition 2 (increase in discrimination for Group 1), with θ ~ N(1, 0.5).
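The data-generating step described in these notes can be sketched as follows. This is a hedged illustration, not the authors' code: it assumes a two-group 2PL generating model (the generating values list only a and b), reads N(1, 0.5) as mean 1 and variance 0.5 by analogy with the N(0, 1) note, and uses equal, arbitrary group sizes because the source does not report sample sizes or mixing proportions here. The function names are hypothetical.

```python
import math
import random

def p_2pl(theta, a, b):
    """2PL correct-response probability (logistic metric assumed)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def simulate(items_by_group, ability_by_group, n_per_group, rng):
    """Return a list of (group, theta, responses) tuples.

    items_by_group:   {group: [(a, b), ...]}
    ability_by_group: {group: (mean, variance)}
    """
    data = []
    for group, items in items_by_group.items():
        mean, var = ability_by_group[group]
        for _ in range(n_per_group):
            theta = rng.gauss(mean, math.sqrt(var))
            responses = [int(rng.random() < p_2pl(theta, a, b)) for a, b in items]
            data.append((group, theta, responses))
    return data

# First three items of Condition 1, Time 1 (Group 1 and Group 2).
time1 = {1: [(1.44, -0.43), (1.68, -1.08), (1.39, -0.71)],
         2: [(0.67, -1.29), (1.37, -0.54), (0.66, -0.55)]}
# Condition 1, Time 2: Group 1 difficulty shifted up by 1.
time2 = {1: [(a, b + 1.0) for a, b in time1[1]], 2: time1[2]}

rng = random.Random(2016)
t1_data = simulate(time1, {1: (0.0, 1.0), 2: (0.0, 1.0)}, 500, rng)
# Condition 4, Time 2: same items as Time 1, Group 1 ability ~ N(1, 0.5).
c4_data = simulate(time1, {1: (1.0, 0.5), 2: (0.0, 1.0)}, 500, rng)
```

Fitting a unidimensional model to `t1_data` and to the Time 2 data, then comparing item estimates, would mirror the design outlined in the notes.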
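Fitting data generated from mixture IRT to unidimensional 2PL IRT model: Condition 1 (Increase in item difficulty for Group 1 at Time 2).
| Item | Time 1: a (SE) | Time 1: b (SE) | Time 2: a (SE) | Time 2: b (SE) | Difference in a (Time 1 − Time 2) | Difference in b (Time 1 − Time 2) |
|---|---|---|---|---|---|---|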
| 1 | 0.94 (0.07) | −0.69 (0.06) | 1.27 (0.08) | 0.06 (0.06) | −0.33 | −0.75 |
| 2 | 1.65 (0.10) | −1.38 (0.08) | 1.60 (0.10) | −0.38 (0.07) | 0.05 | −1.00 |
| 3 | 1.09 (0.07) | −0.71 (0.06) | 1.18 (0.08) | 0.05 (0.06) | −0.09 | −0.76 |
| 4 | 1.38 (0.08) | 0.44 (0.06) | 0.97 (0.07) | 1.13 (0.06) | 0.42 | −0.69 |
| 5 | 1.71 (0.10) | 0.88 (0.07) | 1.30 (0.09) | 1.65 (0.08) | 0.41 | −0.77 |
| 6 | 0.85 (0.07) | −0.68 (0.05) | 1.20 (0.08) | 0.03 (0.06) | −0.35 | −0.71 |
| 7 | 1.76 (0.11) | 1.93 (0.10) | 1.41 (0.12) | 2.81 (0.13) | 0.35 | −0.88 |
| 8 | 1.79 (0.11) | −0.81 (0.08) | 1.34 (0.08) | 0.11 (0.06) | 0.45 | −0.91 |
| 9 | 1.83 (0.14) | −2.59 (0.13) | 1.42 (0.10) | −1.49 (0.08) | 0.41 | −1.10 |
| 10 | 0.99 (0.07) | −1.12 (0.06) | 1.54 (0.09) | −0.27 (0.06) | −0.55 | −0.84 |
| 11 | 1.03 (0.07) | −0.60 (0.06) | 1.59 (0.10) | 0.24 (0.07) | −0.56 | −0.84 |
| 12 | 1.09 (0.08) | 1.37 (0.07) | 1.19 (0.09) | 2.16 (0.09) | −0.10 | −0.79 |
| 13 | 1.48 (0.09) | 0.54 (0.06) | 1.15 (0.08) | 1.30 (0.07) | 0.33 | −0.76 |
| 14 | 1.50 (0.09) | 0.13 (0.06) | 1.11 (0.08) | 0.91 (0.06) | 0.39 | −0.78 |
| 15 | 1.19 (0.07) | 0.04 (0.06) | 0.91 (0.07) | 0.69 (0.06) | 0.28 | −0.65 |
| 16 | 1.47 (0.10) | −1.50 (0.08) | 2.04 (0.12) | −0.49 (0.08) | −0.57 | −1.01 |
| 17 | 1.40 (0.10) | 2.19 (0.10) | 1.46 (0.13) | 3.11 (0.14) | −0.06 | −0.92 |
| 18 | 1.72 (0.11) | 2.09 (0.10) | 1.67 (0.14) | 3.13 (0.15) | 0.05 | −1.05 |
| 19 | 1.86 (0.11) | 1.06 (0.08) | 1.17 (0.09) | 1.83 (0.08) | 0.68 | −0.77 |
| 20 | 1.50 (0.09) | 1.12 (0.07) | 1.22 (0.09) | 1.88 (0.09) | 0.28 | −0.76 |
Simulated with 100 replications using posterior mode estimation. Values in parentheses are standard errors.
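The last two columns of this table appear to be the Time 1 estimate minus the Time 2 estimate. A small sketch, using the first three items of Condition 1 and a hypothetical helper named `drift`, reproduces the reported differences:

```python
def drift(a_t1, b_t1, a_t2, b_t2):
    """Time 1 minus Time 2 differences in discrimination and difficulty."""
    return round(a_t1 - a_t2, 2), round(b_t1 - b_t2, 2)

# (a at Time 1, b at Time 1, a at Time 2, b at Time 2) for items 1 to 3.
estimates = {
    1: (0.94, -0.69, 1.27, 0.06),
    2: (1.65, -1.38, 1.60, -0.38),
    3: (1.09, -0.71, 1.18, 0.05),
}

for item, values in estimates.items():
    print(item, drift(*values))
```

A negative difference in b therefore means the item looks harder at Time 2 under the unidimensional model, which matches the direction of the drift built into Group 1.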
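Fitting data generated from mixture IRT to unidimensional 2PL IRT model: Condition 2 (Increase in item discrimination for Group 1 at Time 2).
| Item | Time 1: a (SE) | Time 1: b (SE) | Time 2: a (SE) | Time 2: b (SE) | Difference in a (Time 1 − Time 2) | Difference in b (Time 1 − Time 2) |
|---|---|---|---|---|---|---|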
| 1 | 0.94 (0.07) | −0.69 (0.06) | 1.48 (0.09) | −0.94 (0.07) | −0.55 | 0.25 |
| 2 | 1.65 (0.10) | −1.38 (0.08) | 2.20 (0.14) | −1.86 (0.11) | −0.55 | 0.48 |
| 3 | 1.09 (0.07) | −0.71 (0.06) | 1.65 (0.10) | −1.02 (0.07) | −0.57 | 0.32 |
| 4 | 1.38 (0.08) | 0.44 (0.06) | 2.02 (0.11) | 0.41 (0.07) | −0.63 | 0.03 |
| 5 | 1.71 (0.10) | 0.88 (0.07) | 2.47 (0.14) | 1.06 (0.09) | −0.75 | −0.18 |
| 6 | 0.85 (0.07) | −0.68 (0.05) | 1.40 (0.09) | −0.93 (0.07) | −0.55 | 0.25 |
| 7 | 1.76 (0.11) | 1.93 (0.10) | 2.27 (0.14) | 2.48 (0.13) | −0.52 | −0.55 |
| 8 | 1.79 (0.11) | −0.81 (0.08) | 2.21 (0.13) | −1.16 (0.09) | −0.42 | 0.35 |
| 9 | 1.83 (0.14) | −2.59 (0.13) | 1.76 (0.13) | −2.76 (0.13) | 0.07 | 0.17 |
| 10 | 0.99 (0.07) | −1.12 (0.06) | 1.59 (0.10) | −1.46 (0.08) | −0.61 | 0.34 |
| 11 | 1.03 (0.07) | −0.60 (0.06) | 1.52 (0.09) | −0.74 (0.07) | −0.49 | 0.14 |
| 12 | 1.09 (0.08) | 1.37 (0.07) | 1.26 (0.08) | 1.63 (0.08) | −0.17 | −0.26 |
| 13 | 1.48 (0.09) | 0.54 (0.06) | 2.13 (0.12) | 0.58 (0.08) | −0.65 | −0.03 |
| 14 | 1.50 (0.09) | 0.13 (0.06) | 2.06 (0.11) | 0.00 (0.07) | −0.57 | 0.14 |
| 15 | 1.19 (0.07) | 0.04 (0.06) | 1.77 (0.10) | −0.14 (0.07) | −0.58 | 0.17 |
| 16 | 1.47 (0.10) | −1.50 (0.08) | 2.28 (0.14) | −2.07 (0.12) | −0.81 | 0.57 |
| 17 | 1.40 (0.10) | 2.19 (0.10) | 1.52 (0.10) | 2.54 (0.11) | −0.12 | −0.35 |
| 18 | 1.72 (0.11) | 2.09 (0.10) | 2.03 (0.13) | 2.55 (0.13) | −0.31 | −0.46 |
| 19 | 1.86 (0.11) | 1.06 (0.08) | 2.64 (0.15) | 1.24 (0.10) | −0.79 | −0.18 |
| 20 | 1.50 (0.09) | 1.12 (0.07) | 2.06 (0.11) | 1.41 (0.09) | −0.56 | −0.29 |
Simulated with 100 replications using posterior mode estimation. Values in parentheses are standard errors.
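Fitting data generated from mixture IRT to unidimensional 2PL IRT model: Condition 3 (Increase in both item difficulty and discrimination for Group 1 at Time 2).
| Item | Time 1: a (SE) | Time 1: b (SE) | Time 2: a (SE) | Time 2: b (SE) | Difference in a (Time 1 − Time 2) | Difference in b (Time 1 − Time 2) |
|---|---|---|---|---|---|---|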
| 1 | 0.94 (0.07) | −0.69 (0.06) | 1.82 (0.10) | 0.23 (0.07) | −0.88 | −0.92 |
| 2 | 1.65 (0.10) | −1.38 (0.08) | 1.96 (0.11) | −0.42 (0.07) | −0.31 | −0.96 |
| 3 | 1.09 (0.07) | −0.71 (0.06) | 1.68 (0.09) | 0.16 (0.07) | −0.59 | −0.87 |
| 4 | 1.38 (0.08) | 0.44 (0.06) | 1.57 (0.10) | 1.59 (0.08) | −0.19 | −1.15 |
| 5 | 1.71 (0.10) | 0.88 (0.07) | 2.13 (0.14) | 2.48 (0.13) | −0.41 | −1.60 |
| 6 | 0.85 (0.07) | −0.68 (0.05) | 1.75 (0.10) | 0.20 (0.07) | −0.90 | −0.88 |
| 7 | 1.76 (0.11) | 1.93 (0.10) | 2.09 (0.17) | 3.88 (0.21) | −0.33 | −1.95 |
| 8 | 1.79 (0.11) | −0.81 (0.08) | 1.65 (0.09) | 0.15 (0.07) | 0.14 | −0.96 |
| 9 | 1.83 (0.14) | −2.59 (0.13) | 1.35 (0.09) | −1.62 (0.08) | 0.48 | −0.96 |
| 10 | 0.99 (0.07) | −1.12 (0.06) | 2.18 (0.12) | −0.19 (0.08) | −1.19 | −0.93 |
| 11 | 1.03 (0.07) | −0.60 (0.06) | 2.12 (0.12) | 0.48 (0.08) | −1.09 | −1.08 |
| 12 | 1.09 (0.08) | 1.37 (0.07) | 1.37 (0.11) | 2.56 (0.11) | −0.28 | −1.20 |
| 13 | 1.48 (0.09) | 0.54 (0.06) | 1.79 (0.11) | 1.84 (0.10) | −0.31 | −1.29 |
| 14 | 1.50 (0.09) | 0.13 (0.06) | 1.64 (0.10) | 1.24 (0.08) | −0.15 | −1.11 |
| 15 | 1.19 (0.07) | 0.04 (0.06) | 1.44 (0.09) | 0.98 (0.07) | −0.25 | −0.94 |
| 16 | 1.47 (0.10) | −1.50 (0.08) | 2.86 (0.17) | −0.48 (0.09) | −1.39 | −1.02 |
| 17 | 1.40 (0.10) | 2.19 (0.10) | 1.58 (0.14) | 3.53 (0.17) | −0.18 | −1.34 |
| 18 | 1.72 (0.11) | 2.09 (0.10) | 2.12 (0.18) | 3.97 (0.22) | −0.40 | −1.89 |
| 19 | 1.86 (0.11) | 1.06 (0.08) | 1.92 (0.13) | 2.58 (0.13) | −0.07 | −1.52 |
| 20 | 1.50 (0.09) | 1.12 (0.07) | 1.93 (0.13) | 2.74 (0.13) | −0.43 | −1.62 |
Simulated with 100 replications using posterior mode estimation. Values in parentheses are standard errors.
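Fitting data generated from mixture IRT to unidimensional 2PL IRT model: Condition 4 [change in ability distribution for Group 1 at Time 2 to θ ~ N(1, 0.5)].
| Item | Time 1: a (SE) | Time 1: b (SE) | Time 2: a (SE) | Time 2: b (SE) | Difference in a (Time 1 − Time 2) | Difference in b (Time 1 − Time 2) |
|---|---|---|---|---|---|---|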
| 1 | 0.94 (0.07) | −0.69 (0.06) | 0.88 (0.07) | −1.50 (0.07) | 0.05 | 0.81 |
| 2 | 1.65 (0.10) | −1.38 (0.08) | 1.97 (0.12) | −2.47 (0.12) | −0.32 | 1.09 |
| 3 | 1.09 (0.07) | −0.71 (0.06) | 1.24 (0.08) | −1.51 (0.08) | −0.15 | 0.80 |
| 4 | 1.38 (0.08) | 0.44 (0.06) | 1.94 (0.10) | −0.35 (0.07) | −0.56 | 0.79 |
| 5 | 1.71 (0.10) | 0.88 (0.07) | 2.23 (0.12) | 0.00 (0.08) | −0.52 | 0.89 |
| 6 | 0.85 (0.07) | −0.68 (0.05) | 0.78 (0.07) | −1.46 (0.07) | 0.07 | 0.78 |
| 7 | 1.76 (0.11) | 1.93 (0.10) | 2.10 (0.12) | 0.87 (0.08) | −0.35 | 1.05 |
| 8 | 1.79 (0.11) | −0.81 (0.08) | 2.59 (0.15) | −1.95 (0.12) | −0.80 | 1.15 |
| 9 | 1.83 (0.14) | −2.59 (0.13) | 2.44 (0.19) | −3.81 (0.22) | −0.61 | 1.22 |
| 10 | 0.99 (0.07) | −1.12 (0.06) | 0.85 (0.08) | −2.13 (0.08) | 0.14 | 1.02 |
| 11 | 1.03 (0.07) | −0.60 (0.06) | 0.92 (0.07) | −1.53 (0.07) | 0.11 | 0.94 |
| 12 | 1.09 (0.08) | 1.37 (0.07) | 1.12 (0.07) | 0.54 (0.06) | −0.03 | 0.82 |
| 13 | 1.48 (0.09) | 0.54 (0.06) | 1.94 (0.10) | −0.34 (0.07) | −0.46 | 0.88 |
| 14 | 1.50 (0.09) | 0.13 (0.06) | 2.04 (0.11) | −0.79 (0.08) | −0.55 | 0.92 |
| 15 | 1.19 (0.07) | 0.04 (0.06) | 1.60 (0.09) | −0.69 (0.07) | −0.40 | 0.73 |
| 16 | 1.47 (0.10) | −1.50 (0.08) | 1.47 (0.11) | −2.71 (0.12) | 0.00 | 1.21 |
| 17 | 1.40 (0.10) | 2.19 (0.10) | 1.46 (0.09) | 1.19 (0.07) | −0.06 | 1.00 |
| 18 | 1.72 (0.11) | 2.09 (0.10) | 1.90 (0.11) | 1.02 (0.08) | −0.18 | 1.07 |
| 19 | 1.86 (0.11) | 1.06 (0.08) | 3.01 (0.18) | 0.09 (0.10) | −1.16 | 0.97 |
| 20 | 1.50 (0.09) | 1.12 (0.07) | 1.84 (0.10) | 0.26 (0.07) | −0.34 | 0.86 |
Simulated with 100 replications using posterior mode estimation. Values in parentheses are standard errors.
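Fitting data generated from mixture IRT to unidimensional 2PL IRT model: Condition 5 [increase in item discrimination and change in ability distribution for Group 1 at Time 2 to θ ~ N(1, 0.5)].
| Item | Time 1: a (SE) | Time 1: b (SE) | Time 2: a (SE) | Time 2: b (SE) | Difference in a (Time 1 − Time 2) | Difference in b (Time 1 − Time 2) |
|---|---|---|---|---|---|---|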
| 1 | 0.94 (0.07) | −0.69 (0.06) | 1.42 (0.10) | −2.13 (0.10) | −0.48 | 1.44 |
| 2 | 1.65 (0.10) | −1.38 (0.08) | 2.75 (0.18) | −3.40 (0.19) | −1.10 | 2.01 |
| 3 | 1.09 (0.07) | −0.71 (0.06) | 1.89 (0.12) | −2.26 (0.11) | −0.80 | 1.56 |
| 4 | 1.38 (0.08) | 0.44 (0.06) | 2.79 (0.15) | −1.06 (0.10) | −1.41 | 1.49 |
| 5 | 1.71 (0.10) | 0.88 (0.07) | 3.10 (0.17) | −0.60 (0.10) | −1.39 | 1.49 |
| 6 | 0.85 (0.07) | −0.68 (0.05) | 1.31 (0.09) | −2.10 (0.09) | −0.46 | 1.42 |
| 7 | 1.76 (0.11) | 1.93 (0.10) | 2.54 (0.14) | 0.83 (0.09) | −0.79 | 1.10 |
| 8 | 1.79 (0.11) | −0.81 (0.08) | 3.66 (0.24) | −3.07 (0.20) | −1.87 | 2.26 |
| 9 | 1.83 (0.14) | −2.59 (0.13) | 3.14 (0.26) | −4.65 (0.32) | −1.31 | 2.07 |
| 10 | 0.99 (0.07) | −1.12 (0.06) | 1.38 (0.11) | −2.82 (0.13) | −0.39 | 1.70 |
| 11 | 1.03 (0.07) | −0.60 (0.06) | 1.39 (0.09) | −2.09 (0.10) | −0.36 | 1.50 |
| 12 | 1.09 (0.08) | 1.37 (0.07) | 1.29 (0.07) | 0.55 (0.06) | −0.21 | 0.82 |
| 13 | 1.48 (0.09) | 0.54 (0.06) | 2.69 (0.14) | −0.96 (0.10) | −1.21 | 1.50 |
| 14 | 1.50 (0.09) | 0.13 (0.06) | 2.85 (0.15) | −1.54 (0.11) | −1.35 | 1.67 |
| 15 | 1.19 (0.07) | 0.04 (0.06) | 2.38 (0.13) | −1.45 (0.10) | −1.19 | 1.49 |
| 16 | 1.47 (0.10) | −1.50 (0.08) | 2.30 (0.18) | −3.77 (0.21) | −0.84 | 2.27 |
| 17 | 1.40 (0.10) | 2.19 (0.10) | 1.57 (0.09) | 1.35 (0.08) | −0.17 | 0.84 |
| 18 | 1.72 (0.11) | 2.09 (0.10) | 2.13 (0.12) | 1.06 (0.09) | −0.41 | 1.02 |
| 19 | 1.86 (0.11) | 1.06 (0.08) | 4.18 (0.25) | −0.61 (0.13) | −2.32 | 1.67 |
| 20 | 1.50 (0.09) | 1.12 (0.07) | 2.41 (0.12) | −0.07 (0.08) | −0.91 | 1.19 |
Simulated with 100 replications using posterior mode estimation. Values in parentheses are standard errors.
Changes in the root mean squared error (RMSE) of ability estimates using population and estimated values.
| Condition | Model | Time 1 | Time 2 |
|---|---|---|---|
| 1 | Two-Class 2PL Mixture IRT | 0.21 | 0.20 |
| 1 | Unidimensional 2PL IRT | 0.28 | 0.24 |
| 2 | Two-Class 2PL Mixture IRT | 0.21 | 0.18 |
| 2 | Unidimensional 2PL IRT | 0.28 | 0.26 |
| 3 | Two-Class 2PL Mixture IRT | 0.21 | 0.18 |
| 3 | Unidimensional 2PL IRT | 0.28 | 0.27 |
| 4 | Two-Class 2PL Mixture IRT | 0.21 | 0.74 |
| 4 | Unidimensional 2PL IRT | 0.28 | 0.61 |
| 5 | Two-Class 2PL Mixture IRT | 0.21 | 0.73 |
| 5 | Unidimensional 2PL IRT | 0.28 | 0.63 |
Simulated with 100 replications using posterior mode estimation. Values in parentheses are standard errors.
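For reference, RMSE between generating (population) abilities and their estimates is conventionally computed as the square root of the mean squared difference. The sketch below uses made-up toy values purely to show the calculation; none of the numbers come from the study, and the function name `rmse` is illustrative.

```python
import math

def rmse(true_values, estimates):
    """Root mean squared error between true and estimated abilities."""
    if len(true_values) != len(estimates) or not true_values:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(e - t) ** 2 for t, e in zip(true_values, estimates)]
    return math.sqrt(sum(squared) / len(squared))

# Hypothetical toy abilities, for illustration only.
theta_true = [-1.0, 0.0, 0.5, 1.2]
theta_hat = [-0.8, 0.1, 0.3, 1.5]
print(round(rmse(theta_true, theta_hat), 3))
```

In a replicated simulation, this quantity would typically be computed within each replication and then averaged across replications.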
Changes in the root mean squared difference (RMSD) of ability using estimated ability across Time 1 and Time 2.
| 1 | 0.003 | 0.002 |
| 2 | 0.015 | 0.008 |
| 3 | 0.014 | 0.007 |
| 4 | 0.003 | 0.006 |
| 5 | 0.012 | 0.021 |
Simulated with 100 replications using posterior mode estimation. Values in parentheses are standard errors.
Classification accuracy: Proportion correctly classified.
| Condition | Time 1 | Time 2 |
|---|---|---|
| 1 | 0.875 | 0.880 |
| 2 | 0.875 | 0.903 |
| 3 | 0.875 | 0.906 |
| 4 | 0.875 | 0.918 |
| 5 | 0.875 | 0.915 |
Simulated with 100 replications using posterior mode estimation.
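The proportion correctly classified can be computed by comparing generating group labels with assigned class labels. The sketch below is one possible way to do this, not the authors' procedure: it takes the best agreement over relabelings of the classes, since class labels in mixture models are arbitrary, and the source does not say how labels were matched. The function name and the toy labels are hypothetical.

```python
from itertools import permutations

def classification_accuracy(true_labels, assigned_labels):
    """Proportion correctly classified, maximized over class relabelings."""
    if len(true_labels) != len(assigned_labels) or not true_labels:
        raise ValueError("inputs must be non-empty and of equal length")
    labels = sorted(set(true_labels) | set(assigned_labels))
    best = 0.0
    for perm in permutations(labels):
        mapping = dict(zip(labels, perm))
        hits = sum(1 for t, a in zip(true_labels, assigned_labels)
                   if mapping[a] == t)
        best = max(best, hits / len(true_labels))
    return best

# Hypothetical toy labels, for illustration only.
true_groups = [1, 1, 2, 2, 2, 1, 2, 2]
assigned = [1, 2, 2, 2, 2, 1, 2, 2]
print(classification_accuracy(true_groups, assigned))
```

With only two classes the search covers two labelings, so the cost of checking every permutation is negligible.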