| Literature DB >> 29322400 |
Jesper Tijmstra1, Maria Bolsinova2, Minjeong Jeon3.
Abstract
This article proposes a general mixture item response theory (IRT) framework that allows for classes of persons to differ with respect to the type of processes underlying the item responses. Through the use of mixture models, nonnested IRT models with different structures can be estimated for different classes, and class membership can be estimated for each person in the sample. If researchers are able to provide competing measurement models, this mixture IRT framework may help them deal with some violations of measurement invariance. To illustrate this approach, we consider a two-class mixture model, where a person's responses to Likert-scale items containing a neutral middle category are either modeled using a generalized partial credit model, or through an IRTree model. In the first model, the middle category ("neither agree nor disagree") is taken to be qualitatively similar to the other categories, and is taken to provide information about the person's endorsement. In the second model, the middle category is taken to be qualitatively different and to reflect a nonresponse choice, which is modeled using an additional latent variable that captures a person's willingness to respond. The mixture model is studied using simulation studies and is applied to an empirical example.Entities:
Keywords: General mixture item response models; IRTree models; Item response theory; Likert scales; Measurement invariance; Mixture modeling; Response styles
Mesh:
Year: 2018 PMID: 29322400 PMCID: PMC6267524 DOI: 10.3758/s13428-017-0997-0
Source DB: PubMed Journal: Behav Res Methods ISSN: 1554-351X
Fig. 1Two decision trees for a five-category Likert-scale item
Results of the simulation study on classification accuracy
|
|
|
|
| |||||
|---|---|---|---|---|---|---|---|---|
| 500 | 20 | 1 | .25 | .83 (.06) | .39 (.29) | .98 (.02) | .60 (.19) | |
| .5 | .83 (.05) | .83 (.10) | .84 (.07) | .49 (.08) | ||||
| 2 | .25 | .81 (.06) | .28 (.29) | .98 (.02) | .58 (.21) | |||
| .5 | .81 (.05) | .83 (.08) | .80 (.09) | .46 (.09) | ||||
| 40 | 1 | .25 | .94 (.03) | .82 (.10) | .98 (.01) | .85 (.04) | ||
| .5 | .94 (.02) | .94 (.02) | .94 (.03) | .79 (.05) | ||||
| 2 | .25 | .93 (.03) | .78 (.13) | .98 (.01) | .84 (.04) | |||
| .5 | .94 (.02) | .94 (.02) | .93 (.02) | .79 (.05) | ||||
| 1000 | 20 | 1 | .25 | .88 (.04) | .64 (.18) | .96 (.01) | .64 (.08) | |
| .5 | .85 (.04) | .86 (.04) | .85 (.06) | .47 (.10) | ||||
| 2 | .25 | .87 (.02) | .61 (.11) | .96 (.02) | .61 (.06) | |||
| .5 | .85 (.03) | .85 (.04) | .85 (.04) | .46 (.07) | ||||
| 40 | 1 | .25 | .96 (.01) | .88 (.04) | .98 (.01) | .85 (.03) | ||
| .5 | .95 (.01) | .95 (.02) | .95 (.02) | .81 (.05) | ||||
| 2 | .25 | .95 (.01) | .87 (.04) | .98 (.01) | .84 (.03) | |||
| .5 | .95 (.01) | .94 (.02) | .95 (.02) | .80 (.05) | ||||
| 2000 | 20 | 1 | .25 | .90 (.02) | .71 (.09) | .96 (.01) | .59 (.07) | |
| .5 | .87 (.03) | .86 (.03) | .88 (.03) | .48 (.08) | ||||
| 2 | .25 | .89 (.02) | .71 (.07) | .96 (.01) | .59 (.07) | |||
| .5 | .86 (.02) | .86 (.03) | .87 (.03) | .45 (.07) | ||||
| 40 | 1 | .25 | .96 (.01) | .90 (.03) | .98 (.01) | .85 (.04) | ||
| .5 | .95 (.01) | .95 (.02) | .96 (.01) | .81 (.04) | ||||
| 2 | .25 | .96 (.01) | .89 (.03) | .98 (.01) | .84 (.03) | |||
| .5 | .95 (.01) | .95 (.02) | .96 (.01) | .80 (.04) |
Average values of P (overall proportion of correctly classified persons), P (proportion of correctly classified persons among those whose true class membership is IRTree), P (proportion of correctly classified persons among those whose true class membership is gPCM-5), and P (proportion of persons that were assigned to the correct class with high certainty) and their standard deviations (SD) across 50 replications for different sample sizes (N), number of items (K), number of dimensions (D), and true proportions of persons belonging to the IRTree class (P)
Average absolute bias (Bias), variance, and mean squared error (MSE) of the estimates of each type of item parameter in the mixture IRT model (1000 persons, 40 items, single dimension of primary interest; based on 100 replications)
| Bias | Variance | MSE | Bias | Variance | MSE | Bias | Variance | MSE | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 0.027 | 0.010 | 0.011 | 0.010 | 0.014 | 0.014 | 0.015 | 0.025 | 0.025 | |||
|
| 0.045 | 0.044 | 0.048 | 0.056 | 0.075 | 0.080 | 0.044 | 0.121 | 0.128 | |||
|
| – | – | – | 0.068 | 0.115 | 0.121 | 0.025 | 0.052 | 0.052 | |||
|
| – | – | – | 0.017 | 0.067 | 0.067 | 0.033 | 0.033 | 0.034 | |||
|
| – | – | – | 0.134 | 0.130 | 0.164 | 0.066 | 0.052 | 0.063 | |||
|
| – | – | – | 0.132 | 0.531 | 0.595 | 0.065 | 0.218 | 0.236 | |||
Fig. 2Bias of the estimates of 𝜃1 under the nonmixture gPCM-5 (a, b, c) and the mixture model (d, e, f) when the true model is the mixture model with the true proportion of persons in the IRTree class equal to P. Each point represents a single person
Fig. 3Mean squared error (MSE) of the estimates of 𝜃1 under the nonmixture gPCM-5 (a, b, c) and the mixture model (d, e, f) when the true model is the mixture model with the true proportion of persons in the IRTree class equal to P. Each point represents a single person
Model comparison for the three models fitted to the experience in close relationships data: Expectation of the deviance (; measure of model fit), effective number of parameters (p; measure of model complexity), and deviance information criterion (DIC)
| Model |
|
| DIC |
|---|---|---|---|
| gPCM-5 | 86993.17 | 2010.87 | 89004.04 |
| IRTree | 86441.34 | 2633.07 | 89074.41 |
| Mixture | 82275.47 | 2294.56 | 84570.03 |
Model comparison of the gPCM-5 and IRTree model considered separately for both classes obtained based on the mixture model
| Class | Class | |||||
|---|---|---|---|---|---|---|
| Model |
|
| DIC |
|
| DIC |
| gPCM-5 | 30361.66 | 771.84 | 31133.50 | 52484.74 | 1370.02 | 53854.75 |
| IRTree | 29715.09 | 994.53 | 30709.63 | 52671.08 | 1749.36 | 54420.44 |