| Literature DB >> 31019359 |
Janine Buchholz1, Johannes Hartig1.
Abstract
Questionnaires for the assessment of attitudes and other psychological traits are crucial in educational and psychological research, and item response theory (IRT) has become a viable tool for scaling such data. Many international large-scale assessments aim at comparing these constructs across countries, and the invariance of measures across countries is thus required. In its most recent cycle, the Programme for International Student Assessment (PISA 2015) implemented an innovative approach for testing the invariance of IRT-scaled constructs in the context questionnaires administered to students, parents, school principals, and teachers. On the basis of a concurrent calibration with equal item parameters across all groups (i.e., languages within countries), a group-specific item-fit statistic (root mean square deviance [RMSD]) was used as a measure for the invariance of item parameters for individual groups. The present simulation study examines the statistic's distribution under different types and extents of (non)invariance in polytomous items. Responses to five 4-point Likert-type items were generated under the generalized partial credit model (GPCM) for 1,000 simulees in 50 groups each. For one of the five items, either location or discrimination parameters were drawn from a normal distribution. In addition to the type of noninvariance, the extent of noninvariance was varied by manipulating the variation of these distributions. The results indicate that the RMSD statistic is better at detecting noninvariance related to between-group differences in item location than in item discrimination. The study's findings may be used as a starting point to sensitivity analysis aiming to define cutoff values for determining (non)invariance.Entities:
Keywords: attitude measurement; comparability; cross-cultural testing; differential item functioning; invariance; item response theory; partial credit model; polytomous items; questionnaires; simulation
Year: 2017 PMID: 31019359 PMCID: PMC6463271 DOI: 10.1177/0146621617748323
Source DB: PubMed Journal: Appl Psychol Meas ISSN: 0146-6216
Simulation Conditions With Varying Types and Extents of Noninvariance of Item 1.
| Condition | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Baseline | Noninvariance of item location | Noninvariance of item discrimination | |||||||
|
| 0.00 | 0.25 | 0.50 | 0.75 | 1.00 | 0.00 | 0.00 | 0.00 | 0.00 |
|
| 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.25 | 0.50 | 0.75 | 1.00 |
Figure 1.Distribution of RMSD for Item 1 across 50 groups and 100 replications in the baseline condition ().
Note. RMSD = root mean square deviance.
Distribution of RMSD for Item 1 Across 50 Groups and 100 Replications in Each Noninvariance Simulation Condition.
| Extent | ||||
|---|---|---|---|---|
| Type |
|
|
|
|
| Item location (β) |
|
|
|
|
|
|
|
|
| |
| Item discrimination (α) |
|
|
|
|
Note. RMSD = root mean square deviance.
Descriptive Statistics of RMSD for Item 1 Across 50 Groups and 100 Replications in Each Simulation Condition.
| Condition | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Baseline |
|
| |||||||
| .25 | .50 | .75 | 1.00 | .25 | .50 | .75 | 1.00 | ||
| Minimum | .005 | .006 | .007 | .013 | .019 | .006 | .007 | .007 | .009 |
| Maximum | .047 | .148 | .300 | .334 | .401 | .101 | .333 | .396 | .458 |
| p20 | .013 | .020 | .028 | .040 | .055 | .016 | .020 | .028 | .036 |
| p50 | .018 | .030 | .050 | .073 | .093 | .021 | .030 | .045 | .063 |
| p90 | .026 | .060 | .111 | .153 | .188 | .033 | .057 | .091 | .126 |
| prop > .1 | .000 | .004 | .136 | .321 | .451 | .000 | .025 | .082 | .165 |
| prop > .3 | .000 | .000 | .000 | .001 | .010 | .000 | .000 | .003 | .009 |
Note. RMSD = root mean square deviance; p20 = 20th percentile; p50 = 50th percentile (median); p90 = 90th percentile; prop > .1 and .3 denote the proportion of cases above RMSD values of .1 and .3, respectively.