| Literature DB >> 30964935 |
Alexandra Rouquette1,2,3, Jean-Benoit Hardouin4,5, Alexis Vanhaesebrouck1, Véronique Sébille4,5, Joël Coste3,6.
Abstract
OBJECTIVE: The aims were to review practices concerning Differential Item Functioning (DIF) detection in composite measurement scales, particularly those used in health research, and to provide guidance on how to proceed if statistically significant DIF is detected.Entities:
Mesh:
Year: 2019 PMID: 30964935 PMCID: PMC6456214 DOI: 10.1371/journal.pone.0215073
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Flowchart of the selection of articles reporting detection of Differential Item Functioning (DIF).
DIF: Differential Item Functioning, CFA: Confirmatory Factorial Analysis, IRT: Item Response Theory, PROMIS: Patient Reported Outcomes Measurement Information System, 2PL: Two Parameters Logistic model, LORDIF: Logistic Ordinal Regression Differential Item Functioning using IRT [20].
Practices related to the Differential Item Functioning (DIF) detection and results within the item response theory framework.
| Rasch | 2PL | LORDIF | |
|---|---|---|---|
| Sample size | 300 (166–610) | 964 (762–2429) | 1082 (573–2520) |
| Non-uniform DIF assessed, n (%) | 13 (21%) | 4 (25%) | 7 (63%) |
| Use of DIF effect-size, n (%) | 18 (30%) | 3 (19%) | 9 (82%) |
| Statistically significant DIF detected, n (%) | 49 (79%) | 14 (88%) | 8 (73%) |
* Median (1st quartile– 3rd quartile), 2PL: two parameters logistic, LORDIF: Logistic Ordinal Regression Differential Item Functioning using Item Response Theory [20]
Factors manipulated in the simulation study and values studied in the four kinds of Differential Item Functioning (DIF).
| Factor | Values studied | |||
|---|---|---|---|---|
| Uniform DIF | Balanced non-uniform DIF | Balanced non-uniform DIF | ||
| Gentle Slope | Steep Slope | |||
| {4, 8} | {8} | {8} | ||
| {100, 200} | {200} | {200} | ||
| {0, 0.1, 0.2, | {0.1} | {0.1} | ||
| Δ = {0.25, 0.5, 1} | Δ = {0.25, 0.5, 1} | Δ = {0.25, 0.5, 1} | ||
| {25%, 50%, 75%} | ||||
| {5} | ||||
| Percentiles of the standardized normal distribution | ||||
| Items concerned by DIF were: | ||||
| • “Unif”: uniformly distributed along the continuum | ||||
| • “Mean”: close to the mean of the item difficulties in the scale | ||||
| • “Extreme”: the most difficult and easiest items of the scale | ||||
| • “High”: the most difficult items of the scale | ||||
| • “Low”: the easiest items of the scale | ||||
In bold and italics, values added for the scenarios added in the current work
* as defined in Fig 2 was added in the current work
** as defined in the Fig 2 were added in the current work
Fig 2Item characteristic curves in the reference and focal groups and Differential Item Functioning (DIF) effect-size used in the simulation model for the four kinds of DIF.
δi: item location parameter for the item category i, Δ = {0.25, 0.5, 1}, the latent trait scale on the X-axis is given in logits.
Results of the multivariate analysis of variance of the measurement bias for the four kinds of Differential Item Functioning (DIF) studied.
| Factor | Uniform DIF | Balanced non-uniform DIF | Balanced non-uniform DIF | Unbalanced non-uniform DIF | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Category | Estimate (SE) | Category | Estimate (SE) | Category | Estimate (SE) | Category | Estimate (SE) | |||||
| Reference | <0.001 | Reference | 0.808 | Reference | 0.270 | Reference | <0.001 | |||||
| 0.051 (0.002) | -0.001 (0.002) | 0.000 (0.002) | 0.204 (0.004) | |||||||||
| 0.145 (0.002) | -0.000 (0.002) | 0.002 (0.002) | 0.185 (0.004) | |||||||||
| -0.003 (0.002) | ||||||||||||
| Reference | <0.001 | Reference | 0.364 | Reference | 0.030 | Reference | <0.001 | |||||
| 0.063 (0.002) | 0.003 (0.002) | -0.003 (0.002) | 0.000 (0.004) | |||||||||
| 0.128 (0.002) | 0.002 (0.002) | 0.002 (0.002) | 0.006 (0.004) | |||||||||
| Reference | <0.001 | Reference | <0.001 | Reference | <0.001 | Reference | <0.001 | |||||
| -0.024 (0.001) | 0.001 (0.003) | 0.000 (0.003) | -0.026 (0.003) | |||||||||
| -0.052 (0.001) | 0.001 (0.003) | 0.000 (0.003) | -0.046 (0.003) | |||||||||
| -0.037 (0.001) | 0.017 (0.003) | -0.025 (0.003) | -0.084 (0.003) | |||||||||
| -0.011 (0.001) | -0.015 (0.003) | 0.028 (0.003) | 0.037 (0.003) | |||||||||
| 0.064 (0.003) | <0.001 | 0.214 (0.005) | <0.001 | |||||||||
| 0.129 (0.003) | 0.446 (0.005) | |||||||||||
| 0.166 (0.003) | 0.184 (0.005) | |||||||||||
| 0.366 (0.003) | 0.410 (0.005) | |||||||||||
| 0.089 (0.002) | <0.001 | -0.004 (0.003) | 0.225 | -0.003 (0.003) | 0.312 | 0.017 (0.003) | <0.001 | |||||
Note: Only the interaction terms which were statistically significant and with estimates higher than 0.1 were kept in the models. SE: Standard Error, Δ: Magnitude of the group difference in the location parameter; Proportion: proportion of items with DIF in the scale; Position: position of the location parameters of items with DIF along the latent trait; Unif: uniformly distributed; Mean: around the mean of the item difficulties; Extreme: high and low difficulties; High: high difficulties; Low: low difficulties
*: interaction