| Literature DB >> 32185193 |
Marjan Faghih1, Zahra Bagheri1, Dejan Stevanovic2, Seyyed Mohhamad Taghi Ayatollahi1, Peyman Jafari1.
Abstract
The logistic regression (LR) model for assessing differential item functioning (DIF) is highly dependent on the asymptotic sampling distributions. However, for rare events data, the maximum likelihood estimation method may be biased and the asymptotic distributions may not be reliable. In this study, the performance of the regular maximum likelihood (ML) estimation is compared with two bias correction methods including weighted logistic regression (WLR) and Firth's penalized maximum likelihood (PML) to assess DIF for imbalanced or rare events data. The power and type I error rate of the LR model for detecting DIF were investigated under different combinations of sample size, moderate and severe magnitudes of uniform DIF (DIF = 0.4 and 0.8), sample size ratio, number of items, and the imbalanced degree (τ). Indeed, as compared with WLR and for severe imbalanced degree (τ = 0.069), there were reductions of approximately 30% and 24% under DIF = 0.4 and 27% and 23% under DIF = 0.8 in the power of the PML and ML, respectively. The present study revealed that the WLR outperforms both the ML and PML estimation methods when logistic regression is used to evaluate DIF for imbalanced or rare events data.Entities:
Mesh:
Year: 2020 PMID: 32185193 PMCID: PMC7060847 DOI: 10.1155/2020/1632350
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Statistical power of different methods of estimation under different combinations.
| Item | Ratio |
| DIF: 0.4 | DIF: 0.8 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
| |||||||||||
| ML | PML | WLR | ML | PML | WLR | ML | PML | WLR | ML | PML | WLR | |||
| 5 |
| 200 | 0.13 | 0.10 | 0.16 | 0.09 | 0.06 | 0.12 | 0.34 | 0.30 | 0.42 | 0.20 | 0.15 | 0.28 |
| 600 | 0.29 | 0.26 | 0.33 | 0.17 | 0.15 | 0.21 | 0.78 | 0.75 | 0.83 | 0.49 | 0.45 | 0.58 | ||
| 1000 | 0.46 | 0.42 | 0.51 | 0.23 | 0.21 | 0.27 | 0.94 | 0.92 | 0.96 | 0.69 | 0.66 | 0.77 | ||
|
| ||||||||||||||
| 5 |
| 200 | 0.13 | 0.11 | 0.16 | 0.09 | 0.07 | 0.13 | 0.32 | 0.29 | 0.41 | 0.19 | 0.15 | 0.30 |
| 600 | 0.29 | 0.27 | 0.32 | 0.16 | 0.15 | 0.22 | 0.74 | 0.72 | 0.82 | 0.45 | 0.44 | 0.59 | ||
| 1000 | 0.41 | 0.39 | 0.46 | 0.23 | 0.22 | 0.29 | 0.89 | 0.88 | 0.93 | 0.62 | 0.61 | 0.73 | ||
|
| ||||||||||||||
| 5 |
| 200 | 0.13 | 0.11 | 0.17 | 0.09 | 0.08 | 0.16 | 0.30 | 0.28 | 0.41 | 0.17 | 0.16 | 0.29 |
| 600 | 0.21 | 0.20 | 0.26 | 0.13 | 0.13 | 0.18 | 0.61 | 0.60 | 0.73 | 0.37 | 0.36 | 0.51 | ||
| 1000 | 0.37 | 0.35 | 0.42 | 0.21 | 0.20 | 0.26 | 0.86 | 0.85 | 0.91 | 0.56 | 0.56 | 0.70 | ||
|
| ||||||||||||||
| 15 |
| 200 | 0.13 | 0.11 | 0.17 | 0.10 | 0.09 | 0.13 | 0.36 | 0.33 | 0.45 | 0.22 | 0.18 | 0.29 |
| 600 | 0.34 | 0.32 | 0.38 | 0.20 | 0.19 | 0.23 | 0.81 | 0.80 | 0.87 | 0.52 | 0.49 | 0.62 | ||
| 1000 | 0.48 | 0.46 | 0.53 | 0.28 | 0.25 | 0.32 | 0.96 | 0.96 | 0.98 | 0.74 | 0.72 | 0.81 | ||
|
| ||||||||||||||
| 15 |
| 200 | 0.13 | 0.12 | 0.17 | 0.11 | 0.09 | 0.15 | 0.35 | 0.33 | 0.44 | 0.22 | 0.20 | 0.32 |
| 600 | 0.34 | 0.32 | 0.39 | 0.20 | 0.20 | 0.25 | 0.77 | 0.77 | 0.85 | 0.51 | 0.50 | 0.65 | ||
| 1000 | 0.45 | 0.44 | 0.50 | 0.25 | 0.24 | 0.31 | 0.94 | 0.94 | 0.97 | 0.70 | 0.70 | 0.80 | ||
|
| ||||||||||||||
| 15 |
| 200 | 0.13 | 0.13 | 0.17 | 0.10 | 0.08 | 0.14 | 0.31 | 0.31 | 0.41 | 0.19 | 0.19 | 0.31 |
| 600 | 0.25 | 0.25 | 0.31 | 0.15 | 0.15 | 0.21 | 0.74 | 0.74 | 0.83 | 0.43 | 0.44 | 0.57 | ||
| 1000 | 0.37 | 0.37 | 0.43 | 0.23 | 0.24 | 0.30 | 0.90 | 0.89 | 0.94 | 0.64 | 0.66 | 0.77 | ||
Note. Ratio: sample size ratio between the reference and focal groups. nr and nf represent sample sizes in reference and focal groups, respectively. N = total sample size; N = nr + nf. τ: the fraction of 1s in the population. DIF: differential item functioning; ML: maximum likelihood; PML: penalized maximum likelihood; WLR: Weighted Logistic Regression.
Type I error rate of different methods of estimation under different combinations.
| Item | Ratio |
| DIF: 0.4 | DIF: 0.8 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
| |||||||||||
| ML | PML | WLR | ML | PML | WLR | ML | PML | WLR | ML | PML | WLR | |||
| 5 |
| 200 | 0.07 | 0.05 | 0.02 | 0.07 | 0.05 | 0.01 | 0.07 | 0.05 | 0.03 | 0.07 | 0.05 | 0.01 |
| 600 | 0.05 | 0.04 | 0.03 | 0.05 | 0.03 | 0.01 | 0.08 | 0.06 | 0.05 | 0.06 | 0.04 | 0.01 | ||
| 1000 | 0.07 | 0.05 | 0.04 | 0.06 | 0.04 | 0.01 | 0.11 | 0.08 | 0.08 | 0.06 | 0.05 | 0.01 | ||
|
| ||||||||||||||
| 5 |
| 200 | 0.07 | 0.05 | 0.03 | 0.07 | 0.05 | 0.01 | 0.08 | 0.06 | 0.04 | 0.07 | 0.05 | 0.01 |
| 600 | 0.06 | 0.04 | 0.02 | 0.04 | 0.04 | 0.01 | 0.08 | 0.06 | 0.05 | 0.05 | 0.04 | 0.01 | ||
| 1000 | 0.06 | 0.05 | 0.03 | 0.05 | 0.04 | 0.01 | 0.11 | 0.09 | 0.07 | 0.06 | 0.04 | 0.01 | ||
|
| ||||||||||||||
| 5 |
| 200 | 0.06 | 0.04 | 0.03 | 0.05 | 0.04 | 0.01 | 0.07 | 0.05 | 0.04 | 0.06 | 0.04 | 0.01 |
| 600 | 0.07 | 0.05 | 0.03 | 0.07 | 0.05 | 0.01 | 0.08 | 0.06 | 0.04 | 0.07 | 0.05 | 0.01 | ||
| 1000 | 0.07 | 0.06 | 0.04 | 0.06 | 0.04 | 0.01 | 0.09 | 0.07 | 0.07 | 0.06 | 0.05 | 0.01 | ||
|
| ||||||||||||||
| 15 |
| 200 | 0.05 | 0.03 | 0.02 | 0.05 | 0.03 | 0.01 | 0.05 | 0.03 | 0.02 | 0.05 | 0.03 | 0.01 |
| 600 | 0.05 | 0.04 | 0.02 | 0.05 | 0.04 | 0.01 | 0.06 | 0.04 | 0.03 | 0.05 | 0.04 | 0.01 | ||
| 1000 | 0.05 | 0.05 | 0.03 | 0.05 | 0.05 | 0.01 | 0.05 | 0.05 | 0.03 | 0.05 | 0.05 | 0.01 | ||
|
| ||||||||||||||
| 15 |
| 200 | 0.07 | 0.05 | 0.03 | 0.06 | 0.05 | 0.01 | 0.07 | 0.05 | 0.03 | 0.07 | 0.05 | 0.01 |
| 600 | 0.06 | 0.04 | 0.03 | 0.06 | 0.05 | 0.01 | 0.06 | 0.05 | 0.03 | 0.06 | 0.05 | 0.01 | ||
| 1000 | 0.04 | 0.04 | 0.02 | 0.04 | 0.03 | 0.01 | 0.05 | 0.04 | 0.02 | 0.04 | 0.04 | 0.01 | ||
|
| ||||||||||||||
| 15 |
| 200 | 0.06 | 0.05 | 0.02 | 0.06 | 0.05 | 0.01 | 0.06 | 0.05 | 0.02 | 0.06 | 0.05 | 0.01 |
| 600 | 0.05 | 0.05 | 0.02 | 0.05 | 0.04 | 0.01 | 0.05 | 0.05 | 0.02 | 0.05 | 0.05 | 0.01 | ||
| 1000 | 0.05 | 0.04 | 0.02 | 0.05 | 0.04 | 0.01 | 0.05 | 0.04 | 0.02 | 0.05 | 0.04 | 0.01 | ||
Note. Ratio: sample size ratio between the reference and focal groups. nr and nf represent sample sizes in reference and focal groups, respectively. N = total sample size; N = nr + nf. τ: the fraction of 1s in the population. DIF: differential item functioning; ML: maximum likelihood; PML: penalized maximum likelihood; WLR: Weighted Logistic Regression. Near to 0.01.
Figure 1The average power of MLE (solid lines), PML (dotted line), and WLR (broken line) methods on measures with 5 and 15 items. Note. Left panel for DIF = 0.4 and right panel for DIF = 0.8. From top to bottom, the four panels are (nf = nr, τ = 0.156), (nf = 3nr, τ = 0.156), (nf = nr, τ = 0.069), and (nf = 3nr, τ = 0.069).
Figure 2The average type I error rates of MLE (solid lines), PMLE (dotted line), and WLR (broken line) methods on measures with 5 and 15 items. Note. Left panel for DIF = 0.4 and right panel for DIF = 0.8. From top to bottom, the four panels are (nf = nr, τ = 0.156), (nf = 3nr, τ = 0.156), (nf = nr, τ = 0.069), and (nf = 3nr, τ = 0.069).
The results of DIF analysis across male and female Serbian individuals based on ML, PML, and WLR methods.
| Item | ML | PML | WLR | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| B | SE |
| B | SE |
| B | SE |
|
| ||
| Social phobia | 7 | 0.61 | 0.29 | 0.032 | 0.6 | 0.29 | 0.042 | 0.61 | 0.31 | 0.042 | 0.24 |
| 20 | −1.16 | 0.44 | 0.007 | −1.13 | 0.44 | 0.01 | −1.19 | 0.42 | 0.002 | 0.17 | |
| 38 | 1.24 | 0.44 | 0.002 | 1.19 | 0.43 | 0.004 | 1.18 | 0.41 | 0.001 | 0.19 | |
| 43 | −1.57 | 0.47 | <0.001 | −1.52 | 0.46 | 0.001 | −1.56 | 0.44 | <0.001 | 0.19 | |
|
| |||||||||||
| Separation anxiety | 9 | −0.89 | 0.48 | 0.06 | −0.86 | 0.47 | 0.1 | −0.83 | 0.36 | 0.015 | 0.31 |
| 17 | 2.06 | 1.22 | 0.045 | 1.7 | 1.06 | 0.092 | 0.97 | 0.75 | 0.008 | 0.11 | |
| 45 | 0.96 | 0.48 | 0.034 | 0.91 | 0.47 | 0.059 | 1.04 | 0.56 | 0.036 | 0.09 | |
| 46 | −3.04 | 1.14 | 0.001 | −2.69 | 1.01 | 0.003 | −1.9 | 0.51 | <0.001 | 0.14 | |
|
| |||||||||||
| Generalized anxiety disorder | 13 | 1.14 | 0.35 | 0.001 | 1.12 | 0.35 | 0.001 | 1.12 | 0.35 | 0.001 | 0.31 |
| 37 | −2.01 | 0.44 | <0.001 | −1.96 | 0.44 | <0.001 | −1.95 | 0.44 | <0.001 | 0.22 | |
|
| |||||||||||
| Obsessive compulsive disorder | 23 | 0.79 | 0.43 | 0.059 | 0.75 | 0.43 | 0.1 | 0.79 | 0.41 | 0.038 | 0.18 |
|
| |||||||||||
| Major depression | 2 | 1.23 | 0.45 | 0.004 | 1.18 | 0.44 | 0.006 | 1.19 | 0.45 | 0.003 | 0.19 |
| 6 | −1.22 | 0.58 | 0.031 | −1.18 | 0.56 | 0.043 | −1.15 | 0.42 | 0.004 | 0.12 | |
| 21 | 0.63 | 0.32 | 0.048 | 0.62 | 0.32 | 0.072 | 0.65 | 0.34 | 0.049 | 0.26 | |
Note. P value is reported in three decimal places for more accuracy of comparing the three models. τ: the fraction of 1s in the population that is extracted from a data set of 4192 adolescents in eleven countries. B: regression coefficient for testing uniform DIF. SE: standard error of regression coefficient.