| Literature DB >> 27943018 |
Mathilde G E Verdam1,2, Frans J Oort3,4, Mirjam A G Sprangers3.
Abstract
PURPOSE: Comparison of patient-reported outcomes may be invalidated by the occurrence of item bias, also known as differential item functioning. We show two ways of using structural equation modeling (SEM) to detect item bias: (1) multigroup SEM, which enables the detection of both uniform and nonuniform bias, and (2) multidimensional SEM, which enables the investigation of item bias with respect to several variables simultaneously.Entities:
Keywords: Depression Scale; Differential item functioning; Hospital Anxiety; Item bias; Structural equation modeling
Mesh:
Year: 2016 PMID: 27943018 PMCID: PMC5420371 DOI: 10.1007/s11136-016-1469-1
Source DB: PubMed Journal: Qual Life Res ISSN: 0962-9343 Impact factor: 4.147
Fig. 1Two-group Measurement Model for gender-related item bias detection in the anxiety subscale of the HADS. Similar models have been used for the detection of age-related item bias in the anxiety subscale of the HADS, and for the detection of gender- and age-related item bias in the depression subscale of the HADS. The squares represent the underlying continuous variables associated with the observed item responses of Item 1 to Item 13. The circle at the top is the underlying common factor Anxiety, which represents everything that Item 1 to Item 13 have in common. Each item is associated with a residual factor, which represents everything that is specific to the corresponding item. Item bias is operationalized as across-group differences in intercepts (uniform) and factor loadings (nonuniform)
Results of gender- and age-related item bias detection in the anxiety and depression scales of the HADS questionnaire using the multigroup structural equation modeling (SEM-MG) and multidimensional structural equation modeling (SEM-MD) approaches
| Item | Gender-related item bias | Age-related item bias | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| LOGR1 | IRT2 | CONT3 | SEM-MG4 | SEM-MD5 | LOGR1 | IRT2 | CONT3 | SEM-MG4 | SEM-MD5 | |
|
| ||||||||||
| 1. I feel tense or wound up | – | – | – | – | – | − |
|
|
| − |
| 3. I get a … feeling as if something awful… | – | – | – | – | – | – | – | – |
| – |
| 5. Worrying thoughts go through my mind | – | – | – | – | – | – | – | – | – | – |
| 7. I can sit at ease and feel relaxed | – | – | – | – | – | – | – | – | – | – |
| 9. I get a.. feeling like ‘butterflies’ in the stomach | − | – |
| − |
| – | – | – | – | – |
| 11. I feel restless as if I have to be on the move |
|
|
|
|
| – | – | – | – | – |
| 13. I get sudden feelings of panic | – | – | – | – | – | – | – | – |
|
|
|
| ||||||||||
| 2. I still enjoy the things I used to enjoy | – | – | – | – |
| – | – | – | – | – |
| 4. I can laugh and see the funny side of things | – | – | – |
| – | – | – | – |
|
|
| 6. I feel cheerful | – | – | – | – |
|
|
|
|
|
|
| 8. I feel as if I am slowed down | – | – | – | – | – |
|
|
| – |
|
| 10. I have lost interest in my appearance | – | – | – |
| – |
|
|
|
|
|
| 12. I look forward with enjoyment to things | – | – | – | – | – | – | – | – |
| – |
| 14. I can enjoy… book or radio or TV | – | – | – | – |
| – | – | – |
|
|
Results are compared to the item bias detection results as reported by Cameron et al. [7] from the ordinal logistic regression method (LOGR), the item response theory method (IRT), and the contingency table method (CONT)
aUniform item bias
bNonuniform item bias. Results meeting the criteria for important item bias are marked in bold, results meeting only the significance criterion are marked in italics. Numbers are given only for those item bias detection results that were considered statistically significant
1Log odds ratios are presented, where items were regarded as having important bias if the absolute magnitude of the log odds ratio was greater than 0.64 and p < 0.001
2Contrasts with absolute values greater than 0.50 and p < 0.05 were taken as an indication of important item bias
3Standardized Liu–Agresti cumulative common log odds ratios (LOR Z) are presented, where absolute values <2 and p < .001 are considered important item bias
4Effect size indices d are presented. For uniform item bias, these refer to the difference in intercept parameters between the groups, divided by the pooled standard deviation. For nonuniform item bias these refer to the difference in factor loading parameter multiplied with the difference in common factor means between the groups, divided by the pooled standard deviation. Effect sizes larger than .20 and p < .001 are indicative of important item bias
5Effect size indices r are presented, which are the standardized direct effect of Gender/Age on the specific item. Effect sizes larger than .10 and p < .001 are indicative of important item bias
Fig. 2Multidimensional “no item bias” model for gender- and age-related item bias detection in the anxiety and depression subscales of the HADS. The squares represent the underlying continuous variables associated with the observed item responses of Item 1 to Item 14. The circles at the top are the underlying common factors Anxiety and Depression. Anxiety represents everything that Item 1 to Item 13 have in common, whereas Depression represents everything that Item 2 to Item 14 have in common. Each item is associated with a residual factor, which represents everything that is specific to the corresponding item. The multidimensional model includes two exogenous variables: Gender and Age. Uniform item bias is operationalized as significant direct effects of the exogenous variables on the indicator variables (i.e., Item 1 to Item 14)
Goodness of overall model fit and difference in model fit of the models for gender- and age-related item bias detection models in Stage 2; for both the multigroup structural equation modeling approach, and the multidimensional structural equation modeling approach
| Model |
| CHISQ |
| RMSEA [90% CI] | Compared to |
| CHISQdiff |
| |
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
| Anxiety subscale | |||||||||
| 1a | Measurement Model | 28 | 50.64 | .005 | 0.039 [0.021; 0.056] | ||||
| 1b | No Item Bias Model | 40 | 126.4 | <.001 | 0.064 [0.051; 0.076] | Model 1a | 12 | 75.76 | <.001 |
| 1c | Final Model | 38 | 81.29 | <.001 | 0.046 [0.032; 0.060] | Model 1a | 10 | 30.65 | <.001 |
| Depression subscale | |||||||||
| 2a | Measurement Model | 28 | 46.26 | .016 | 0.035 [0.015; 0.052] | ||||
| 2b | No Item Bias Model | 40 | 120.9 | <.001 | 0.062 [0.049; 0.074] | Model 2a | 12 | 74.63 | <.001 |
| 2c | Final Model | 37 | 70.02 | <.001 | 0.041 [0.026; 0.055] | Model 2a | 9 | 23.76 | .005 |
|
| |||||||||
| Anxiety subscale | |||||||||
| 3a | Measurement Model | 28 | 61.02 | <.001 | 0.047 [0.031; 0.063] | ||||
| 3b | No Item Bias Model | 40 | 163.2 | <.001 | 0.076 [0.064; 0.088] | Model 3a | 12 | 102.1 | <.001 |
| 3c | Final Model | 37 | 81.71 | <.001 | 0.048 [0.04; 0.062] | Model 3a | 9 | 20.69 | .014 |
| Depression subscale | |||||||||
| 4a | Measurement Model | 28 | 42.59 | .038 | 0.031 [0.008; 0.049] | ||||
| 4b | No Item Bias Model | 40 | 357.2 | <.001 | 0.122 [0.111; 0.134] | Model 4a | 12 | 314.6 | <.001 |
| 4c | Final Model | 34 | 83.24 | <.001 | 0.052 [0.038; 0.066] | Model 4a | 6 | 40.65 | <.001 |
|
| |||||||||
| Anxiety and Depression subscale | |||||||||
| 5a | Measurement Model | 76 | 485.05 | <.001 | 0.071 [0.065; 0.077] | ||||
| 5b | No Item Bias Model | 100 | 1029.8 | <.001 | 0.093 [0.088; 0.098] | ||||
| 5c | Final Model | 88 | 455.71 | <.001 | 0.063 [0.057; 0.068] | ||||
N = 1068. Overall model fit and difference in fit was evaluated using WLS Chi-square values that are provided in the standard LISREL output (denoted C2_NNT)