| Literature DB >> 35346056 |
E Nichols1, J A Deal2,3, B K Swenor2,4, A G Abraham2,5, N M Armstrong6, K Bandeen-Roche7, M C Carlson8, M Griswold9, F R Lin2,3,8, T H Mosley9, P Y Ramulu4, N S Reed2,3, A R Sharrett2, A L Gross2.
Abstract
BACKGROUND: Item response theory (IRT) methods for addressing differential item functioning (DIF) can detect group differences in responses to individual items (e.g., bias). IRT and DIF-detection methods have been used increasingly often to identify bias in cognitive test performance by characteristics (DIF grouping variables) such as hearing impairment, race, and educational attainment. Previous analyses have not considered the effect of missing data on inferences, although levels of missing cognitive data can be substantial in epidemiologic studies.Entities:
Keywords: Cognition; Differential item functioning; Item response theory; Measurement
Mesh:
Year: 2022 PMID: 35346056 PMCID: PMC8961895 DOI: 10.1186/s12874-022-01572-2
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.612
Demographic characteristics and the percentage of missing data on cognitive test scores in the Atherosclerosis Risk in Communities Neurocognitive Study (ARIC-NCS), 2016–2017
| Age—Mean (SD) | 79.8 (4.7) |
| Sex—Female—N(%) | 2142 (59.8) |
| Race—Black—N (%) | 837 (23.4) |
| Hearing Impairment—Impaired—N (%) | 2614 (73.0) |
| Vision Impairment—Impaired—N (%) | 180 (17.9) |
| Education: Less than HS—N (%) | 428 (12.0) |
| Education: HS or equivalent—N (%) | 1475 (41.2) |
| Education: Beyond HS—N (%) | 1673 (46.8) |
| Cognitive Items—% Missing in Observed Data (N) | |
| Boston Naming Test (30 item) | 8.2% (270) |
| Category Fluency (Animals) | 0.5% (17) |
| Delayed Word Recall | 1.6% (58) |
| Digit Symbol Backwards | 8.8% (289) |
| Digit Symbol Substitution Task | 3.9% (136) |
| Incidental Learning | 4.5% (154) |
| Logical Memory 1 | 9.2% (303) |
| Logical Memory 2 | 9.3% (306) |
| Phonemic Fluency (Sum of 3 Trials) | 1.5% (54) |
| Trail-Making Test A | 3.8% (130) |
| Trail-Making Test B | 19.1% (573) |
HS High school, the sample size of each DIF analysis can be calculated by subtracting the number of records with missing data on a given cognitive test from the total sample size, as each DIF analysis started from the reference scenario of no missing data.
Fig. 1Schematic showing the relationships modeled in the Multiple Indicators, Multiple Causes (MIMIC) model. Underlying cognition leads to cognitive test performance on each of the items included in the cognitive battery. Group can influence underlying cognition, which is also influenced by some error. The dashed arrow connecting group directly to Item 1 is the primary association of interest and represents bias, or the relationship between group and cognitive test item performance, holding underlying cognition constant
Fig. 2DIF estimates for hearing impairment in the reference scenario (No Additional Missingness) compared to the six missingness scenarios for select cognitive test items in the Atherosclerosis Risk in Communities Neurocognitive Study (ARIC-NCS). Each dot represents a difference between the two groups (impaired and unimpaired) for a single threshold of the ordinal test score. When the difference in thresholds is significantly different from zero (indicated by an unfilled circle), this indicates DIF. Differences between the reference scenario and each of the missingness scenarios illustrates the magnitude of error due to missingness. Estimated thresholds in the random scenarios are generally stacked vertically on top of the estimates with no additional missingness, indicating no systematic DIF estimation error. However, estimates for scenarios with missingness among lower cognitive test scores are shifted to the left, indicating systematic DIF estimation error in these scenarios
Fig. 3Distributions of errors in DIF estimates for (A) hearing impairment, (B) moderate to low education, and (C) black race due to different missing data scenarios across all cognitive tests and thresholds in the Atherosclerosis Risk in Communities Neurocognitive Study (ARIC-NCS)
Fig. 4DIF estimates for hearing impairment in the reference scenario (No Additional Missingness) compared to low missingness scenarios with and without hot deck single imputation (single imputation) and Multiple Imputation by Chained Equations (MICE) (multiple imputation) for select cognitive test items in the Atherosclerosis Risk in Communities Neurocognitive Study (ARIC-NCS). Each dot represents a difference between the two groups (impaired and unimpaired) for a single threshold of the ordinal test score. When the difference in thresholds is significantly different from zero (indicated by an unfilled circle), this indicates DIF. Differences between the reference scenario and each of the missingness scenarios illustrates the magnitude of error due to missingness. The scenarios with no imputation are shifted to the left as compared to the reference estimates with no additional missingness, indicating systematic DIF estimation error. However, estimates with both single and multiple imputation more closely align with the reference estimates, illustrating the reduction of DIF estimation error with both imputation strategies
Fig. 5Absolute value of the median error in DIF estimates due to missingness related to cognitive test performance for hearing impairment, black race and moderate to low education in the Atherosclerosis Risk in Communities Neurocognitive Study (ARIC-NCS). Estimates are shown for scenarios with no imputation, hot deck single imputation, and Multiple Imputation by Chained Equations (MICE)
Fig. 6Percentage of DIF estimates with flipped inferences due to missingness related to cognitive test performance for hearing impairment, black race and moderate to low education in the Atherosclerosis Risk in Communities Neurocognitive Study (ARIC-NCS). Estimates are shown for scenarios with no imputation and Multiple Imputation by Chained Equations (MICE). Inferences are considered flipped if the statistical significance of the estimate at the α = 0.05 level is discrepant with the statistical significance of the reference estimate without missing data