| Literature DB >> 35229881 |
Björn Andersson1, Hao Luo2, Kseniia Marcq1.
Abstract
Reliability of scores from psychological or educational assessments provides important information regarding the precision of measurement. The reliability of scores is however population dependent and may vary across groups. In item response theory, this population dependence can be attributed to differential item functioning or to differences in the latent distributions between groups and needs to be accounted for when estimating the reliability of scores for different groups. Here, we introduce group-specific and overall reliability coefficients for sum scores and maximum likelihood ability estimates defined by a multiple group item response theory model. We derive confidence intervals using asymptotic theory and evaluate the empirical properties of estimators and the confidence intervals in a simulation study. The results show that the estimators are largely unbiased and that the confidence intervals are accurate with moderately large sample sizes. We exemplify the approach with the Montreal Cognitive Assessment (MoCA) in two groups defined by education level and give recommendations for applied work.Entities:
Keywords: confidence intervals; item response theory; multiple groups; reliability
Mesh:
Year: 2022 PMID: 35229881 PMCID: PMC9313586 DOI: 10.1111/bmsp.12269
Source DB: PubMed Journal: Br J Math Stat Psychol ISSN: 0007-1102 Impact factor: 2.410
Estimated mean cognitive performance (), variance of cognitive performance (), sum score reliability and maximum likelihood estimate (MLE) reliability in two education levels and overall, with standard errors in parentheses
| Education level |
|
| Sum score reliability | MLE reliability |
|---|---|---|---|---|
| Some formal education | 0 (−) | 1 (−) | 0.736 (0.012) | 0.760 (0.009) |
| No formal education | −1.081 (0.063) | 1.096 (0.103) | 0.785 (0.010) | 0.819 (0.008) |
| All education levels | −0.486 (0.028) | 1.332 (0.066) | 0.806 (0.007) | 0.825 (0.005) |
Bias for the sum score and MLE reliability estimators, with the 14‐ and 28‐item graded response models
|
| Group 1 | Group 2 | All groups | Group 1 | Group 2 | All groups |
|---|---|---|---|---|---|---|
| 14 items | 28 items | |||||
| Bias of sum score reliability estimators | ||||||
| 1,000 | −0.0001 | −0.0009 | 0.0000 | 0.0004 | −0.0004 | 0.0001 |
| 2,000 | −0.0000 | −0.0004 | 0.0001 | 0.0003 | −0.0001 | 0.0001 |
| 4,000 | 0.0001 | −0.0002 | 0.0000 | 0.0003 | −0.0001 | 0.0000 |
| Bias of MLE reliability estimators | ||||||
| 1,000 | 0.0009 | 0.0003 | 0.0010 | 0.0008 | 0.0003 | 0.0006 |
| 2,000 | 0.0004 | 0.0002 | 0.0005 | 0.0005 | 0.0002 | 0.0003 |
| 4,000 | 0.0004 | 0.0001 | 0.0003 | 0.0004 | 0.0000 | 0.0001 |
Empirical coverage rates (%) of 95% confidence intervals for the sum score and MLE reliabilities, with bold font indicating that the coverage rate is statistically significantly different from 95%
|
| Group 1 | Group 2 | All groups | Group 1 | Group 2 | All groups |
|---|---|---|---|---|---|---|
| 14 items | 28 items | |||||
| Sum score reliability estimators | ||||||
| 1,000 | 95.00 | 94.51 | 94.94 | 94.48 | 94.68 | 94.62 |
| 2,000 | 94.86 | 95.04 | 95.14 | 95.01 | 94.71 | 95.15 |
| 4,000 | 95.28 | 94.72 | 95.34 | 94.86 | 95.12 | 94.94 |
| MLE reliability estimators | ||||||
| 1,000 | 94.62 | 94.53 |
|
|
|
|
| 2,000 | 94.72 | 94.70 | 94.44 | 94.71 | 94.65 | 94.83 |
| 4,000 | 95.18 | 94.72 | 95.14 | 94.70 | 94.98 | 94.64 |
Relative efficiency of sum score and MLE reliability estimators from single‐group models relative to estimators from multiple‐group models
|
| Group 1 | Group 2 | All groups | Group 1 | Group 2 | All groups |
|---|---|---|---|---|---|---|
| 14 items | 28 items | |||||
| Sum score reliability estimators | ||||||
| 1,000 | 1.12 | 1.14 | 1.02 | 1.06 | 1.07 | 1.06 |
| 2,000 | 1.14 | 1.13 | 1.04 | 1.07 | 1.06 | 1.14 |
| 4,000 | 1.13 | 1.11 | 1.08 | 1.09 | 1.08 | 1.36 |
| MLE reliability estimators | ||||||
| 1,000 | 1.05 | 1.05 | 1.11 | 1.02 | 1.04 | 1.10 |
| 2,000 | 1.05 | 1.04 | 1.24 | 1.02 | 1.03 | 1.19 |
| 4,000 | 1.05 | 1.05 | 1.49 | 1.03 | 1.03 | 1.33 |