Literature DB >> 29795945

Large Sample Confidence Intervals for Item Response Theory Reliability Coefficients.

Björn Andersson1, Tao Xin1.   

Abstract

In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability is typically not reported. In this study, the asymptotic variances of the IRT marginal and test reliability coefficient estimators are derived for dichotomous and polytomous IRT models assuming an underlying asymptotically normally distributed item parameter estimator. The results are used to construct confidence intervals for the reliability coefficients. Simulations are presented which show that the confidence intervals for the test reliability coefficient have good coverage properties in finite samples under a variety of settings with the generalized partial credit model and the three-parameter logistic model. Meanwhile, it is shown that the estimator of the marginal reliability coefficient has finite sample bias resulting in confidence intervals that do not attain the nominal level for small sample sizes but that the bias tends to zero as the sample size increases.

Keywords:  asymptotic variance; confidence intervals; item response theory; reliability

Year:  2017        PMID: 29795945      PMCID: PMC5965626          DOI: 10.1177/0013164417713570

Source DB:  PubMed          Journal:  Educ Psychol Meas        ISSN: 0013-1644            Impact factor:   2.821


  3 in total

1.  Ramsay-curve item response theory (RC-IRT) to detect and correct for nonnormal latent variables.

Authors:  Carol M Woods
Journal:  Psychol Methods       Date:  2006-09

2.  Information matrices and standard errors for MLEs of item parameters in IRT.

Authors:  Ke-Hai Yuan; Ying Cheng; Jeff Patton
Journal:  Psychometrika       Date:  2013-03-27       Impact factor: 2.500

3.  How to compare scores from different depression scales: equating the Patient Health Questionnaire (PHQ) and the ICD-10-Symptom Rating (ISR) using Item Response Theory.

Authors:  H Felix Fischer; Karin Tritt; Burghard F Klapp; Herbert Fliege
Journal:  Int J Methods Psychiatr Res       Date:  2011-10-20       Impact factor: 4.035

  3 in total
  3 in total

1.  Reliability coefficients for multiple group item response theory models.

Authors:  Björn Andersson; Hao Luo; Kseniia Marcq
Journal:  Br J Math Stat Psychol       Date:  2022-03-01       Impact factor: 2.410

2.  Urban-rural disparities in the healthy ageing trajectory in China: a population-based study.

Authors:  Haomiao Li; Yixin Zeng; Jiangyun Chen; Li Gan; Yusupujiang Tuersun; Jiao Yang; Jing Liu
Journal:  BMC Public Health       Date:  2022-07-23       Impact factor: 4.135

3.  Nonrestorative sleep scale: a reliable and valid short form of the traditional Chinese version.

Authors:  S Li; D Y T Fong; J Y H Wong; K Wilkinson; C Shapiro; E P H Choi; B McPherson; E Y Y Lau; C L K Lam; L X Huang; M S M Ip
Journal:  Qual Life Res       Date:  2020-05-16       Impact factor: 4.147

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.