Literature DB >> 24484622

Reliability measures in item response theory: manifest versus latent correlation functions.

Elasma Milanzi1, Geert Molenberghs, Ariel Alonso, Geert Verbeke, Paul De Boeck.   

Abstract

For item response theory (IRT) models, which belong to the class of generalized linear or non-linear mixed models, reliability at the scale of observed scores (i.e., manifest correlation) is more difficult to calculate than latent correlation based reliability, but usually of greater scientific interest. This is not least because it cannot be calculated explicitly when the logit link is used in conjunction with normal random effects. As such, approximations such as Fisher's information coefficient, Cronbach's α, or the latent correlation are calculated, allegedly because it is easy to do so. Cronbach's α has well-known and serious drawbacks, Fisher's information is not meaningful under certain circumstances, and there is an important but often overlooked difference between latent and manifest correlations. Here, manifest correlation refers to correlation between observed scores, while latent correlation refers to correlation between scores at the latent (e.g., logit or probit) scale. Thus, using one in place of the other can lead to erroneous conclusions. Taylor series based reliability measures, which are based on manifest correlation functions, are derived and a careful comparison of reliability measures based on latent correlations, Fisher's information, and exact reliability is carried out. The latent correlations are virtually always considerably higher than their manifest counterparts, Fisher's information measure shows no coherent behaviour (it is even negative in some cases), while the newly introduced Taylor series based approximations reflect the exact reliability very closely. Comparisons among the various types of correlations, for various IRT models, are made using algebraic expressions, Monte Carlo simulations, and data analysis. Given the light computational burden and the performance of Taylor series based reliability measures, their use is recommended.
© 2014 The British Psychological Society.

Keywords:  Rasch model; logit link; one-parameter logistic model; probit link; two-parameter logistic model

Mesh:

Year:  2014        PMID: 24484622     DOI: 10.1111/bmsp.12033

Source DB:  PubMed          Journal:  Br J Math Stat Psychol        ISSN: 0007-1102            Impact factor:   3.380


  4 in total

1.  Modeling Learning in Doubly Multilevel Binary Longitudinal Data Using Generalized Linear Mixed Models: An Application to Measuring and Explaining Word Learning.

Authors:  Sun-Joo Cho; Amanda P Goodwin
Journal:  Psychometrika       Date:  2016-04-01       Impact factor: 2.500

2.  Measurement versus prediction in the construction of patient-reported outcome questionnaires: can we have our cake and eat it?

Authors:  Niels Smits; L Andries van der Ark; Judith M Conijn
Journal:  Qual Life Res       Date:  2017-11-02       Impact factor: 4.147

3.  Typology of Deflation-Corrected Estimators of Reliability.

Authors:  Jari Metsämuuronen
Journal:  Front Psychol       Date:  2022-07-18

4.  Deflation-Corrected Estimators of Reliability.

Authors:  Jari Metsämuuronen
Journal:  Front Psychol       Date:  2022-01-04
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.