Literature DB >> 31583903

Testing Differential Item Functioning in Small Samples.

William C M Belzak1.   

Abstract

Differential item functioning (DIF) is a pernicious statistical issue that can mask true group differences on a target latent construct. A considerable amount of research has focused on evaluating methods for testing DIF, such as using likelihood ratio tests in item response theory (IRT). Most of this research has focused on the asymptotic properties of DIF testing, in part because many latent variable methods require large samples to obtain stable parameter estimates. Much less research has evaluated these methods in small sample sizes despite the fact that many social and behavioral scientists frequently encounter small samples in practice. In this article, we examine the extent to which model complexity-the number of model parameters estimated simultaneously-affects the recovery of DIF in small samples. We compare three models that vary in complexity: logistic regression with sum scores, the 1-parameter logistic IRT model, and the 2-parameter logistic IRT model. We expected that logistic regression with sum scores and the 1-parameter logistic IRT model would more accurately estimate DIF because these models yielded more stable estimates despite being misspecified. Indeed, a simulation study and empirical example of adolescent substance use show that, even when data are generated from / assumed to be a 2-parameter logistic IRT, using parsimonious models in small samples leads to more powerful tests of DIF while adequately controlling for Type I error. We also provide evidence for minimum sample sizes needed to detect DIF, and we evaluate whether applying corrections for multiple testing is advisable. Finally, we provide recommendations for applied researchers who conduct DIF analyses in small samples.

Keywords:  differential item functioning; item response theory; logistic regression; measurement invariance; model complexity; small samples

Mesh:

Year:  2019        PMID: 31583903     DOI: 10.1080/00273171.2019.1671162

Source DB:  PubMed          Journal:  Multivariate Behav Res        ISSN: 0027-3171            Impact factor:   5.923


  2 in total

1.  The Early Humor Survey (EHS): A reliable parent-report measure of humor development for 1- to 47-month-olds.

Authors:  Elena Hoicka; Burcu Soy Telli; Eloise Prouten; George Leckie; William J Browne; Gina Mireault; Claire Fox
Journal:  Behav Res Methods       Date:  2021-11-18

2.  A Machine Learning Approach to Assess Differential Item Functioning in Psychometric Questionnaires Using the Elastic Net Regularized Ordinal Logistic Regression in Small Sample Size Groups.

Authors:  Vahid Ebrahimi; Zahra Bagheri; Zahra Shayan; Peyman Jafari
Journal:  Biomed Res Int       Date:  2021-12-15       Impact factor: 3.411

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.