Literature DB >> 35281339

Predictive Fit Metrics for Item Response Models.

Benjamin A Stenhaug1, Benjamin W Domingue1.   

Abstract

The fit of an item response model is typically conceptualized as whether a given model could have generated the data. In this study, for an alternative view of fit, "predictive fit," based on the model's ability to predict new data is advocated. The authors define two prediction tasks: "missing responses prediction"-where the goal is to predict an in-sample person's response to an in-sample item-and "missing persons prediction"-where the goal is to predict an out-of-sample person's string of responses. Based on these prediction tasks, two predictive fit metrics are derived for item response models that assess how well an estimated item response model fits the data-generating model. These metrics are based on long-run out-of-sample predictive performance (i.e., if the data-generating model produced infinite amounts of data, what is the quality of a "model's predictions on average?"). Simulation studies are conducted to identify the prediction-maximizing model across a variety of conditions. For example, defining prediction in terms of missing responses, greater average person ability, and greater item discrimination are all associated with the 3PL model producing relatively worse predictions, and thus lead to greater minimum sample sizes for the 3PL model. In each simulation, the prediction-maximizing model to the model selected by Akaike's information criterion, Bayesian information criterion (BIC), and likelihood ratio tests are compared. It is found that performance of these methods depends on the prediction task of interest. In general, likelihood ratio tests often select overly flexible models, while BIC selects overly parsimonious models. The authors use Programme for International Student Assessment data to demonstrate how to use cross-validation to directly estimate the predictive fit metrics in practice. The implications for item response model selection in operational settings are discussed.
© The Author(s) 2022.

Entities:  

Keywords:  cross-validation; fit; item response theory; model comparison; prediction

Year:  2022        PMID: 35281339      PMCID: PMC8908407          DOI: 10.1177/01466216211066603

Source DB:  PubMed          Journal:  Appl Psychol Meas        ISSN: 0146-6216


  5 in total

1.  Structural Model Evaluation and Modification: An Interval Estimation Approach.

Authors:  J H Steiger
Journal:  Multivariate Behav Res       Date:  1990-04-01       Impact factor: 5.923

2.  Goodness of Fit in Item Response Models.

Authors:  R P McDonald; M M Mok
Journal:  Multivariate Behav Res       Date:  1995-01-01       Impact factor: 5.923

Review 3.  Choosing Prediction Over Explanation in Psychology: Lessons From Machine Learning.

Authors:  Tal Yarkoni; Jacob Westfall
Journal:  Perspect Psychol Sci       Date:  2017-08-25

4.  Latent Class Models for Diary Method Data: Parameter Estimation by Local Computations.

Authors:  Frank Rijmen; Kristof Vansteelandt; Paul De Boeck
Journal:  Psychometrika       Date:  2007-10-04       Impact factor: 2.500

5.  Metric Stability in Item Response Models.

Authors:  Leah M Feuerstahler
Journal:  Multivariate Behav Res       Date:  2020-09-02       Impact factor: 5.923

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.