Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Predictive Fit Metrics for Item Response Models.

Literature DB >> 35281339

Predictive Fit Metrics for Item Response Models.

Benjamin A Stenhaug¹, Benjamin W Domingue¹.

Abstract

The fit of an item response model is typically conceptualized as whether a given model could have generated the data. In this study, for an alternative view of fit, "predictive fit," based on the model's ability to predict new data is advocated. The authors define two prediction tasks: "missing responses prediction"-where the goal is to predict an in-sample person's response to an in-sample item-and "missing persons prediction"-where the goal is to predict an out-of-sample person's string of responses. Based on these prediction tasks, two predictive fit metrics are derived for item response models that assess how well an estimated item response model fits the data-generating model. These metrics are based on long-run out-of-sample predictive performance (i.e., if the data-generating model produced infinite amounts of data, what is the quality of a "model's predictions on average?"). Simulation studies are conducted to identify the prediction-maximizing model across a variety of conditions. For example, defining prediction in terms of missing responses, greater average person ability, and greater item discrimination are all associated with the 3PL model producing relatively worse predictions, and thus lead to greater minimum sample sizes for the 3PL model. In each simulation, the prediction-maximizing model to the model selected by Akaike's information criterion, Bayesian information criterion (BIC), and likelihood ratio tests are compared. It is found that performance of these methods depends on the prediction task of interest. In general, likelihood ratio tests often select overly flexible models, while BIC selects overly parsimonious models. The authors use Programme for International Student Assessment data to demonstrate how to use cross-validation to directly estimate the predictive fit metrics in practice. The implications for item response model selection in operational settings are discussed.

Entities: Chemical

Keywords: cross-validation; fit; item response theory; model comparison; prediction

Year: 2022 PMID： 35281339 PMCID： PMC8908407 DOI： 10.1177/01466216211066603

Source DB: PubMed Journal: Appl Psychol Meas ISSN： 0146-6216

Keyword Cloud
References

5 in total

Predictive Fit Metrics for Item Response Models.

1. Structural Model Evaluation and Modification: An Interval Estimation Approach.

2. Goodness of Fit in Item Response Models.

Review 3. Choosing Prediction Over Explanation in Psychology: Lessons From Machine Learning.

4. Latent Class Models for Diary Method Data: Parameter Estimation by Local Computations.

5. Metric Stability in Item Response Models.