Literature DB >> 29881078

Essay Selection Methods for Adaptive Rater Monitoring.

Chun Wang1, Tian Song2, Zhuoran Wang1, Edward Wolfe3.   

Abstract

Constructed-response items are commonly used in educational and psychological testing, and the answers to those items are typically scored by human raters. In the current rater monitoring processes, validity scoring is used to ensure that the scores assigned by raters do not deviate severely from the standards of rating quality. In this article, an adaptive rater monitoring approach that may potentially improve the efficiency of current rater monitoring practice is proposed. Based on the Rasch partial credit model and known development in multidimensional computerized adaptive testing, two essay selection methods-namely, the D-optimal method and the Single Fisher information method-are proposed. These two methods intend to select the most appropriate essays based on what is already known about a rater's performance. Simulation studies, using a simulated essay bank and a cloned real essay bank, show that the proposed adaptive rater monitoring methods can recover rater parameters with much fewer essay questions. Future challenges and potential solutions are discussed in the end.

Entities:  

Keywords:  Fisher information matrix; Rasch partial credit model; essay selection; interim scoring

Year:  2016        PMID: 29881078      PMCID: PMC5978486          DOI: 10.1177/0146621616672855

Source DB:  PubMed          Journal:  Appl Psychol Meas        ISSN: 0146-6216


  9 in total

Review 1.  Detecting and measuring rater effects using many-facet Rasch measurement: part I.

Authors:  Carol M Myford; Edward W Wolfe
Journal:  J Appl Meas       Date:  2003

2.  Detecting and measuring rater effects using many-facet Rasch measurement: Part II.

Authors:  Carol M Myford; Edward W Wolfe
Journal:  J Appl Meas       Date:  2004

3.  Comparison of Models and Indices for Detecting Rater Centrality.

Authors:  Edward W Wolfe; Tian Song
Journal:  J Appl Meas       Date:  2015

4.  Weighted kappa: nominal scale agreement with provision for scaled disagreement or partial credit.

Authors:  J Cohen
Journal:  Psychol Bull       Date:  1968-10       Impact factor: 17.737

5.  A Family of Rater Accuracy Models.

Authors:  Edward W Wolfe; Hong Jiao; Tian Song
Journal:  J Appl Meas       Date:  2015

6.  On Latent Trait Estimation in Multidimensional Compensatory Item Response Models.

Authors:  Chun Wang
Journal:  Psychometrika       Date:  2014-03-07       Impact factor: 2.500

Review 7.  Psychometrics behind Computerized Adaptive Testing.

Authors:  Hua-Hua Chang
Journal:  Psychometrika       Date:  2014-02-06       Impact factor: 2.500

8.  Multidimensional Adaptive Testing with Optimal Design Criteria for Item Selection.

Authors:  Joris Mulder; Wim J van der Linden
Journal:  Psychometrika       Date:  2008-12-23       Impact factor: 2.500

9.  The effect of rater severity on person ability measure: a Rasch model analysis.

Authors:  M E Lunz; J A Stahl
Journal:  Am J Occup Ther       Date:  1993-04
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.