Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Essay Selection Methods for Adaptive Rater Monitoring.

Literature DB >> 29881078

Essay Selection Methods for Adaptive Rater Monitoring.

Chun Wang¹, Tian Song², Zhuoran Wang¹, Edward Wolfe³.

Abstract

Constructed-response items are commonly used in educational and psychological testing, and the answers to those items are typically scored by human raters. In the current rater monitoring processes, validity scoring is used to ensure that the scores assigned by raters do not deviate severely from the standards of rating quality. In this article, an adaptive rater monitoring approach that may potentially improve the efficiency of current rater monitoring practice is proposed. Based on the Rasch partial credit model and known development in multidimensional computerized adaptive testing, two essay selection methods-namely, the D-optimal method and the Single Fisher information method-are proposed. These two methods intend to select the most appropriate essays based on what is already known about a rater's performance. Simulation studies, using a simulated essay bank and a cloned real essay bank, show that the proposed adaptive rater monitoring methods can recover rater parameters with much fewer essay questions. Future challenges and potential solutions are discussed in the end.

Entities: Species

Keywords: Fisher information matrix; Rasch partial credit model; essay selection; interim scoring

Year: 2016 PMID： 29881078 PMCID： PMC5978486 DOI： 10.1177/0146621616672855

Source DB: PubMed Journal: Appl Psychol Meas ISSN： 0146-6216

Keyword Cloud
References

9 in total

Essay Selection Methods for Adaptive Rater Monitoring.

Review 1. Detecting and measuring rater effects using many-facet Rasch measurement: part I.

2. Detecting and measuring rater effects using many-facet Rasch measurement: Part II.

3. Comparison of Models and Indices for Detecting Rater Centrality.

4. Weighted kappa: nominal scale agreement with provision for scaled disagreement or partial credit.

5. A Family of Rater Accuracy Models.

6. On Latent Trait Estimation in Multidimensional Compensatory Item Response Models.

Review 7. Psychometrics behind Computerized Adaptive Testing.

8. Multidimensional Adaptive Testing with Optimal Design Criteria for Item Selection.

9. The effect of rater severity on person ability measure: a Rasch model analysis.