
Kappa and Rater Accuracy: Paradigms and Parameters.

Anthony J Conger.

Abstract

Drawing parallels to classical test theory, this article clarifies the difference between rater accuracy and reliability and demonstrates how category marginal frequencies affect rater agreement and Cohen's kappa (κ). Category assignment paradigms are developed: comparing raters to a standard (index) versus comparing two raters to one another (concordance), using both nonstochastic and stochastic category membership. Using a probability model to express category assignments in terms of rater accuracy and random error, it is shown that observed agreement (Po) depends only on rater accuracy and number of categories; however, expected agreement (Pe) and κ depend additionally on category frequencies. Moreover, category frequencies affect Pe and κ solely through the variance of the category proportions, regardless of the specific frequencies underlying the variance. Paradoxically, some judgment paradigms involving stochastic categories are shown to yield higher κ values than their nonstochastic counterparts. Using the stated probability model, assignments to categories were generated for 552 combinations of paradigms, rater and category parameters, category frequencies, and number of stimuli. Observed means and standard errors for Po, Pe, and κ were fully consistent with theory expectations. Guidelines for interpretation of rater accuracy and reliability are offered, along with a discussion of alternatives to the basic model.
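The dependence of Po, Pe, and κ on accuracy and category frequencies can be illustrated with a small sketch. The abstract does not spell out the probability model, so the following is an assumption made purely for illustration: each stimulus has a fixed true category, each of two equally accurate raters assigns that category with probability `accuracy` and otherwise guesses uniformly among the k categories, and the raters err independently. The function name and parameters are hypothetical and are not taken from the article.

```python
def agreement_under_guessing_model(accuracy, category_props):
    """Return (Po, Pe, kappa) for two raters compared with each other.

    ASSUMED model (not taken from the article): with probability `accuracy`
    a rater assigns the true category; otherwise the rater guesses uniformly
    among the k categories. Raters err independently of each other.
    """
    k = len(category_props)
    guess = (1.0 - accuracy) / k   # chance of landing on any one category by guessing
    hit = accuracy + guess         # chance of assigning the true category

    # Raters agree if both assign the true category or both assign the
    # same one of the k - 1 wrong categories.
    po = hit ** 2 + (k - 1) * guess ** 2

    # Proportion of stimuli each rater is expected to place in category j.
    marginals = [accuracy * p + guess for p in category_props]
    pe = sum(m * m for m in marginals)

    kappa = (po - pe) / (1.0 - pe)
    return po, pe, kappa


if __name__ == "__main__":
    for props in ([0.5, 0.5], [0.9, 0.1]):
        po, pe, kappa = agreement_under_guessing_model(0.8, props)
        print(props, round(po, 4), round(pe, 4), round(kappa, 4))
```

Under this assumed model, Po is the same for both frequency settings, while Pe rises and κ falls as the category frequencies become more unequal. Pe depends on the frequencies only through the sum of squared category proportions, which for a fixed number of categories is a linear function of their variance.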

Keywords:  interrater reliability; kappa; nominal scales; rater accuracy

Year: 2016 | PMID: 29795943 | PMCID: PMC5965649 | DOI: 10.1177/0013164416663277

Source DB: PubMed | Journal: Educ Psychol Meas | ISSN: 0013-1644 | Impact factor: 2.821



