Literature DB >> 29280179

Multiple-rater kappas for binary data: Models and interpretation.

Dietrich Stoyan1, Arne Pommerening2, Manuela Hummel3, Annette Kopp-Schneider3.   

Abstract

Interrater agreement on binary measurements with more than two raters is often assessed using Fleiss' κ, which is known to be difficult to interpret. In situations where the same raters rate all items, however, the far less known κ suggested by Conger, Hubert, and Schouten is more appropriate. We try to support the interpretation of these characteristics by investigating various models or scenarios of rating. Our analysis, which is restricted to binary data, shows that conclusions concerning interrater agreement by κ heavily depend on the population of items or subjects considered, even if the raters have identical behavior. The standard scale proposed by Landis and Koch, which verbally interprets numerical values of κ, appears to be rather subjective. On the basis of one of the models for rater behavior, we suggest an alternative verbal interpretation for kappa. Finally, we reconsider a classical example from pathology to illustrate the application of our methods and models. We also look for subgroups of raters with similar rating behavior using hierarchical clustering.
© 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Keywords:  Conger-Hubert-Schouten kappa; Fleiss’ kappa; binary ratings; carcinoma data; modeling rater behavior

Mesh:

Year:  2017        PMID: 29280179     DOI: 10.1002/bimj.201600267

Source DB:  PubMed          Journal:  Biom J        ISSN: 0323-3847            Impact factor:   2.207


  3 in total

1.  In Search of the Common Elements of Clinical Supervision: A Systematic Review.

Authors:  Mimi Choy-Brown; Daniel Baslock; Charissa Cable; Scott Marsalis; Nathaniel J Williams
Journal:  Adm Policy Ment Health       Date:  2022-02-07

2.  Toward automated assessment of mole similarity on dermoscopic images.

Authors:  Yao Zhang; Kamil Ali; Jacob A George; Jason S Reichenberg; Matthew C Fox; Adewole S Adamson; James W Tunnell; Mia K Markey
Journal:  J Med Imaging (Bellingham)       Date:  2021-02-10

3.  Rating experiments in forestry: How much agreement is there in tree marking?

Authors:  Arne Pommerening; Carlos Pallarés Ramos; Wojciech Kędziora; Jens Haufe; Dietrich Stoyan
Journal:  PLoS One       Date:  2018-03-22       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.