
Large-Sample Variance of Fleiss Generalized Kappa.

Kilem L Gwet.

Abstract

Cohen's kappa coefficient was originally proposed for two raters only, and it was later extended to an arbitrarily large number of raters to become what is known as Fleiss' generalized kappa. Fleiss' generalized kappa and its large-sample variance are still widely used by researchers and have been implemented in several software packages, including, among others, SPSS and the R package "rel." The purpose of this article is to show that the large-sample variance of Fleiss' generalized kappa is systematically being misused, is invalid as a precision measure for kappa, and cannot be used for constructing confidence intervals. A general-purpose variance expression is proposed, which can be used in any statistical inference procedure. A Monte-Carlo experiment is presented, showing the validity of the new variance estimation procedure.
© The Author(s) 2020.
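
For orientation, the point estimate the abstract refers to can be sketched as below. This is a minimal illustration of the standard textbook definition of Fleiss' generalized kappa, not code from the article; the function name `fleiss_kappa` and the input layout (one row per subject, one column per category, each cell a count of raters) are assumptions made for this example. It does not implement the general-purpose variance expression the article proposes, which this record does not reproduce.

```python
def fleiss_kappa(counts):
    """Fleiss' generalized kappa (point estimate only).

    counts[i][k] is the number of raters who assigned subject i to
    category k. Every subject is assumed to be rated by the same
    number of raters (an assumption of this sketch).
    """
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2:
        raise ValueError("at least two raters per subject are required")
    if any(sum(row) != n_raters for row in counts):
        raise ValueError("every subject must have the same number of ratings")
    n_categories = len(counts[0])

    # Observed agreement: average proportion of agreeing rater pairs per subject.
    per_subject = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_a = sum(per_subject) / n_subjects

    # Chance agreement: sum of squared overall category proportions.
    total = n_subjects * n_raters
    p_k = [sum(row[k] for row in counts) / total for k in range(n_categories)]
    p_e = sum(p * p for p in p_k)

    if p_e == 1.0:
        raise ValueError("kappa is undefined when all ratings fall in one category")
    return (p_a - p_e) / (1.0 - p_e)


# Hypothetical data: 2 subjects, 3 raters, 2 categories.
perfect = fleiss_kappa([[3, 0], [0, 3]])
mixed = fleiss_kappa([[2, 1], [1, 2]])
```

The sketch returns only the coefficient itself; any standard error or confidence interval would have to come from a separately chosen variance formula.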


Keywords:  Cohen kappa; Fleiss kappa; Gwet AC1; interrater reliability

Year:  2021        PMID: 34267400      PMCID: PMC8243202          DOI: 10.1177/0013164420973080

Source DB:  PubMed          Journal:  Educ Psychol Meas        ISSN: 0013-1644            Impact factor:   3.088


