
Observer agreement paradoxes in 2x2 tables: comparison of agreement measures.

Viswanathan Shankar, Shrikant I Bangdiwala.

Abstract

BACKGROUND: Various measures of observer agreement have been proposed for 2 x 2 tables. We examine the behavior of alternative measures of observer agreement for 2 x 2 tables.
METHODS: The alternative measures of observer agreement and the corresponding agreement chart were calculated under various scenarios of marginal distributions (symmetrical or not, balanced or not) and of degree of diagonal agreement, and their behaviors were compared. Specifically, two paradoxes previously identified for kappa were examined: (1) low kappa values despite high observed agreement under highly symmetrically imbalanced marginals, and (2) higher kappa values for asymmetrical imbalanced marginal distributions.
RESULTS: Kappa and alpha behave similarly and are more affected by the marginal distributions than the B-statistic, AC1-index and delta measures. Delta and kappa provide similar values when the marginal totals are asymmetrically imbalanced or symmetrical but not excessively imbalanced. The AC1-index and B-statistic provide closer results when the marginal distributions are symmetrically imbalanced and the observed agreement is greater than 50%. Also, the B-statistic and the AC1-index provide values closer to the observed agreement when the subjects are classified mostly in one of the diagonal cells. Finally, the B-statistic is seen to be consistent and more stable than kappa under both types of paradoxes studied.
CONCLUSIONS: The B-statistic behaved better than the other measures under all scenarios studied, as well as with varying prevalences, sensitivities and specificities. We recommend using the B-statistic along with its corresponding agreement chart as an alternative to kappa when assessing agreement in 2 x 2 tables.
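
ILLUSTRATION: The first paradox can be reproduced with a minimal Python sketch. It is not the authors' code. It uses the commonly cited textbook definitions of Cohen's kappa, Bangdiwala's B-statistic and Gwet's AC1 for a 2 x 2 table, and the two tables are hypothetical counts chosen only so that both have 90% observed agreement. The function names and the (a, b, c, d) cell layout are assumptions made for this example.

```python
from typing import Tuple

# A 2 x 2 table as (a, b, c, d): rows are observer 1, columns are observer 2.
# a and d are the diagonal (agreement) cells. This layout is an assumption.
Table = Tuple[int, int, int, int]


def observed_agreement(t: Table) -> float:
    """Proportion of subjects on the diagonal."""
    a, b, c, d = t
    return (a + d) / (a + b + c + d)


def cohen_kappa(t: Table) -> float:
    """Cohen's kappa: chance agreement from the product of the marginals."""
    a, b, c, d = t
    n = a + b + c + d
    po = (a + d) / n
    pe = ((a + b) * (a + c) + (c + d) * (b + d)) / n**2
    return (po - pe) / (1 - pe)


def bangdiwala_b(t: Table) -> float:
    """Bangdiwala's B: sum of squared diagonal counts over sum of marginal products."""
    a, b, c, d = t
    return (a * a + d * d) / ((a + b) * (a + c) + (c + d) * (b + d))


def gwet_ac1(t: Table) -> float:
    """Gwet's AC1 for two categories: chance agreement is 2 * pi * (1 - pi)."""
    a, b, c, d = t
    n = a + b + c + d
    po = (a + d) / n
    pi = ((a + b) + (a + c)) / (2 * n)  # mean of the two "positive" marginals
    pe = 2 * pi * (1 - pi)
    return (po - pe) / (1 - pe)


# Hypothetical tables, both with 90% observed agreement.
balanced: Table = (45, 5, 5, 45)    # symmetrical, balanced marginals
imbalanced: Table = (90, 5, 5, 0)   # symmetrically imbalanced marginals

for name, table in [("balanced", balanced), ("imbalanced", imbalanced)]:
    print(
        f"{name}: po={observed_agreement(table):.3f} "
        f"kappa={cohen_kappa(table):.3f} "
        f"B={bangdiwala_b(table):.3f} "
        f"AC1={gwet_ac1(table):.3f}"
    )
```

With these hypothetical counts, kappa falls from 0.80 on the balanced table to roughly -0.05 on the imbalanced one, while B and AC1 stay near 0.89 to 0.90, close to the observed agreement.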

Year:  2014        PMID: 25168681      PMCID: PMC4236536          DOI: 10.1186/1471-2288-14-100

Source DB:  PubMed          Journal:  BMC Med Res Methodol        ISSN: 1471-2288            Impact factor:   4.615


