
An Evaluation of Interrater Reliability Measures on Binary Tasks Using d-Prime.

Malcolm J Grant, Cathryn M Button, Brent Snook.

Abstract

Many indices of interrater agreement on binary tasks have been proposed to assess reliability, but none has escaped criticism. In a series of Monte Carlo simulations, five such indices were evaluated using d-prime, an unbiased indicator of raters' ability to distinguish between the true presence and absence of the characteristic being judged. Phi and, to a lesser extent, Kappa coefficients performed best across variations in characteristic prevalence and in raters' expertise and bias. Correlations with d-prime for Percentage Agreement, Scott's Pi, and Gwet's AC1 were markedly lower. In situations where two raters make a series of binary judgments, the findings suggest that researchers should choose Phi or Kappa to assess interrater agreement, because the superiority of these indices was least influenced by variations in the decision environment and in the characteristics of the decision makers.
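
As an illustration of the quantities named in the abstract, the sketch below computes the five agreement indices from a 2x2 table of two raters' binary judgments, along with d-prime from a hit rate and a false-alarm rate. It uses the standard textbook definitions of each index; the abstract does not give the authors' formulas or simulation code, so this is an assumption rather than their implementation. The function names `agreement_indices` and `d_prime` and the cell labels `a`, `b`, `c`, `d` are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def agreement_indices(a, b, c, d):
    """Agreement indices for two raters' binary judgments (textbook formulas).

    a: both raters say "present"; d: both raters say "absent";
    b: only rater 1 says "present"; c: only rater 2 says "present".
    Phi is undefined when any marginal total is zero, and a chance-corrected
    index is undefined when its chance agreement equals 1.
    """
    n = a + b + c + d
    p_o = (a + d) / n                    # observed (percentage) agreement
    p1, p2 = (a + b) / n, (a + c) / n    # each rater's rate of "present"
    p_bar = (p1 + p2) / 2                # pooled rate of "present"

    def chance_corrected(p_e):
        # Shared form (p_o - p_e) / (1 - p_e); the indices differ only in p_e.
        return (p_o - p_e) / (1 - p_e)

    return {
        "percentage_agreement": p_o,
        "phi": (a * d - b * c) / sqrt((a + b) * (c + d) * (a + c) * (b + d)),
        "kappa": chance_corrected(p1 * p2 + (1 - p1) * (1 - p2)),
        "scott_pi": chance_corrected(p_bar ** 2 + (1 - p_bar) ** 2),
        "gwet_ac1": chance_corrected(2 * p_bar * (1 - p_bar)),
    }


def d_prime(hit_rate, false_alarm_rate):
    """d' = z(hit rate) - z(false-alarm rate); both rates must lie in (0, 1)."""
    z = NormalDist().inv_cdf
    return z(hit_rate) - z(false_alarm_rate)
```

For a balanced table such as `agreement_indices(40, 10, 10, 40)`, percentage agreement is 0.8 and the four chance-corrected indices all come out at about 0.6. The indices diverge once prevalence is skewed: with `agreement_indices(90, 5, 5, 0)`, percentage agreement is 0.9 and Gwet's AC1 is high, while Phi and Kappa are slightly negative.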

Keywords:  Kappa; Percentage Agreement; Phi correlation; interrater agreement; reliability; research methods

Year:  2016        PMID: 29881092      PMCID: PMC5978587          DOI: 10.1177/0146621616684584

Source DB:  PubMed          Journal:  Appl Psychol Meas        ISSN: 0146-6216


References: 10 in total

1.  Calculation of signal detection theory measures.

Authors:  H Stanislaw; N Todorov
Journal:  Behav Res Methods Instrum Comput       Date:  1999-02

2.  Measuring agreement between two judges on the presence or absence of a trait.

Authors:  J L Fleiss
Journal:  Biometrics       Date:  1975-09       Impact factor: 2.571

3.  The use of "overall accuracy" to evaluate the validity of screening or diagnostic tests. (Review)

Authors:  Anthony J Alberg; Ji Wan Park; Brant W Hager; Malcolm V Brock; Marie Diener-West
Journal:  J Gen Intern Med       Date:  2004-05       Impact factor: 5.128

4.  Weighted kappa: nominal scale agreement with provision for scaled disagreement or partial credit.

Authors:  J Cohen
Journal:  Psychol Bull       Date:  1968-10       Impact factor: 17.737

5.  High agreement but low kappa: II. Resolving the paradoxes.

Authors:  D V Cicchetti; A R Feinstein
Journal:  J Clin Epidemiol       Date:  1990       Impact factor: 6.437

6.  A Ratio Test of Interrater Agreement With High Specificity.

Authors:  Denis Cousineau; Louis Laurencelle
Journal:  Educ Psychol Meas       Date:  2015-03-25       Impact factor: 2.821

7.  High agreement but low kappa: I. The problems of two paradoxes.

Authors:  A R Feinstein; D V Cicchetti
Journal:  J Clin Epidemiol       Date:  1990       Impact factor: 6.437

8.  Interrater agreement statistics with skewed data: evaluation of alternatives to Cohen's kappa.

Authors:  Shu Xu; Michael F Lorber
Journal:  J Consult Clin Psychol       Date:  2014-08-04

9.  Percent agreement, Pearson's correlation, and kappa as measures of inter-examiner reliability.

Authors:  R J Hunt
Journal:  J Dent Res       Date:  1986-02       Impact factor: 6.116

10.  Detection theory analysis of group data: estimating sensitivity from average hit and false-alarm rates.

Authors:  N A Macmillan; H L Kaplan
Journal:  Psychol Bull       Date:  1985-07       Impact factor: 17.737

Cited by: 4 in total

1.  Learning to teach: A novel method for assessing surgical trainees' teaching and operative knowledge.

Authors:  Leah Furman; Eliza Beth Littleton; Christof Kaltenmeier; Giselle G Hamad
Journal:  Am J Surg       Date:  2020-11-05       Impact factor: 2.565

2.  In Search of the Common Elements of Clinical Supervision: A Systematic Review.

Authors:  Mimi Choy-Brown; Daniel Baslock; Charissa Cable; Scott Marsalis; Nathaniel J Williams
Journal:  Adm Policy Ment Health       Date:  2022-02-07

3.  Development and Validation of a Checklist to Assess Proficient Performance of Basketball Straight Speed Dribbling Skill.

Authors:  Fernando Garbeloto Dos Santos; Matheus Maia Pacheco; Luciano Basso; Flavio Henrique Bastos; Go Tani
Journal:  J Hum Kinet       Date:  2020-01-31       Impact factor: 2.193

4.  Technological State of the Art of Electronic Mental Health Interventions for Major Depressive Disorder: Systematic Literature Review. (Review)

Authors:  Franziska Burger; Mark A Neerincx; Willem-Paul Brinkman
Journal:  J Med Internet Res       Date:  2020-01-20       Impact factor: 5.428

