Literature DB >> 29029991

Summary measures of agreement and association between many raters' ordinal classifications.

Aya A Mitani1, Phoebe E Freer2, Kerrie P Nelson3.   

Abstract

PURPOSE: Interpretation of screening tests such as mammograms usually require a radiologist's subjective visual assessment of images, often resulting in substantial discrepancies between radiologists' classifications of subjects' test results. In clinical screening studies to assess the strength of agreement between experts, multiple raters are often recruited to assess subjects' test results using an ordinal classification scale. However, using traditional measures of agreement in some studies is challenging because of the presence of many raters, the use of an ordinal classification scale, and unbalanced data.
METHODS: We assess and compare the performances of existing measures of agreement and association as well as a newly developed model-based measure of agreement to three large-scale clinical screening studies involving many raters' ordinal classifications. We also conduct a simulation study to demonstrate the key properties of the summary measures.
RESULTS: The assessment of agreement and association varied according to the choice of summary measure. Some measures were influenced by the underlying prevalence of disease and raters' marginal distributions and/or were limited in use to balanced data sets where every rater classifies every subject. Our simulation study indicated that popular measures of agreement and association are prone to underlying disease prevalence.
CONCLUSIONS: Model-based measures provide a flexible approach for calculating agreement and association and are robust to missing and unbalanced data as well as the underlying disease prevalence.
Copyright © 2017 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Agreement; Association; Cohen's kappa; Ordinal classification; Weighted kappa

Mesh:

Year:  2017        PMID: 29029991      PMCID: PMC5687310          DOI: 10.1016/j.annepidem.2017.09.001

Source DB:  PubMed          Journal:  Ann Epidemiol        ISSN: 1047-2797            Impact factor:   3.797


  22 in total

1.  Interval estimation for Cohen's kappa as a measure of agreement.

Authors:  N J Blackman; J J Koval
Journal:  Stat Med       Date:  2000-03-15       Impact factor: 2.373

2.  Categorizing breast mammographic density: intra- and interobserver reproducibility of BI-RADS density categories.

Authors:  S Ciatto; N Houssami; A Apruzzese; E Bassetti; B Brancato; F Carozzi; S Catarzi; M P Lamberini; G Marcelli; R Pellizzoni; B Pesce; G Risso; F Russo; A Scorsolini
Journal:  Breast       Date:  2005-08       Impact factor: 4.380

3.  The exact variance of weighted kappa with multiple raters.

Authors:  Paul W Mielke; Kenneth J Berry; Janis E Johnston
Journal:  Psychol Rep       Date:  2007-10

4.  Assessing the influence of rater and subject characteristics on measures of agreement for ordinal ratings.

Authors:  Kerrie P Nelson; Aya A Mitani; Don Edwards
Journal:  Stat Med       Date:  2017-06-13       Impact factor: 2.373

5.  Bayesian random effects for interrater and test-retest reliability with nested clinical observations.

Authors:  Chuhsing K Hsiao; Pei-Chun Chen; Wen-Hsin Kao
Journal:  J Clin Epidemiol       Date:  2011-02-02       Impact factor: 6.437

6.  An application of hierarchical kappa-type statistics in the assessment of majority agreement among multiple observers.

Authors:  J R Landis; G G Koch
Journal:  Biometrics       Date:  1977-06       Impact factor: 2.571

7.  A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research.

Authors:  Terry K Koo; Mae Y Li
Journal:  J Chiropr Med       Date:  2016-03-31

8.  Reliability testing of the dermatology index of disease severity (DIDS). An index for staging the severity of cutaneous inflammatory disease.

Authors:  H B Faust; R Gonin; T Y Chuang; C W Lewis; C A Melfi; E R Farmer
Journal:  Arch Dermatol       Date:  1997-11

9.  Misclassification of Breast Imaging Reporting and Data System (BI-RADS) Mammographic Density and Implications for Breast Density Reporting Legislation.

Authors:  Charlotte C Gard; Erin J Aiello Bowles; Diana L Miglioretti; Stephen H Taplin; Carolyn M Rutter
Journal:  Breast J       Date:  2015-07-01       Impact factor: 2.431

10.  Radiologist agreement for mammographic recall by case difficulty and finding type.

Authors:  Tracy Onega; Megan Smith; Diana L Miglioretti; Patricia A Carney; Berta A Geller; Karla Kerlikowske; Diana S M Buist; Robert D Rosenberg; Robert A Smith; Edward A Sickles; Sebastien Haneuse; Melissa L Anderson; Bonnie Yankaskas
Journal:  J Am Coll Radiol       Date:  2012-11       Impact factor: 5.532

View more
  7 in total

1.  Methods of assessing categorical agreement between correlated screening tests in clinical studies.

Authors:  Thomas J Zhou; Sughra Raza; Kerrie P Nelson
Journal:  J Appl Stat       Date:  2020-06-09       Impact factor: 1.404

2.  Detection of glioma infiltration at the tumor margin using quantitative stimulated Raman scattering histology.

Authors:  Melike Pekmezci; Ramin A Morshed; Pranathi Chunduru; Balaji Pandian; Jacob Young; Javier E Villanueva-Meyer; Tarik Tihan; Emily A Sloan; Manish K Aghi; Annette M Molinaro; Mitchel S Berger; Shawn L Hervey-Jumper
Journal:  Sci Rep       Date:  2021-06-09       Impact factor: 4.379

3.  Persistent inter-observer variability of breast density assessment using BI-RADS® 5th edition guidelines.

Authors:  Leah H Portnow; Dianne Georgian-Smith; Irfanullah Haider; Mirelys Barrios; Camden P Bay; Kerrie P Nelson; Sughra Raza
Journal:  Clin Imaging       Date:  2021-12-10       Impact factor: 1.605

4.  Test-retest Reliability and Construct Validity of the Satisfaction with Treatment Result Questionnaire in Patients with Hand and Wrist Conditions: A Prospective Study.

Authors:  Willemijn A De Ridder; Yara E van Kooij; Guus M Vermeulen; Harm P Slijper; Ruud W Selles; Robbert M Wouters
Journal:  Clin Orthop Relat Res       Date:  2021-09-01       Impact factor: 4.755

5.  Variability in grading of ductal carcinoma in situ among an international group of pathologists.

Authors:  Esther H Lips; Jelle Wesseling; Maartje van Seijen; Katarzyna Jóźwiak; Sarah E Pinder; Allison Hall; Savitri Krishnamurthy; Jeremy Sj Thomas; Laura C Collins; Jonathan Bijron; Joost Bart; Danielle Cohen; Wen Ng; Ihssane Bouybayoune; Hilary Stobart; Jan Hudecek; Michael Schaapveld; Alastair Thompson
Journal:  J Pathol Clin Res       Date:  2021-02-23

6.  Wheat Spike Blast Image Classification Using Deep Convolutional Neural Networks.

Authors:  Mariela Fernández-Campos; Yu-Ting Huang; Mohammad R Jahanshahi; Tao Wang; Jian Jin; Darcy E P Telenko; Carlos Góngora-Canul; C D Cruz
Journal:  Front Plant Sci       Date:  2021-06-17       Impact factor: 5.753

7.  Prognostic factors of adjuvant chemotherapy discontinuation among stage III colon cancer patients: A survey of medical oncologists and a systematic review and meta-analysis.

Authors:  Devon J Boyne; Dylan E O'Sullivan; Emily V Heer; Robert J Hilsden; Tolulope T Sajobi; Winson Y Cheung; Darren R Brenner; Christine M Friedenreich
Journal:  Cancer Med       Date:  2020-01-21       Impact factor: 4.452

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.