Aya A Mitani¹, Phoebe E Freer², Kerrie P Nelson³.
¹ Department of Biostatistics, Boston University School of Public Health, Boston, MA. Electronic address: amitani@bu.edu.
² Department of Radiology and Imaging Sciences, University of Utah Hospital and Huntsman Cancer Institute, Salt Lake City, UT.
³ Department of Biostatistics, Boston University School of Public Health, Boston, MA.
Abstract
PURPOSE: Interpretation of screening tests such as mammograms usually requires a radiologist's subjective visual assessment of images, often resulting in substantial discrepancies between radiologists' classifications of subjects' test results. In clinical screening studies that assess the strength of agreement between experts, multiple raters are often recruited to classify subjects' test results using an ordinal scale. However, applying traditional measures of agreement in some studies is challenging because of the presence of many raters, the use of an ordinal classification scale, and unbalanced data.
METHODS: We assess and compare the performance of existing measures of agreement and association, as well as a newly developed model-based measure of agreement, by applying them to three large-scale clinical screening studies involving many raters' ordinal classifications. We also conduct a simulation study to demonstrate key properties of the summary measures.
RESULTS: The assessed strength of agreement and association varied according to the choice of summary measure. Some measures were influenced by the underlying disease prevalence and the raters' marginal distributions, and/or were limited to balanced data sets in which every rater classifies every subject. Our simulation study indicated that popular measures of agreement and association are sensitive to the underlying disease prevalence.
CONCLUSIONS: Model-based measures provide a flexible approach for estimating agreement and association and are robust to missing and unbalanced data as well as to the underlying disease prevalence.
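The prevalence sensitivity noted in the RESULTS can be illustrated with a minimal simulation. The sketch below is not the authors' model-based method; it assumes two hypothetical raters making binary calls with fixed sensitivity and specificity (0.85 each), and shows that Cohen's kappa falls as disease prevalence drops even though rater accuracy is unchanged.

```python
import numpy as np

rng = np.random.default_rng(0)

def cohen_kappa(r1, r2):
    """Cohen's kappa for two raters' binary classifications."""
    po = np.mean(r1 == r2)              # observed agreement
    p1, p2 = np.mean(r1), np.mean(r2)   # marginal "positive" rates
    pe = p1 * p2 + (1 - p1) * (1 - p2)  # chance agreement
    return (po - pe) / (1 - pe)

def rate(truth, sens=0.85, spec=0.85):
    """Simulate a rater with fixed sensitivity and specificity (assumed values)."""
    p_pos = np.where(truth == 1, sens, 1 - spec)
    return (rng.random(truth.size) < p_pos).astype(int)

n = 100_000
for prev in (0.05, 0.20, 0.50):
    truth = (rng.random(n) < prev).astype(int)
    kappa = cohen_kappa(rate(truth), rate(truth))
    print(f"prevalence={prev:.2f}  kappa={kappa:.3f}")
```

Under these assumptions, kappa is roughly 0.49 at 50% prevalence but only about 0.15 at 5% prevalence, even though the raters' accuracy never changes, mirroring the prevalence dependence the simulation study reports.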