
Measuring rater bias in diagnostic tests with ordinal ratings.

Chanmin Kim, Xiaoyan Lin, Kerrie P Nelson.

Abstract

Diagnostic tests frequently rely on the interpretation of images by skilled raters. In many clinical settings, however, variability between experts' ratings undermines confidence in these interpretations, introducing uncertainty into the diagnostic process. For example, in breast cancer testing, radiologists interpret mammographic images, while breast biopsy results are examined by pathologists; each of these procedures involves elements of subjectivity. We propose a flexible two-stage Bayesian latent variable model to investigate how the skills of individual raters affect the diagnostic accuracy of image-based testing in large-scale medical testing studies. A strength of the proposed model is that it accommodates settings where the true disease status of a patient may or may not be ascertainable within a reasonable time frame. In these studies, many raters each classify a large sample of patients on a defined ordinal grading scale, inducing a complex correlation structure among the ratings. In contrast to currently available methods, our modeling approach separates the sources of variability contributed by experts and patients while accounting for the correlations among ratings. We propose a novel measure of a rater's ability (magnifier) that, unlike conventional measures of sensitivity and specificity, is robust to the underlying prevalence of disease in the population, providing an alternative measure of diagnostic accuracy across patient populations. Extensive simulation studies demonstrate lower bias in the estimation of parameters and accuracy measures, and show that the proposed model outperforms existing models. Receiver operating characteristic curves are derived to assess the diagnostic accuracy of individual experts and their overall performance.
Our proposed modeling approach is applied to a large breast imaging study with known disease status and a uterine cancer dataset with unknown disease status.
© 2021 John Wiley & Sons Ltd.
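The ROC analysis described in the abstract can be illustrated with a minimal, model-free sketch. This is not the authors' Bayesian latent variable model; it is an assumed, standard empirical construction for a single rater grading cases on a hypothetical 1-5 ordinal scale when true disease status is known, where each threshold t calls grades >= t "positive".

```python
# Illustrative sketch (assumption: NOT the paper's two-stage Bayesian model).
# Empirical ROC points and trapezoidal AUC for one rater with ordinal grades
# 1-5 and a known binary disease status per case.

def roc_points(grades, status, levels=range(1, 6)):
    """Return sorted (FPR, TPR) pairs, one per ordinal threshold, plus endpoints."""
    pos = [g for g, s in zip(grades, status) if s == 1]  # diseased cases
    neg = [g for g, s in zip(grades, status) if s == 0]  # non-diseased cases
    pts = [(1.0, 1.0)]  # threshold below lowest grade: everything positive
    for t in levels:
        tpr = sum(g >= t for g in pos) / len(pos)  # sensitivity at threshold t
        fpr = sum(g >= t for g in neg) / len(neg)  # 1 - specificity at t
        pts.append((fpr, tpr))
    pts.append((0.0, 0.0))  # threshold above highest grade: everything negative
    return sorted(pts)

def auc(points):
    """Trapezoidal area under the empirical ROC curve."""
    return sum((x2 - x1) * (y1 + y2) / 2
               for (x1, y1), (x2, y2) in zip(points, points[1:]))
```

Sweeping the ordinal thresholds traces one ROC point per cut-off; per-rater curves like this are what sensitivity/specificity-based accuracy summaries are built from, whereas the paper's "magnifier" measure is designed to avoid their dependence on disease prevalence.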

Keywords:  Bayesian latent variable model; ROC curve; breast imaging; diagnostic test; ordinal ratings; variability

Year: 2021    PMID: 33969509    PMCID: PMC8277718    DOI: 10.1002/sim.9011

Source DB: PubMed    Journal: Stat Med    ISSN: 0277-6715    Impact factor: 2.497


