Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 An empirical comparison of discrete ratings and subjective probability ratings.

Literature DB >> 12139089

An empirical comparison of discrete ratings and subjective probability ratings.

Kevin S Berbaum¹, Donald D Dorfman, E A Franken, Robert T Caldwell.

Abstract

RATIONALE AND
OBJECTIVES: The authors compared receiver operating characteristic (ROC) data from a five-category discrete scale with that from a 101-category subjective probability scale to determine how well the latter categories define the ROC curve.
MATERIALS AND METHODS: The authors analyzed data from a pilot study performed for another purpose in which 10 radiologists provided both a five-point confidence rating and a subjective probability rating of abnormality for each interpretation. ROC operating points were plotted for a five-category scale and a 101-category scale to determine how well the observed points covered the range of false-positive probabilities. ROC curves were fitted to the subjective probability data according to the standard ROC model.
RESULTS: For these data, subjective probability ratings were somewhat more effective in populating the range of false-positive probability with ROC points. For three observers, the ROC curves inappropriately crossed the chance line. For another four, prevention of such crossing seemed to depend on one or two ROC points near the upper right corner of the ROC space, points based on discriminations within the discrete category "no abnormality to report."
CONCLUSION: Subjective probability rating should provide substantially better coverage of the ROC space with operating points, preventing inappropriate crossing of the chance line. Unfortunately, the protection offered by subjective probability ratings was unreliable and depended on ROC points derived from discriminations not directly related to apparent abnormality. The use of proper ROC models to fit data may offer a better solution.

Mesh：

Year: 2002 PMID： 12139089 DOI： 10.1016/s1076-6332(03)80344-6

Source DB: PubMed Journal: Acad Radiol ISSN： 1076-6332 Impact factor: 3.173

Keyword Cloud
Cited

8 in total

1. Evaluating imaging and computer-aided detection and diagnosis devices at the FDA.

Authors: Brandon D Gallas; Heang-Ping Chan; Carl J D'Orsi; Lori E Dodd; Maryellen L Giger; David Gur; Elizabeth A Krupinski; Charles E Metz; Kyle J Myers; Nancy A Obuchowski; Berkman Sahiner; Alicia Y Toledano; Margarita L Zuley
Journal: Acad Radiol Date: 2012-02-03 Impact factor: 3.173

2. Quasi-continuous and discrete confidence rating scales for observer performance studies: Effects on ROC analysis.

Authors: Lubomir Hadjiiski; Heang-Ping Chan; Berkman Sahiner; Mark A Helvie; Marilyn A Roubidoux
Journal: Acad Radiol Date: 2007-01 Impact factor: 3.173

3. Selection of a rating scale in receiver operating characteristic studies: some remaining issues.

Authors: Howard E Rockette; David Gur
Journal: Acad Radiol Date: 2008-02 Impact factor: 3.173

4. Is an ROC-type response truly always better than a binary response in observer performance studies?

Authors: David Gur; Andriy I Bandos; Howard E Rockette; Margarita L Zuley; Christiane M Hakim; Denise M Chough; Marie A Ganott; Jules H Sumkin
Journal: Acad Radiol Date: 2010-03-16 Impact factor: 3.173

Review 5. New developments in observer performance methodology in medical imaging.

Authors: Dev P Chakraborty
Journal: Semin Nucl Med Date: 2011-11 Impact factor: 4.446

6. Satisfaction of search from detection of pulmonary nodules in computed tomography of the chest.

Authors: Kevin S Berbaum; Kevin M Schartz; Robert T Caldwell; Mark T Madsen; Brad H Thompson; Brian F Mullan; Andrew N Ellingson; Edmund A Franken
Journal: Acad Radiol Date: 2012-10-26 Impact factor: 3.173

7. Binary and multi-category ratings in a laboratory observer performance study: a comparison.

Authors: David Gur; Andriy I Bandos; Jill L King; Amy H Klym; Cathy S Cohen; Christiane M Hakim; Lara A Hardesty; Marie A Ganott; Ronald L Perrin; William R Poller; Ratan Shah; Jules H Sumkin; Luisa P Wallace; Howard E Rockette
Journal: Med Phys Date: 2008-10 Impact factor: 4.071

8. Agreement of the order of overall performance levels under different reading paradigms.

Authors: David Gur; Andriy I Bandos; Amy H Klym; Cathy S Cohen; Christiane M Hakim; Lara A Hardesty; Marie A Ganott; Ronald L Perrin; William R Poller; Ratan Shah; Jules H Sumkin; Luisa P Wallace; Howard E Rockette
Journal: Acad Radiol Date: 2008-12 Impact factor: 3.173

8 in total