Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Binary and multi-category ratings in a laboratory observer performance study: a comparison.

Literature DB >> 18975686

Binary and multi-category ratings in a laboratory observer performance study: a comparison.

David Gur¹, Andriy I Bandos, Jill L King, Amy H Klym, Cathy S Cohen, Christiane M Hakim, Lara A Hardesty, Marie A Ganott, Ronald L Perrin, William R Poller, Ratan Shah, Jules H Sumkin, Luisa P Wallace, Howard E Rockette.

Abstract

The authors investigated radiologists, performances during retrospective interpretation of screening mammograms when using a binary decision whether to recall a woman for additional procedures or not and compared it with their receiver operating characteristic (ROC) type performance curves using a semi-continuous rating scale. Under an Institutional Review Board approved protocol nine experienced radiologists independently rated an enriched set of 155 examinations that they had not personally read in the clinic, mixed with other enriched sets of examinations that they had individually read in the clinic, using both a screening BI-RADS rating scale (recall/not recall) and a semi-continuous ROC type rating scale (0 to 100). The vertical distance, namely the difference in sensitivity levels at the same specificity levels, between the empirical ROC curve and the binary operating point were computed for each reader. The vertical distance averaged over all readers was used to assess the proximity of the performance levels under the binary and ROC-type rating scale. There does not appear to be any systematic tendency of the readers towards a better performance when using either of the two rating approaches, namely four readers performed better using the semi-continuous rating scale, four readers performed better with the binary scale, and one reader had the point exactly on the empirical ROC curve. Only one of the nine readers had a binary "operating point" that was statistically distant from the same reader's empirical ROC curve. Reader-specific differences ranged from -0.046 to 0.128 with an average width of the corresponding 95% confidence intervals of 0.2 and p-values ranging for individual readers from 0.050 to 0.966. On average, radiologists performed similarly when using the two rating scales in that the average distance between the run in individual reader's binary operating point and their ROC curve was close to zero. The 95% confidence interval for the fixed-reader average (0.016) was (-0.0206, 0.0631) (two-sided p-value 0.35). In conclusion the authors found that in retrospective observer performance studies the use of a binary response or a semi-continuous rating scale led to consistent results in terms of performance as measured by sensitivity-specificity operating points.

Entities: Disease Gene Species

Mesh：

Year: 2008 PMID： 18975686 PMCID： PMC2627510 DOI： 10.1118/1.2977766

Source DB: PubMed Journal: Med Phys ISSN： 0094-2405 Impact factor: 4.071

20 in total

1. Detection of lung cancer on radiographs: receiver operating characteristic analyses of radiologists', pulmonologists', and anesthesiologists' performance.

Authors: Laurence Monnier-Cholley; Fabrice Carrat; Bernard P Cholley; Jean-Michel Tubiana; Lionel Arrivé
Journal: Radiology Date: 2004-10-14 Impact factor: 11.105

2. The use of continuous and discrete confidence judgments in receiver operating characteristic studies of diagnostic imaging techniques.

Authors: H E Rockette; D Gur; C E Metz
Journal: Invest Radiol Date: 1992-02 Impact factor: 6.016

Review 3. Receiver operating characteristic analysis: a primer.

Authors: John Eng
Journal: Acad Radiol Date: 2005-07 Impact factor: 3.173

Review 4. Bias in research studies.

Authors: Gregory T Sica
Journal: Radiology Date: 2006-03 Impact factor: 11.105

5. A search model and figure of merit for observer data acquired according to the free-response paradigm.

Authors: D P Chakraborty
Journal: Phys Med Biol Date: 2006-07-06 Impact factor: 3.609

6. A statistical method for the comparison of a discrete diagnostic test with several continuous diagnostic tests.

Authors: C A Beam; H S Wieand
Journal: Biometrics Date: 1991-09 Impact factor: 2.571

7. Pulmonary nodules: estimation of malignancy at thin-section helical CT--effect of computer-aided diagnosis on performance of radiologists.

Authors: Kazuo Awai; Kohei Murao; Akio Ozawa; Yoshiharu Nakayama; Takeshi Nakaura; Duo Liu; Koichi Kawanaka; Yoshinori Funama; Shoji Morishita; Yasuyuki Yamashita
Journal: Radiology Date: 2006-02-07 Impact factor: 11.105

8. Effect of observer instruction on ROC study of chest images.

Authors: D Gur; H E Rockette; W F Good; B S Slasky; L A Cooperstein; W H Straub; N A Obuchowski; C E Metz
Journal: Invest Radiol Date: 1990-03 Impact factor: 6.016

9. Ovarian cancer: comparison of observer performance for four methods of interpreting CT scans.

Authors: P J Fultz; C V Jacobs; W J Hall; R Gottlieb; D Rubens; S M Totterman; S Meyers; C Angel; G Del Priore; D P Warshal; K H Zou; D E Shapiro
Journal: Radiology Date: 1999-08 Impact factor: 11.105

10. Signal characteristics of focal liver lesions on double echo T2-weighted conventional spin echo MRI: observer performance versus quantitative measurements of T2 relaxation times.

Authors: H M Fenlon; R Tello; V L deCarvalho; E K Yucel
Journal: J Comput Assist Tomogr Date: 2000 Mar-Apr Impact factor: 1.826

3 in total

1. Stereoscopic interpretation of low-dose breast tomosynthesis projection images.

Authors: Gautam S Muralidhar; Mia K Markey; Alan C Bovik; Tamara Miner Haygood; Tanya W Stephens; William R Geiser; Naveen Garg; Beatriz E Adrada; Basak E Dogan; Selin Carkaci; Raunak Khisty; Gary J Whitman
Journal: J Digit Imaging Date: 2014-04 Impact factor: 4.056

2. Imaging technology and practice assessments: what next?

Authors: David Gur
Journal: Acad Radiol Date: 2009-05 Impact factor: 3.173

3. Agreement of the order of overall performance levels under different reading paradigms.

Authors: David Gur; Andriy I Bandos; Amy H Klym; Cathy S Cohen; Christiane M Hakim; Lara A Hardesty; Marie A Ganott; Ronald L Perrin; William R Poller; Ratan Shah; Jules H Sumkin; Luisa P Wallace; Howard E Rockette
Journal: Acad Radiol Date: 2008-12 Impact factor: 3.173

3 in total