| Literature DB >> 24586763 |
Elizabeth S Burnside1, Yunzhi Lin2, Alejandro Munoz del Rio3, Perry J Pickhardt4, Yirong Wu4, Roberta M Strigel4, Mai A Elezaby4, Eve A Kerr5, Diana L Miglioretti6.
Abstract
BACKGROUND: Motivated by the challenges in assessing physician-level cancer screening performance and the negative impact of misclassification, we propose a method (using mammography as an example) that enables confident assertion of adequate or inadequate performance or alternatively recognizes when more data is required.Entities:
Mesh:
Year: 2014 PMID: 24586763 PMCID: PMC3931752 DOI: 10.1371/journal.pone.0089418
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
BI-RADS* final assessment categories with associated recommendation.
| Category | Definition | Recommendation | |
| 0 | Needs additional imaging evaluation | Additional imaging | |
| 1 | Negative | None (routine mammography) | |
| 2 | Benign finding | None (routine mammography) | |
| 3 | Probably benign finding | Short-interval follow-up (6 months) | |
| 4 | Suspicious abnormality | Biopsy | |
| 5 | Highly suggestive of malignancy | Biopsy | |
*BI-RADS Version 4 [22].
Figure 1Defining adequate performance based on volume.
Plots demonstrate our method for constructing curves by using the benchmark threshold as the limit of 95% confidence based on volume: (A) CDR performance levels are established using 2.4 as the lower boundary for 95% CI of adequate performance (CIs shown) and the upper boundary for inadequate performance (CIs not shown). This methodology shows (indicated with a black dot) that a volume of 2770 is required to confidently assert the CDR benchmark median of 4.4/1000 is adequate; (B) RR performance levels are established using 16.8 as the upper boundary for 95% CI of adequate (CI shown) and inadequate (CI not shown) performance. A volume of 120 (indicated with a black dot) is required to confidently assert the RR benchmark median of 9.7% is adequate. Plots define regions of adequate, uncertain, and inadequate performance for (B) CDR and (D) RR.
Distribution of study population.
|
|
|
|
|
|
|
| ||
|
| ||||||||
|
| ||||||||
| <30 | 20 | 0.1 | 0 | 0.0 | 20 | 0.1 | 0.1 | |
| 30–39 | 727 | 2.4 | 2 | 1.2 | 729 | 2.4 | 4.7 | |
| 40–49 | 8205 | 27.2 | 24 | 14.8 | 8229 | 27.1 | 29.3 | |
| 50–59 | 10,339 | 34.2 | 45 | 27.8 | 10,384 | 34.2 | 28.9 | |
| 60–69 | 6796 | 22.5 | 52 | 32.1 | 6848 | 22.6 | 19.1 | |
| 70–79 | 3132 | 10.4 | 26 | 16.0 | 3158 | 10.4 | 13.6 | |
| >80 | 982 | 3.3 | 13 | 8.0 | 995 | 3.3 | 4.2 | |
|
| ||||||||
| Yes | 5818 | 19.3 | 46 | 28.4 | 5864 | 19.3 | 15.2 | |
| No | 23,775 | 78.7 | 114 | 70.4 | 23,889 | 78.7 | 84.8 | |
| Unknown | 608 | 2.0 | 2 | 1.2 | 610 | 2.0 | 17.4 | |
|
| ||||||||
| Yes | 3071 | 10.2 | 74 | 45.7 | 3145 | 10.4 | 6.3 | |
| No | 27,130 | 89.8 | 88 | 54.3 | 27,218 | 89.6 | 93.7 | |
|
| ||||||||
| Yes | 24,484 | 81.1 | 143 | 88.3 | 24,627 | 81.1 | 89.2 | |
| No | 5717 | 18.9 | 19 | 11.7 | 5736 | 18.9 | 10.8 | |
|
| ||||||||
| Yes | 1132 | 3.7 | 21 | 13.0 | 1153 | 3.8 | 3.6 | |
| No | 29,069 | 96.3 | 141 | 87.0 | 29,210 | 96.2 | 96.4 | |
*According to Rosenberg, et al. [19].
Figure 2Individual physician performance assessment based on volume.
Plots of (A) CDR and (B) RR for the 4 included radiologists at 6 volumes from 500 examinations (then at 1000 and subsequently 1000 exam increments) to the maximum volume read over the 3 years or 5000 total (whichever was least).
Figure 3Annual observed performance values as compared to aggregated data.
Annual CDR for each individual radiologist are shown on this bar graph with performance values and lower bound 95% CI summarized below the bar graph. The fourth bar for each physician represents performance over the 3 years of the study period aggregated (“Agg”) into a consolidated performance metric. Performance values in th first row in italics and bold represent performance values that would be characterized as inadequate using previously published benchmark thresholds.