| Literature DB >> 30672161 |
Sung Hun Kim1, Eun Hye Lee2, Jae Kwan Jun3, You Me Kim4, Yun Woo Chang5, Jin Hwa Lee6, Hye Won Kim7, Eun Jung Choi8.
Abstract
OBJECTIVE: To evaluate the interpretive performance and inter-observer agreement on digital mammographs among radiologists and to investigate whether radiologist characteristics affect performance and agreement.Entities:
Keywords: Medical audit; Observer variation; Radiologists; Screening; Sensitivity and specificity
Mesh:
Year: 2019 PMID: 30672161 PMCID: PMC6342756 DOI: 10.3348/kjr.2018.0193
Source DB: PubMed Journal: Korean J Radiol ISSN: 1229-6929 Impact factor: 3.500
Characteristics of Screen-Detected Cancers
| Number | Age (Yrs) | Density | Lesion Type | Size* (mm) | Pathology | Percentage of Correct Answer |
|---|---|---|---|---|---|---|
| Cancer 1 | 58 | b | Focal asymmetry | 6 | IDC grade 1, node (-) | 100 |
| Cancer 2 | 55 | c | Calcifications | NA | DCIS | 75 |
| Cancer 3 | 49 | d | Focal asymmetry | 15 | IDC grade 1, node (-) | 100 |
| Cancer 4 | 74 | a | Focal asymmetry | 15 | IDC grade 2, node (-) | 100 |
| Cancer 5 | 78 | b | Mass | 8 | IDC grade 1, node (-) | 92 |
| Cancer 6 | 64 | c | Calcifications | 20 | IDC grade 3, node (-) | 83 |
| Cancer 7 | 64 | a | Focal asymmetry | 7 | IDC grade 2, node (-) | 100 |
| Cancer 8 | 69 | a | Focal asymmetry | 2 | IDC grade 1, node (-) | 58 |
| Cancer 9 | 62 | b | Mass | 1 | Microinvasive, node (-) | 100 |
| Cancer 10 | 50 | d | Mass | 15 | IDC grade 2, node (-) | 75 |
| Cancer 11 | 49 | c | Mass | 33 | ILC grade 2 node (+1/21) | 100 |
| Cancer 12 | 52 | c | Calcifications | NA | DCIS | 75 |
*Size of invasive cancer. DCIS = ductal carcinoma in situ, IDC = invasive ductal carcinoma, ILC = invasive lobular carcinoma, NA = not applicable, Yrs = years
Characteristics of Radiologists Participating in Study
| Characteristic | No. of Radiologists (%) |
|---|---|
| Total | 12 (100.0) |
| Years' experience interpreting mammography | |
| < 10 | 7 (58.3) |
| ≥ 10 | 5 (41.7) |
| Fellowship training in breast imaging | |
| Yes | 5 (41.7) |
| No | 7 (58.3) |
| Mean annual diagnostic volume (no. of mammograms) | |
| < 3000 | 9 (75.0) |
| ≥ 3000 | 3 (25.0) |
| Mean annual screening volume (no. of mammograms) | |
| < 3000 | 7 (58.3) |
| ≥ 3000 | 5 (41.7) |
| Percentage of all examinations that were screening mammograms* | |
| < 50 | 5 (41.7) |
| ≥ 50 | 7 (58.3) |
*Average per year over previous 1 year.
Interpretive Performances and Radiologist Characteristics
| Characteristics | Recall Rate, % | No. Cancers Detected* | PPV†, % | Sensitivity, % | Specificity, % | False Positive Rate, % | AUC (Range) |
|---|---|---|---|---|---|---|---|
| Mean (Range) | Mean (Range) | Mean (Range) | Mean (Range) | Mean (Range) | Mean (Range) | ||
| Total | 7.5 (3.3–10.2) | 10.6 (8.0–12.0) | 15.9 (8.8–33.3) | 88.2 (66.7–100.0) | 93.5 (90.6–97.8) | 6.5 (2.2–9.4) | 0.93 (0.82–0.99) |
| Years experience in interpreting mammography | |||||||
| < 10 | 7.2 (3.6–9.6) | 10.4 (8.0–12.0) | 15.1 (11.5–22.2) | 86.9 (66.7–100.0) | 93.7 (91.6–97.2) | 6.3 (2.8–8.5) | 0.92 (0.82–0.99) |
| ≥ 10 | 7.7 (3.3–10.2) | 10.8 (9.0–12.0) | 16.9 (8.8–33.3) | 90.0 (75.0–100.0) | 93.3 (90.6–97.8) | 6.7 (2.2–9.4) | 0.95 (0.90–0.99) |
| | 0.725 | 0.652 | 0.293 | 0.651 | 0.741 | 0.753 | 0.516 |
| Fellowship training in breast imaging | |||||||
| Yes | 8.0 (6.6–10.2) | 10.0 (9.0–11.0) | 12.9 (8.8–15.2) | 83.3 (75.0–91.7) | 92.9 (90.6–94.3) | 7.1 (5.7–9.4) | 0.92 (0.88–0.99) |
| No | 7.0 (3.3–9.6) | 11.0 (8.0–12.0) | 18.1 (10.4–33.3) | 91.7 (66.7–100.0) | 94.0 (91.3–97.8) | 6.0 (2.2–8.7) | 0.95 (0.82–0.99) |
| | 0.487 | 0.208 | 0.192 | 0.206 | 0.425 | 0.432 | 0.389 |
| Mean annual screening volume (no. of mammograms) | |||||||
| < 3000 | 6.9 (3.3–10.2) | 10.0 (8.0–12.0) | 17.3 (8.8–33.3) | 83.3 (66.7–100.0) | 94.0 (90.6–97.8) | 6.0 (2.2–9.4) | 0.91 (0.82–0.99) |
| ≥ 3000 | 8.2 (7.3–9.6) | 11.4 (10.0–12.0) | 14.0 (12.5–15.8) | 95.0 (83.3–100.0) | 92.8 (91.6–93.7) | 7.2 (6.3–8.5) | 0.96 (0.88–0.99) |
| | 0.332 | 0.064 | 0.424 | 0.064 | 0.384 | 0.377 | 0.096 |
| Percentage of all examination that were screening mammograms‡ | |||||||
| < 50 | 6.5 (3.3–10.2) | 10.4 (8.0–12.0) | 19.3 (8.8–33.3) | 86.7 (66.7–100.0) | 94.4 (90.6–97.8) | 5.6 (2.2–9.4) | 0.93 (0.82–0.99) |
| ≥ 50 | 8.1 (6.6–9.6) | 10.7 (10.0–12.0) | 13.5 (10.4–15.8) | 89.3 (83.3–100.0) | 92.9 (91.3–94.3) | 7.1 (5.7–8.7) | 0.94 (0.88–0.99) |
| | 0.258 | 0.707 | 0.138 | 0.706 | 0.263 | 0.258 | 0.884 |
*Total of 12 cancer cases detected per 1000 screening mammograms, †PPV, ‡Average per year over previous year. AUC = area under curve, PPV = positive predictive value
Fig. 1Areas under curve of twelve radiologists ranged from 0.82 to 0.99 with mean value of 0.93.
ROC = receiver-operating-characteristic
Intra- and Inter-Radiologist Agreement and Radiologist Characteristics
| Characteristics | Intra-Radiologist Agreement (n = 150 Cases) | Inter-Radiologist Agreement (n = 1000 Cases) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Percent Agreement | Kappa | Pairwise Percent Agreement | Pairwise Kappa | |||||||
| Range | Mean (95% CI) | Range | Mean (95% CI) | Range | Mean (95% CI) | Range | Mean (95% CI) | |||
| NCSP scale | 0.111 | 0.272 | ||||||||
| 2 categories | 84.7–98.0 | 92.5 (90.1–94.8) | 0.116–0.733 | 0.457 (0.326–0.588) | 84.6–95.7 | 88.7 (87.3–90.1) | 0.118–0.433 | 0.266 (0.213–0.325) | ||
| 4 categories | 78.7–95.3 | 85.4 (82.1–88.6) | 0.444–0.745 | 0.572 (0.514–0.628) | 57.1–88.2 | 76.3 (74.6–77.9) | 0.108–0.507 | 0.342 (0.216–0.463) | ||
| Malignancy scale | 0.169 | 0.631 | ||||||||
| 2 categories | 88.7–98.7 | 94.6 (92.6–96.5) | 0.088–0.686 | 0.410 (0.264–0.557) | 84.6–95.7 | 88.8 (87.3–90.2) | 0.117–0.433 | 0.265 (0.159–0.638) | ||
| 7 categories | 76.0–95.3 | 85.8 (82.0–89.6) | 0.364–0.709 | 0.522 (0.452–0.592) | 55.5–92.1 | 77.2 (75.4–79.1) | 0.129–0.432 | 0.327 (0.246–0.407) | ||
| Years experience in interpreting mammography | 0.239 | 0.809 | ||||||||
| < 10 | 78.7–95.3 | 86.6 (82.6–90.6) | 0.488–0.745 | 0.602 (0.500–0.700) | 57.3–87.9 | 66.6 (64.4–68.8) | 0.108–0.426 | 0.323 (0.190–0.447) | ||
| ≥ 10 | 78.7–88.0 | 83.6 (80.5–86.7) | 0.444–0.666 | 0.529 (0.462–0.596) | 62.6–83.5 | 77.4 (75.8–79.0) | 0.200–0.507 | 0.344 (0.230–0.453) | ||
| Fellowship training in breast imaging | 0.177 | 0.646 | ||||||||
| Yes | 78.7–90.0 | 82.6 (76.6–88.6) | 0.444–0.611 | 0.529 (0.449–0.609) | 58.1–84.3 | 80.6 (79.1–82.0) | 0.174–0.415 | 0.345 (0.225–0.461) | ||
| No | 80.7–95.3 | 87.3 (83.0–91.7) | 0.482–0.745 | 0.602 (0.512–0.691) | 62.6–87.0 | 71.7 (69.7–73.7) | 0.135–0.507 | 0.322 (0.193–0.443) | ||
| Mean annual screening volume (no. of mammograms) | 0.598 | 0.062 | ||||||||
| < 3000 | 78.7–95.3 | 85.4 (81.1–89.8) | 0.444–0.666 | 0.560 (0.510–0.610) | 57.1–88.2 | 75.3 (73.6–76.9) | 0.108–0.507 | 0.285 (0.162–0.400) | ||
| ≥ 3000 | 80.7–90.7 | 85.2 (82.4–88.1) | 0.482–0.745 | 0.588 (0.496–0.679) | 75.2–83.3 | 83.1 (81.5–84.7) | 0.302–0.438 | 0.448 (0.320–0.566) | ||
| Percentage of all examination that were screening mammograms | 0.380 | 0.475 | ||||||||
| < 50 | 78.7–95.3 | 85.1 (80.0–90.2) | 0.444–0.666 | 0.547 (0.478–0.616) | 62.6–83.5 | 77.4 (75.8–79.1) | 0.135–0.507 | 0.298 (0.179–0.417) | ||
| ≥ 50 | 78.7–90.7 | 85.5 (82.4–88.7) | 0.488–0.745 | 0.589 (0.525–0.653) | 57.1–88.2 | 66.6 (64.3–68.8) | 0.170–0.463 | 0.361 (0.234–0.485) | ||
Agreement among radiologist characteristics was calculated using weighted kappa values based on four NCSP categories. CI = confidence interval, NCSP = National Cancer Screening Program