| Literature DB >> 35798962 |
S van de Weerd1,2,3, E Hong4,5,6, I van den Berg7,8, J W Wijlemans9, J van Vooren9, M W Prins9, F J Wessels9, B C Heeres4, S Roberti10, J Nederend11, J H J M van Krieken2, J M L Roodhart8, R G H Beets-Tan4,6, J P Medema12,13.
Abstract
PURPOSE: To investigate whether locoregional staging of colon cancer by experienced radiologists can be improved by training and feedback to minimize the risk of over-staging into the context of patient selection for neoadjuvant therapy and to identify potential pitfalls of CT staging by characterizing pathologic traits of tumors that remain challenging for radiologists.Entities:
Keywords: Colon cancer; Computed tomography; Learning curve; Neoadjuvant therapy; Radiology
Mesh:
Year: 2022 PMID: 35798962 PMCID: PMC9463303 DOI: 10.1007/s00261-022-03573-7
Source DB: PubMed Journal: Abdom Radiol (NY)
Patient demographics and tumor characteristics
| Characteristic | Number (%) |
|---|---|
Age (median) IQR | 70.5 63.0–77.0 |
Male Female | 21 (47.8) 23 (52.2) |
Cecum Ascending colon Transverse colon Distal transverse colon Descending colon Sigmoid colon | 17 (38.6) 4 (9.1) 3 (6.8) 1 (2.3) 3 (6.8) 16 (36.4) |
pT1 pT2 pT3 pT4 | 3 (6.8) 12 (27.3) 24 (54.5) 5 (11.4) |
pN0 pN1 pN2 | 28 (63.6) 11 (25.0) 5 (11.4) |
IQR Interquartile range
Diagnostic performance of all radiologists in distinguishing T1-2 vs. T3-4 and N0 vs. N1-2 with increasing number of evaluated cases
| Batch | Accuracy | Sensitivity | Specificity | PPV | NPV |
|---|---|---|---|---|---|
| 0 | 60 (15/25) | 87 (13/15) | 20 (2/10) | 62 (13/21) | 50 (2/4) |
| 1 | 58 (29/50) | 57 (20/35) | 60 (9/15) | 77 (20/26) | 38 (9/24) |
| 2 | 67 (30/45) | 63 (19/30) | 73 (11/15) | 83 (19/23) | 50 (11/22) |
| 3 | 82 (41/50) | 86 (30/35) | 73 (11/15) | 88 (30/34) | 69 (11/16) |
| 4 | 80 (40/50) | 77 (23/30) | 85 (17/20) | 88 (23/26) | 71 (17/24) |
| All | 70 (155/220) | 72 (105/145) | 67 (50/75) | 81 (105/130) | 56 (50/90) |
| 0 | 48 (12/25) | 20 (1/5) | 55 (11/20) | 10 (1/10) | 73 (11/15) |
| 1 | 66 (33/50) | 52 (13/25) | 80 (20/25) | 72 (13/18) | 63 (20/32) |
| 2 | 51 (23/45) | 53 (8/15) | 50 (15/30) | 35 (8/23) | 68 (15/22) |
| 3 | 72 (36/50) | 65 (13/20) | 77 (23/30) | 65 (13/20) | 77 (23/30) |
| 4 | 62 (31/50) | 60 (9/15) | 63 (22/35) | 41 (9/22) | 76 (22/28) |
| All | 61 (135/220) | 55 (44/80) | 65 (91/140) | 47 (44/93) | 72 (91/127) |
Note numbers are in percentages, absolute numbers are given between parentheses.
PPV Positive predictive value, NPV Negative predictive value
Fig. 1Learning curves for T-staging (T1-2 vs. T3-4) of colon cancer, separately for each radiologist. Accuracy of radiologists who received feedback (dashed line) and did not receive feedback (solid line) is presented. Accuracy was plotted and fitted using generalized estimating equations logistic regression models (black lines, the lower black line represents the same learning curve for two radiologists). Dots represent the number of correct (around 1.00) or incorrect (around 0.00) staged readings, and gray lines are non-parametric smoothed curves
Fig. 2Contingency tables with radiologic T-stage and pathologic T-stage depicted for batch 0–2 and batch 3–4. All cases were scored by 5 radiologists, adding up to 120 readings within batch 0–2 and 100 readings in batch 3–4. The numbers represent readings. For example, 14 pT3 patients were scored in batch 0–2 by 5 radiologists, adding up to 70 readings of which 36 were correctly scored as cT3
Fig. 3Learning curves for N-staging (N0 vs. N1-2) of colon cancer, separately for each radiologist. Accuracy of radiologists who received feedback (dashed line) and did not receive feedback (solid line) is presented. Accuracy was plotted and fitted using generalized estimating equations logistic regression models (black lines, the lower black line represents the same learning curve for two radiologists). Dots represent the number of correct (around 1.00) or incorrect (around 0.00) staged readings, and gray lines are non-parametric smoothed curves
Fig. 4Contingency tables with radiologic assessment and pathologic assessment of lymph node involvement depicted for batch 0–2 and batch 3–4. The depicted numbers represent the number of readings
Fig. 5Four examples of challenging cases. Panels a + b, c + d, e + f, and g + h represent histology (Hematoxylin & Eosin stain, 2 × magnification) and radiology (axial CT image at tumor level) of corresponding cases. Dashed arrows indicate the muscularis propria and point toward the subserosa. Black arrows indicate invasive tumor area. White arrows point out the tumor on CT. a pT3 tumor with minimal invasion through the muscularis propria. No stromal or inflammatory reaction present in the surrounding subserosa. b Case was under-staged by 3 radiologists and the expert. c pT3 tumor with mucinous phenotype. Invasive part almost completely consists of mucus. d Case was under-staged by 4 radiologists and the expert. e pT2 tumor with a striking immune reaction in the subserosa at the tumor location. f Case was over-staged by 4 radiologists and the expert. G Example of a case with discordant assessment of pT-stage between initial evaluation and second review. Case was restaged from pT2 to pT3. h Case was scored as cT3 by three radiologists and as cT2 by 2 radiologists and the expert.
Reader’s confidence in distinguishing T1-2 vs. T3-4 colon cancer and lymph node involvement using a 0–4 scale with 4 as most certain
| Batch | All readers | With Feedback | Without feedback | |
|---|---|---|---|---|
| 0 | 2.80 ± 0.50 | 2.70 ± 0.48 | 2.87 ± 0.52 | 0.239 |
| 1 | 3.02 ± 0.69 | 2.95 ± 0.76 | 3.07 ± 0.64 | 0.512 |
| 2 | 3.13 ± 0.73 | 2.56 ± 0.51 | 3.52 ± 0.58 | < 0.001 |
| 3 | 3.14 ± 0.67 | 2.90 ± 0.72 | 3.30 ± 0.60 | 0.008 |
| 4 | 2.92 ± 0.63 | 2.60 ± 0.50 | 3.13 ± 0.63 | 0.003 |
| All | 3.03 ± 0.66 | 2.76 ± 0.62 | 3.21 ± 0.62 | < 0.001 |
| 0 | 3.20 ± 0.82 | 3.00 ± 0.82 | 3.33 ± 0.82 | 0.174 |
| 1 | 2.96 ± 0.67 | 2.70 ± 0.73 | 3.13 ± 0.57 | 0.027 |
| 2 | 3.31 ± 0.70 | 2.78 ± 0.65 | 3.67 ± 0.48 | < 0.001 |
| 3 | 3.00 ± 0.76 | 2.45 ± 0.51 | 3.37 ± 0.67 | < 0.001 |
| 4 | 3.04 ± 0.73 | 2.65 ± 0.67 | 3.30 ± 0.65 | < 0.001 |
| All | 3.08 ± 0.73 | 2.68 ± 0.67 | 3.36 ± 0.64 | < 0.001 |
Note numbers are mean ± standard deviation. P values are derived from the comparison of feedback groups
Comparison of reading time in minutes for the different batches
| Batch | All readers | With Feedback | Without feedback | |
|---|---|---|---|---|
| 1 | 5.5 (2.0–15.0) | 7.0 (2.8–15.0) | 5.3 (2.0–13.0) | < 0.001 |
| 2 | 4.7 (2.0–14.5) | 6.1 (2.5–14.5) | 4.0 (2.0–9.7) | < 0.001 |
| 3 | 4.0 (2.0–12.7) | 5.6 (2.0–12.7) | 3.9 (2.4–6.7) | < 0.001 |
| 4 | 3.6 (2.0–9.6) | 3.9 (2.0–9.6) | 3.5 (2.0–6.0) | < 0.001 |
| All | 4.0 (2.0–15.0) | 5.4 (2.0–15.0) | 4.0 (2.0–13.0) | < 0.001 |
Note numbers are median (range). P values are derived from the comparison of feedback groups