| Literature DB >> 35501512 |
Arianna Defeudis1,2,3, Simone Mazzetti4,5, Jovana Panic5,6, Monica Micilotta7, Lorenzo Vassallo8, Giuliana Giannetto4,5, Marco Gatti8, Riccardo Faletti8, Stefano Cirillo7, Daniele Regge4,5, Valentina Giannini4,5.
Abstract
BACKGROUND: Pathological complete response after neoadjuvant chemoradiotherapy in locally advanced rectal cancer (LARC) is achieved in 15-30% of cases. Our aim was to implement and externally validate a magnetic resonance imaging (MRI)-based radiomics pipeline to predict response to treatment and to investigate the impact of manual and automatic segmentations on the radiomics models.Entities:
Keywords: Artificial intelligence; Machine learning; Multiparametric magnetic resonance imaging; Neoadjuvant therapy; Rectal neoplasms
Mesh:
Year: 2022 PMID: 35501512 PMCID: PMC9061921 DOI: 10.1186/s41747-022-00272-2
Source DB: PubMed Journal: Eur Radiol Exp ISSN: 2509-9280
Multiparametric MRI acquisition
| Parameters | Centre A | Centre B | Centre C | |
|---|---|---|---|---|
| T2w | TR/TE | 7,660/110 ms | 3,231/90 ms | 5,085/100 ms |
| Acquisition matrix | 416 × 224 | 320 × 311 | 512 × 512 | |
| Slice thickness | 4 mm | 3.5 mm | 3 mm | |
| Pixel size | 0.43 × 0.43 mm2 | 0.47 × 0.47 mm2 | 0.8 × 0.8 mm2 | |
| FOV | 220 mm × 220 mm | 240 mm × 240 mm | 250 mm × 250 mm | |
| Flip angle | 90° | 90° | 90° | |
| DWI | TR/TE | 2,000/87 ms | 4,011/91 ms | 2,694/68 ms |
| Acquisition matrix | 96 × 128 | 100 × 98 | 124 × 101 | |
| Slice thickness | 4 mm | 3.5 mm | 3 mm | |
| Pixel size | 0.86 × 0.86 mm2 | 1.88 × 1.88 mm2 | 2.8 × 2.8 mm2 | |
| FOV | 220 mm × 220 mm | 240 mm × 240 mm | 345 mm × 345 mm | |
| Flip angle | 90° | 90° | 90° | |
| 1,200 s/mm2 | 1,000 s/mm2 | 1,000 s/mm2 | ||
FOV Field of view, ms Millisecond, TR/TE Repetition time/time to echo
Fig. 1Examples of automatic segmentations of rectal cancer using the fully convolutional network model (blue line) and manual segmentation (red line) in different slices of two patients. First row: patient no. 25 with a DSC of 0.70; second row: patient no. 2007 with a DSC of 0.65. DSC Dice similarity coefficient, Pt Patient, Sl. Slice
Patient and tumour data
| Total | Construction | Validation | |
|---|---|---|---|
| Number of patients | 95 | 67 | 28 |
| Number of patients per centre | |||
| Centre A | 44 | 44 | – |
| Centre B | 23 | 23 | – |
| Centre C | 28 | – | 28 |
| Sex | 58 M 37 F | 43 M 24 F | 15 M 13 F |
| Median age, years [IQR] | 64 [34–86] | 64 [34–86] | 64 [35–83] |
| TRG | |||
| 1 | 16 | 9 | 7 |
| 2 | 26 | 18 | 8 |
| 3 | 26 | 18 | 8 |
| 4 | 27 | 22 | 5 |
| 5 | 0 | 0 | 0 |
| Median of tumour volume (cc) [IQR] | 21.3 [2.8–232.2] | 23.8 [2.8–232.2] | 15.4 [3.5–66.7] |
IQR Interquartile range, F Female, M Male, TRG Tumour regression grade
Fig. 2Flowchart explaining the dataset subdivision (a). Flowchart illustrating the radiomics pipeline (b). pR Nonresponders, pR+ Responders
Performances for the manual model
| Construction set: centre A + centre B | ||||||
|---|---|---|---|---|---|---|
| AUC | ACC % | SE % | SP % | NPV % | PPV % | |
| MRMR+EL (AdaBoost) | 1.00 (93–100) | 99 (92–100) | 100 (97–100) | 98 (87–100) | 100 (80–99) | 96 (95–100) |
| [66/67] | [39/39] | [27/28] | [27/27] | [39/40] | ||
| Ranking+EL (Bag) | 0.99 (92–100) | 94 (85–98) | 97 (80–98) | 90 (81–100) | 96 (74–96) | 93 (84–100) |
| [63/67] | [37/38] | [26/29] | [26/27] | [37/40] | ||
| Ranking+SVM Gaussian | 0.87 (79–95) | 81 (69–89) | 93 (76–99) | 73 (67–88) | 94 (79–98) | 69 (58–79) |
| [54/67] | [25/27] | [29/40] | [29/31] | [25/36] | ||
| Ranking+LR stepwise | 0.69 (67–85) | 80 (69–89) [54/67] | 89 (71–98) [24/27] | 75 (59–87) [30/40] | 91 (77–97) [30/33] | 70 (57–80) [24/34] |
0.90 (82–97) | 83 (71–90) [55/67] | 85 (66–96) [23/27] | 80 (64–91) [32/40] | 89 (76–95) [32/36] | 74 (60–84) [23/31] | |
| Ranking+SVM polynomial | 0.61 (52–74) | 68 (48–84) [19/28] | 60 (32–83) [9/15] | 77 (47–95) [10/13] | 63 (46–76) [10/16] | 75 (60–90) [9/12] |
AUC Area under the curve, ACC Accuracy, NPV Negative predictive value, PPV Positive predictive value, SE Sensitivity, SP Specificity
Performances for the automatic model
| Construction set: centre A + centre B | ||||||
|---|---|---|---|---|---|---|
| AUC | ACC % | SE % | SP % | NPV % | PPV % | |
| AP+Bayes | 0.75 (67–83) | 72 (59–82) [45/63] | 74 (54–89) [20/27] | 69 (52–84) [25/36] | 77 (65–87) [25/32] | 65 (51–76) [20/31] |
| MRMR+Bayes | 0.78 (69–85) | 74 (60–83) [46/63] | 81 (62–94) [22/27] | 67 (49–81) [24/36] | 83 (68–92) [24/29] | 65 (44–70) [22/34] |
| Ranking+EL (Bag) | 0.90 (87–95) | 68 (55–79) | 96 (81–99) | 47 (30–65) | 94 (71–99) | 58 (50–65) |
| [43/63] | [26/27] | [17/36] | [17/18] | [26/45] | ||
| AP+SVM polynomial | 0.70 (64–82) | 67 (54–78) | 81 (62–94) | 56 (38–72) | 80 (63–90) | 58 (48–67) |
| [42/63] | [22/27] | [20/36] | [20/25] | [22/38] | ||
| Ranking+SVM linear | 0.83 (78–90) | 74 (62–85) | 70 (50–86) | 78 (66–87) | 78 (66–87) | 70 (55–82) |
| [47/63] | [19/27] | [28/36] | [28/36] | [19/27] | ||
0.86 (78–94) | 78 (66–87) [49/63] | 81 (62–94) [22/27] | 75 (58–88) [27/36] | 84 (71–92) [27/32] | 71 (57–82) [22/31] | |
| Ranking+SVM Gaussian | 0.81 (60–89) | 75 (53–88) [21/28] | 80 (50–95) [12/15] | 69 (37–90) [9/13] | 75 (54–86) [9/12] | 75 (55–83) [12/16] |
AUC Area under the curve, ACC Accuracy, NPV Negative predictive value, PPV Positive predictive value, SE Sensitivity, SP Specificity
Fig. 3Error analysis. Patients in the validation dataset misclassified by either manual, automatic, or both approaches, sorted by DSC (a). b Waterfall diagram showing DSC distribution across the validation set for the automatic approach (b). Red bars are the automatic model misclassified errors and green bars the correct ones. Red * highlights patients misclassified by both approaches. DSC Dice similarity coefficient
Fig. 4Examples of three patients segmented by internally developed fully convolutional network model (blue line) and the radiologists (red line). Pt. 2003 with DSC of 0.18 (a) and Pt. 2015 with DSC of 0.76 (b) were wrongly classified by both manual and automatic systems, while Pt. 2028 with DSC of 0.81 (c) was wrongly classified only by the automatic model. DSC Dice similarity coefficient, Pt Patient, Sl. Slice
Results on the hybrid validation for the two best models
| Hybrid validation: centre C | ||||||
|---|---|---|---|---|---|---|
| AUC | ACC % | SE % | SP % | NPV % | PPV % | |
| Train_MAN + Val_AUTO | 0.69 (60–89) | 75 (53–88) [21/28] | 80 (50–95) [12/15] | 69 (37–90) [9/13] | 75 (54–86) [9/12] | 75 (55–83) [12/16] |
| Train_AUTO + Val_MAN | 0.62 (52–73) | 64 (44–81) [18/28] | 47 (21–73) [7/15] | 85 (55–98) [11/13] | 58 (45–70) [11/19] | 78 (47–93) [7/9] |
AUC Area under the curve, ACC Accuracy, NPV Negative predictive value, PPV Positive predictive value, SE Sensitivity, SP Specificity