| Literature DB >> 35008265 |
Noémie Moreau1,2, Caroline Rousseau3,4, Constance Fourcade1,2, Gianmarco Santini2, Aislinn Brennan2, Ludovic Ferrer4,5, Marie Lacombe4, Camille Guillerminet4, Mathilde Colombié4, Pascal Jézéquel3,4, Mario Campone4,5, Nicolas Normand1, Mathieu Rubeaux2.
Abstract
Metastatic breast cancer patients receive lifelong medication and are regularly monitored for disease progression. The aim of this work was to (1) propose networks to segment breast cancer metastatic lesions on longitudinal whole-body PET/CT and (2) extract imaging biomarkers from the segmentations and evaluate their potential to determine treatment response. Baseline and follow-up PET/CT images of 60 patients from the EPICUREseinmeta study were used to train two deep-learning models to segment breast cancer metastatic lesions: One for baseline images and one for follow-up images. From the automatic segmentations, four imaging biomarkers were computed and evaluated: SULpeak, Total Lesion Glycolysis (TLG), PET Bone Index (PBI) and PET Liver Index (PLI). The first network obtained a mean Dice score of 0.66 on baseline acquisitions. The second network obtained a mean Dice score of 0.58 on follow-up acquisitions. SULpeak, with a 32% decrease between baseline and follow-up, was the biomarker best able to assess patients' response (sensitivity 87%, specificity 87%), followed by TLG (43% decrease, sensitivity 73%, specificity 81%) and PBI (8% decrease, sensitivity 69%, specificity 69%). Our networks constitute promising tools for the automatic segmentation of lesions in patients with metastatic breast cancer allowing treatment response assessment with several biomarkers.Entities:
Keywords: automatic segmentation; deep learning; disease monitoring; imaging biomarkers; metastatic breast cancer
Year: 2021 PMID: 35008265 PMCID: PMC8750371 DOI: 10.3390/cancers14010101
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1(a) U-Net and (b) U-Net networks’ architectures and inputs.
Quantitative evaluation for the two networks on baseline and follow-up acquisitions. (a) Evaluation on validation data. For each network, validation performances were computed for its 5 models from the cross-validation training and then averaged. (b) Evaluation on test data. For each network, models from the cross-validation training were combined in one single ensemble model and test performances were computed with this model. Only 10 unseen patients with one baseline and one follow-up acquisition were used.
| Networks | Acquisitions | Mean Dice | Global Dice | Detection Recall | Detection Precision |
|---|---|---|---|---|---|
| U-Net | Baseline |
| 0.73 | 0.72 | 0.87 |
| Follow-up |
| 0.53 | 0.43 | 0.75 | |
| U-Net | Follow-up |
| 0.64 | 0.63 | 0.78 |
|
|
|
|
|
|
|
| U-Net | Baseline |
| 0.84 | 0.67 | 0.92 |
| Follow-up |
| 0.70 | 0.64 | 0.83 | |
| U-Net | Follow-up |
| 0.77 | 0.75 | 0.88 |
Figure 2Segmentation examples on two acquisitions from the same patient. (a) PET BL, (b) GT BL, (c) U-Net, (d) PET FU, (e) GT FU, (f) U-Net. Zoom on the abdomen: kidneys, spine and bladder are visible. Due to the patient’s response to treatment, lesions on PET FU have a lower contrast than on PET BL and are less visible. BL = Baseline, GT = Ground Truth, FU = Follow-Up.
Figure 3Segmentation examples on two acquisitions from 3 patients from the test dataset. (a–c): Maximum intensity projections of PET images. (d–f): Ground truth segmentation overlaid on the maximum intensity projections of PET images. (g–i): Automatic segmentation overlaid on the maximum intensity projections of PET images. U-Net was used on the baseline acquisition and U-Net on the follow-up acquisitions. For each pair of images: on the left the baseline acquisition and on the right the follow-up acquisition. DSC = dice score between the ground truth and the automatic segmentation. Blue arrows outline discrepancies between manual and automatic segmentations.
Figure 4Graphical representation of each imaging biomarker with x axis biomarkers measured on ground truth segmentations and y axis biomarkers measured on automatic segmentations. The line represents perfect concordance. The concordance and the correlation are evaluated with the Lin’s concordance correlation coefficient (Lin’s CCC) and the Spearman’s rank correlation coefficient (Spearman cor) respectively.
Figure 5Receiver Operating Characteristic (ROC) curve, responders (CR or PR) vs. non-responders (SD or PD).
Biomarker for response assessment according to ROC analysis. Areas Under the Curve (AUCs) were computed on the ROC curve shown in Figure 5. The optimal cutoff value to differentiate between responder and non-responder patients was determined using the Youden’s J statistic method. Sensibility and specificity were computed for this optimal cutoff. P-values are determined using a Mann-Whitney U test for statistical difference between responder and non-responder groups defined by the optimal cutoff.
| Biomarkers | AUC | Optimal Cutoff Value | Sensitivity | Specificity | |
|---|---|---|---|---|---|
| 0.89 | −32% | 87% | 87% | ≤0.001 * | |
| 0.80 | −43% | 73% | 81% | ≤0.001 * | |
| 0.72 | −8% | 69% | 69% | ≤0.001 * | |
| 0.54 | 0% | 53% | 51% | ≤0.001 * |
* Statistically significant.
Figure 6Imaging biomarkers assessment for one patient with partial response. (a) Maximum intensity projection of three PET acquisitions with their biomarkers measured using the automatic segmentation. (b) Graphical representation of each biomarker evaluation across 3 acquisitions (in percentage of the biomarkers from the baseline). BL for Baseline and FU for Follow-up.