| Literature DB >> 36010844 |
Simona Rabinovici-Cohen1, Xosé M Fernández2, Beatriz Grandal Rejo2, Efrat Hexter1, Oliver Hijano Cubelos2, Juha Pajula3, Harri Pölönen3, Fabien Reyal2, Michal Rosen-Zvi1,4.
Abstract
In current clinical practice, it is difficult to predict whether a patient receiving neoadjuvant chemotherapy (NAC) for breast cancer is likely to encounter recurrence after treatment and have the cancer recur locally in the breast or in other areas of the body. We explore the use of clinical history, immunohistochemical markers, and multiparametric magnetic resonance imaging (DCE, ADC, Dixon) to predict the risk of post-treatment recurrence within five years. We performed a retrospective study on a cohort of 1738 patients from Institut Curie and analyzed the data using classical machine learning, image processing, and deep learning. Our results demonstrate the ability to predict recurrence prior to NAC treatment initiation using each modality alone, and the possible improvement achieved by combining the modalities. When evaluated on holdout data, the multimodal model achieved an AUC of 0.75 (CI: 0.70, 0.80) and 0.57 specificity at 0.90 sensitivity. We then stratified the data based on known prognostic biomarkers. We found that our models can provide accurate recurrence predictions (AUC > 0.89) for specific groups of women under 50 years old with poor prognoses. A version of our method won second place at the BMMR2 Challenge, with a very small margin from being first, and was a standout from the other challenge entries.Entities:
Keywords: breast cancer recurrence; deep learning; image processing; machine learning; magnetic resonance imaging (MRI); neoadjuvant chemotherapy; radiomics
Year: 2022 PMID: 36010844 PMCID: PMC9405765 DOI: 10.3390/cancers14163848
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Figure 1Study setting. Multiparametric MRI and clinical data acquired prior to NAC treatment start are analyzed to predict breast cancer recurrence within five years after diagnosis. The NAC treatment includes six months of chemotherapy with optional targeted treatment followed by surgery. After surgery, there is follow-up and sometimes additional treatment such as radiotherapy.
Cohorts and the number of patients in the dataset.
| Total Number of Patients | Uncensored Patients | |
|---|---|---|
| Clinical cohort | 1627 | 928 |
| MRI+Clinical cohort | 463 | 317 |
| Holdout cohort | 100 | 62 |
Figure 2Multiparametric MRI model architecture. (Top) Subtraction component in which seven adjacent MRI slices (three pre-significant, significant, three post-significant) form the input to seven 2D-CNNs that have the same weights. The features are aggregated into a 3D-CNN followed by an average global pooling layer. (Bottom) Dixon-ADC component in which three 3D MRI volumes form the input to volumetric 3D image processing that generates volumetric features. The features from the two components are concatenated and transformed into the output score.
Evaluation of the models on cross-validation and holdout test. Best results are in bold.
| Model | Cross-Validation | Holdout Test | |
|---|---|---|---|
| 1 | Subtraction-only MRI | 0.67 [0.61, 0.72] | 0.64 [0.60, 0.70] |
| 2 | Multiparametric MRI (mpMRI) | 0.70 [0.65, 0.75] | 0.65 [0.59, 0.70] |
| 3 | Clinical | 0.71 [0.66, 0.76] | 0.71 [0.66, 0.76] |
| 4 | Ensemble subtraction-only MRI and clinical | 0.73 [0.68, 0.78] | 0.73 [0.67, 0.78] |
| 5 | Ensemble multiparametric MRI and clinical (final model) |
|
|
Figure 3Cross-validation and holdout ROC curves for the mpMRI model, clinical model, and final ensemble model. (a) Cross-validation evaluation with ensemble model AUC of 0.75 (95% CI: 0.70, 0.80). (b) Holdout evaluation with ensemble model AUC of 0.75 (95% CI: 0.70, 0.80).
Cross-validation and holdout test of the per-modality models and the ensemble model at sensitivity operation points 0.87, 0.90, and 0.93.
| Cross-Validation | |||
|---|---|---|---|
| Metric | mpMRI | Clinical | Ensemble |
| Specificity | 0.31, 0.18, 0.15 | 0.26, 0.16, 0.14 | 0.39, 0.35, 0.24 |
| F1-score | 0.42, 0.39, 0.39 | 0.40, 0.39, 0.39 | 0.45, 0.45, 0.42 |
| Balanced accuracy | 0.59, 0.54, 0.54 | 0.56, 0.53, 0.53 | 0.63, 0.63, 0.59 |
| PPV | 0.28, 0.25, 0.25 | 0.26, 0.25, 0.25 | 0.30, 0.30, 0.27 |
| NPV | 0.88, 0.86, 0.88 | 0.86, 0.85, 0.87 | 0.91, 0.92, 0.92 |
|
| |||
|
|
|
|
|
| Specificity | 0.26, 0.26, 0.20 | 0.46, 0.46, 0.17 | 0.57, 0.57, 0.48 |
| F1-score | 0.44, 0.44, 0.44 | 0.51, 0.51, 0.44 | 0.56, 0.56, 0.55 |
| Balanced accuracy | 0.57, 0.57, 0.57 | 0.67, 0.67, 0.56 | 0.72, 0.72, 0.71 |
| PPV | 0.29, 0.29, 0.29 | 0.36, 0.36, 0.28 | 0.41, 0.41, 0.39 |
| NPV | 0.86, 0.86, 0.90 | 0.91, 0.91, 0.89 | 0.93, 0.93, 0.96 |
Figure 4Clinical feature contributions. A summary plot of the SHAP values of top features in the clinical model. Each point represents a single patient. The x-axis indicates the effect (either positive or negative) of the feature on the predicted score for the patient. The point’s color represents the value of the features (red = high value, blue = low value, purple = close to the average value).
Subgroup analysis for both cohorts, the MRI+Clinical cohort and the Clinical cohort.
| MRI+Clinical Cohort | |||||||
|---|---|---|---|---|---|---|---|
| # Positives | Age | Cancer | Histological | Tumor | Ki67 | AUC | |
| 1 | 2 pos, 9 neg | ≤50 | Luminal | NST | III | Above 15% | 0.94 [0.75, 1] |
| 2 | 3 pos, 20 neg | ≤50 | No value | NST | III | Above 15% | 0.92 [0.70, 1] |
|
| |||||||
|
|
|
|
|
|
|
| |
| 3 | 2 pos, 10 neg | ≥60 | No value | NST | III | Above 15% | 0.90 [0.67, 1] |
| 4 | 10 pos, 16 neg | ≤50 | Luminal | NST | III | Above 15% | 0.89 [0.74, 1] |
Key feature characteristics in the dataset.
| Feature | Total Patients with Value | Missing % |
|---|---|---|
| Age at diagnosis | 1504 | 13.46% |
| EE grade | 1467 | 15.59% |
| Histological type | 1437 | 17.32% |
| Progesterone status | 1419 | 18.35% |
| Estrogen status | 1416 | 18.53% |
| Weight | 1215 | 30.09% |
| HER2 positive | 1169 | 32.74% |
| Mitotic index | 1080 | 37.86% |
| Ki67 percentage | 1051 | 39.53% |
| Height | 893 | 48.62% |
| Cancer subtype | 797 | 53.82% |