| Literature DB >> 31119567 |
Riku Turkki1,2, Dmitrii Byckhov3, Mikael Lundin3, Jorma Isola4, Stig Nordling5, Panu E Kovanen6, Clare Verrill7,8, Karl von Smitten9, Heikki Joensuu10, Johan Lundin3,11, Nina Linder3,12.
Abstract
PURPOSE: Recent advances in machine learning have enabled better understanding of large and complex visual data. Here, we aim to investigate patient outcome prediction with a machine learning method using only an image of tumour sample as an input.Entities:
Keywords: Breast cancer; Deep learning; Machine learning; Outcome prediction
Mesh:
Substances:
Year: 2019 PMID: 31119567 PMCID: PMC6647903 DOI: 10.1007/s10549-019-05281-1
Source DB: PubMed Journal: Breast Cancer Res Treat ISSN: 0167-6806 Impact factor: 4.872
Fig. 1Workflow for training and testing the digital risk score (DRS) classification. The computational pipeline consists of three sequential steps: (i) feature extraction with a deep convolutional neural network (CNN), (ii) feature pooling with improved Fisher vector encoding (IFV) and principal component analysis (PCA) and (iii) classification with support vector machine (SVM). Training set samples are used in supervision and a separate test set-up used in validation
Patient characteristics
| Variables | Whole data set ( | Test set ( | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Training set ( | Training set ( | Low DRS ( | High DRS ( | |||||||
| % |
| % |
| % |
| % |
| |||
| Number of positive lymph nodes | ||||||||||
| Mean | 1.4 | 1.2 | 0.407 | 0.9 | 1.6 |
| ||||
| 0 | 58 | 504 | 59 | 253 | 0.323 | 63 | 150 | 53 | 103 | 0.057 |
| 1–3 | 24 | 206 | 23 | 99 | 23 | 54 | 23 | 45 | ||
| 4–9 | 8 | 73 | 9 | 38 | 6 | 15 | 12 | 23 | ||
| > 10 | 3 | 30 | 2 | 7 | 1 | 2 | 3 | 5 | ||
| Unknown | 6 | 55 | 8 | 34 | 7 | 16 | 9 | 18 | ||
| Tumour size (per mm) | ||||||||||
| Mean | 23.7 | 23.2 | 0.817 | 2.15 | 25.3 |
| ||||
| Unknown | 3 | 28 | 5 | 22 | 5 | 13 | 5 | 9 | ||
| Histological grade | ||||||||||
| I | 16 | 143 | 19 | 83 | 0.086 | 23 | 54 | 22 | 43 |
|
| II | 34 | 296 | 36 | 154 | 32 | 75 | 41 | 79 | ||
| III | 23 | 197 | 18 | 76 | 14 | 33 | 22 | 43 | ||
| Unknown | 27 | 232 | 27 | 118 | 32 | 75 | 22 | 43 | ||
| Histological type | ||||||||||
| Ductal | 76 | 662 | 77 | 333 | 0.742 | 74 | 175 | 81 | 158 | 0.079 |
| Lobular/special | 24 | 206 | 23 | 98 | 26 | 62 | 19 | 36 | ||
| Age | ||||||||||
| ≤ 39 | 7 | 63 | 7 | 30 | 0.353 | 9 | 21 | 5 | 9 | 0.140 |
| 40–49 | 21 | 186 | 24 | 103 | 27 | 64 | 20 | 39 | ||
| 50–59 | 27 | 234 | 22 | 94 | 21 | 49 | 23 | 45 | ||
| 60–69 | 20 | 172 | 21 | 91 | 20 | 47 | 23 | 44 | ||
| ≥ 70 | 25 | 213 | 26 | 113 | 24 | 56 | 29 | 57 | ||
| ER | ||||||||||
| Negative | 29 | 248 | 27 | 116 | 0.572 | 25 | 60 | 29 | 56 | 0.443 |
| Positive | 62 | 538 | 64 | 274 | 65 | 155 | 61 | 119 | ||
| Unknown | 9 | 82 | 10 | 41 | 9 | 22 | 10 | 19 | ||
| PR | ||||||||||
| Negative | 42 | 362 | 41 | 177 | 0.803 | 36 | 86 | 47 | 91 |
|
| Positive | 49 | 423 | 50 | 215 | 56 | 132 | 43 | 83 | ||
| Unknown | 10 | 83 | 9 | 39 | 8 | 19 | 10 | 20 | ||
| HER2 | ||||||||||
| Negative | 72 | 623 | 74 | 321 | 0.713 | 76 | 181 | 72 | 140 | 0.136 |
| Positive | 17 | 146 | 16 | 70 | 14 | 32 | 20 | 38 | ||
| Unknown | 11 | 99 | 9 | 40 | 10 | 24 | 8 | 16 | ||
Left association of clinicopathological variables in the training and test sets. Right: association of clinicopathological variables between patients in low and high digital risk score (DRS) groups
p-values < 0.05 are shown in bold
Fig. 2Disease-specific survival (DSS) and overall survival (OS) according the classification into low and high digital risk score (DRS) groups
Fig. 3Disease-specific survival (DSS) according the classification into low and high digital risk score (DRS) groups
Fig. 4Disease-specific survival (DSS) according the classification into low and high digital risk group (DRS) groups in patients with different tumour size and nodal status
Cox uni- and multivariate survival analysis
| Variables | Univariate analysis | Multivariate analysis | ||||
|---|---|---|---|---|---|---|
| HR | CI 95% | HR | CI 95% | |||
| DRS | ||||||
| Low | Ref. | Ref. | ||||
| High | 2.10 | (1.33–3.32) |
| 2.04 | (1.20–3.44) |
|
| Number of positive lymph nodes | ||||||
| 0 | Ref. | Ref. | ||||
| 1–3 | 1.53 | (0.89—2.63) | 0.123 | 2.12 | (0.83–1.47) | 0.116 |
| 4–9 | 2.93 | (1.61–5.33) | 2.15 | (0.75–6.19) | 0.154 | |
| > 10 | 7.43 | (2.90–19.02) | 4.75 | (1.17–19.30) |
| |
| Tumour size | ||||||
| Per mm | 1.04 | (1.03–1.06) |
| 1.04 | (1.02–1.06) |
|
| Histological grade | ||||||
| I | Ref. | Ref. | ||||
| II or III | 3.14 | (1.61–6.09) | 1.57 | (0.76–3.20) | 0.220 | |
| Histological type | ||||||
| Ductal | Ref | Ref. | ||||
| Lobular/special | 0.73 | (0.40–1.33) | 0.306 | 0.90 | (0.41–1.95) | 0.782 |
| Age | ||||||
| ≤ 39 | Ref. | Ref. | ||||
| 40–49 | 0.78 | (0.33–1.88) | 0.585 | 0.43 | (0.17–1.12) | 0.084 |
| 50–59 | 0.69 | (0.28–1.70) | 0.425 | 0.48 | (0.19–1.28) | 0.144 |
| 60–69 | 1.00 | (0.42–2.36) | 0.996 | 0.62 | (0.25–1.57) | 0.319 |
| ≥ 70 | 1.58 | (0.66–3.79) | 0.306 | 1.35 | (0.49–3.72) | 0.564 |
| ER | ||||||
| Negative | Ref. | |||||
| Positive | 0.69 | (0.44–1.09) | 0.15 | |||
| PR | ||||||
| Negative | Ref. | Ref. | ||||
| Positive | 0.34 | (0.21–0.55) |
| 0.42 | (0.25–0.71) |
|
| HER2 | ||||||
| Negative | Ref. | |||||
| Positive | 1.51 | (0.90–2.53) | 0.119 | 1.07 | (0.57–1.98) | 0.831 |
| Systematic therapy | ||||||
| Not given | Ref. | Ref. | ||||
| Given | 1.90 | (1.21–2.98) |
| 1.07 | (0.22–1.47) | 0.245 |
| Local therapy | ||||||
| Not given | Ref. | Ref | ||||
| Given | 1.23 | (0.75–2.03) | 0.404 | 1.22 | (0.61–2.46) | 0.571 |
Histological grade and type were assessed from whole tumour sections, while ER, PR and HER2 were assessed from TMAs. In order to meet the Cox proportionality assumption, ER was left out from the multivariate analysis and grade II and III were combined
ER estrogen receptor status, PR progesterone receptor status, HER2 human epidermal growth factor receptor 2 gene amplification
p-values < 0.05 are shown in bold