| Literature DB >> 35136676 |
Dmitrii Bychkov1,2, Heikki Joensuu2,3, Stig Nordling4, Aleksei Tiulpin5,6,7, Hakan Kücükel1,2, Mikael Lundin1, Harri Sihto4, Jorma Isola8, Tiina Lehtimäki9, Pirkko-Liisa Kellokumpu-Lehtinen10, Karl von Smitten11, Johan Lundin1,2,12, Nina Linder1,2,13.
Abstract
BACKGROUND: Prediction of clinical outcomes for individual cancer patients is an important step in the disease diagnosis and subsequently guides the treatment and patient counseling. In this work, we develop and evaluate a joint outcome and biomarker supervised (estrogen receptor expression and ERBB2 expression and gene amplification) multitask deep learning model for prediction of outcome in breast cancer patients in two nation-wide multicenter studies in Finland (the FinProg and FinHer studies). Our approach combines deep learning with expert knowledge to provide more accurate, robust, and integrated prediction of breast cancer outcomes.Entities:
Keywords: Breast cancer; ERBB2 gene; convolutional neural networks; digital pathology; estrogen receptor; multitask deep learning; outcome prediction
Year: 2022 PMID: 35136676 PMCID: PMC8794033 DOI: 10.4103/jpi.jpi_29_21
Source DB: PubMed Journal: J Pathol Inform
Biological characteristics of breast cancers and patient survival in the FinHer and FinProg series
| FinProg Patient Series (original and validation) | FinHer Patient Series | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| |||||||||||
| Training and tuning ( | Internal test set ( | Included patients ( | Total ( | External test set ( | Total ( | |||||||
|
|
|
|
|
|
| |||||||
| Variables: |
| % |
| % |
| % |
| % |
| % |
| % |
| Histological grade | 98 | 14.1 | 68 | 19.2 | 166 | 15.9 | 226 | 17 | 95 | 13.3 | 150 | 14.9 |
| 2 | 244 | 35.2 | 127 | 35.9 | 371 | 35.4 | 450 | 35 | 276 | 38.8 | 397 | 39.3 |
| 3 | 168 | 24.2 | 68 | 19.2 | 236 | 22.5 | 273 | 21 | 303 | 42.6 | 414 | 41.0 |
| NA | 183 | 26.4 | 91 | 25.7 | 274 | 26.2 | 350 | 27 | 38 | 5.3 | 48 | 4.8 |
| Negative | 557 | 80.4 | 288 | 81.4 | 845 | 80.7 | 944 | 73 | 548 | 77.0 | 776 | 76.9 |
| Positive | 136 | 19.6 | 66 | 18.6 | 202 | 19.3 | 216 | 17 | 164 | 23.0 | 233 | 23.1 |
| NA | 139 | 10 | ||||||||||
| ER | ||||||||||||
| Positive | 472 | 68.1 | 243 | 68.6 | 715 | 68.3 | 812 | 63 | 501 | 70.4 | 729 | 72.2 |
| Negative | 221 | 31.9 | 111 | 31.4 | 332 | 31.7 | 364 | 28 | 211 | 29.6 | 280 | 27.8 |
| NA | 123 | 9 | ||||||||||
| Survival* | ||||||||||||
| Censored | 483 | 69.7 | 254 | 71.8 | 737 | 70.4 | 979 | 75 | 593 | 83.3 | 846 | 83.8 |
| Uncensored | 210 | 30.3 | 100 | 28.2 | 310 | 29.6 | 205 | 16 | 119 | 16.7 | 163 | 16.2 |
* FinProg – Breast cancer-specific survival; FinHer – Distant disease-free survival (DDFS); CISH – chromogenic in situ hybridization; NA – not available.
Tissue microarray histological scoring
| Feature | Category | Score |
|---|---|---|
| Mitoses | 0 per HPF | 1 |
| 1 per HPF | 2 | |
| >1 per HPF | 3 | |
| Nuclear pleomorphism | Minimal | 1 |
| Moderate | 2 | |
| Marked | 3 | |
| Tubules | >75% | 1 |
| 10%-75% | 2 | |
| <10% | 3 |
*HPF: High-power field
Figure 1Deep convolutional neural networks were trained on images of hematoxylin and eosin-stained tumor tissue microarray spots from a nationwide breast cancer series (FinProg) to predict risk scores of breast cancer-specific survival. The training was performed using a transfer learning approach with ImageNet pretrained weights. The multitask approach combined outcome-supervised and biomarker-supervised feature learning. At the test phase, the networks generate a risk score for each patient in the test sets which consisted of FinProg test set patients and patients from the FinHer series. Additionally, conventional tissue entities in the tissue microarray spot images in the FinProg test set were assessed by a pathologist, i.e., mitoses, nuclear pleomorphism, tubules, tissue necrosis and tumor-infiltrating lymphocytes. Finally, a survival analysis on expert-derived and deep learning-based features was performed using Cox Proportional Hazards method.
Univariate Cox proportional hazards analysis of tissue characteristics assessed on tissue microarrays within the FinProg test set
|
| HR | 95% CI |
| c-index | |
|---|---|---|---|---|---|
| Mitotic count (TMA) | |||||
| Low | 256 | Reference | 0.57 | ||
| Moderate | 43 | 1.50 | 0.88-2.70 | 0.132 | |
| High | 31 | 2.00 | 1.10-3.60 | ≤ 0.05* | |
| Pleomorphism (TMA) | |||||
| Minimal | 45 | Reference | 0.59 | ||
| Moderate | 193 | 1.90 | 0.86-4.20 | 0.11 | |
| Marked | 92 | 3.00 | 1.34-6.70 | ≤ 0.01** | |
| Tubulus formation (TMA) | |||||
| High | 49 | Reference | 0.54 | ||
| Low | 281 | 2.20 | 1.10-4.60 | ≤ 0.05* | |
| Histological grade (TMA)* | |||||
| I | 74 | Reference | 0.60 | ||
| II | 194 | 2.1 | 1.10-3.80 | ≤ 0.05* | |
| III | 62 | 3.0 | 1.50-6.10 | ≤ 0.01** | |
| Histological grade (WS) | |||||
| I | 64 | Reference | 0.64 | ||
| II | 119 | 2.70 | 1.30-5.30 | ≤ 0.01** | |
| III | 61 | 4.00 | 2.00-8.30 | ≤ 0.001*** | |
| Tumor necrosis (TMA) | |||||
| Absent | 320 | Reference | 0.54 | ||
| Present | 11 | 5.00 | 2.40-10.00 | <0.001*** | |
| Tumor-infiltrating lymphocytes (TMA) | |||||
| Low | 289 | Reference | 0.54 | ||
| High | 50 | 1.60 | 0.94-2.60 | 0.083 | |
| Visual risk (TMA) | |||||
| Low risk | 213 | Reference | 0.58 | ||
| High risk | 114 | 1.80 | 1.20-2.70 | ≤ 0.01** | |
| Axillary lymph node status | |||||
| Negative | 200 | Reference | 0.62 | ||
| Positive | 128 | 2.40 | 1.60-3.60 | ≤ 0.001*** | |
| Tumor size (cm) | 336 | 1.50 | 1.30-1.70 | ≤ 0.001*** | 0.71 |
| “Solo” CNN (TMA) | |||||
| Low risk | 177 | Reference | 0.57 | ||
| High risk | 177 | 1.70 | 1.10-2.60 | ≤ 0.01** | |
| Multitask CNN (TMA) | |||||
| Low risk | 177 | Reference | 0.59 | ||
| High risk | 177 | 2.00 | 1.30-3.00 | ≤ 0.001*** |
*Supplementary Table 1. Association of the variables with breast cancer-specific survival is reported as effect size (HR) and a c-index. Prognostic performance of the “solo” and multitask models is compared to tissue characteristics assessed by a pathologist, as well as to the tumor size and lymph node status. HR: Hazard ratio, c-index: Concordance index, CI: Confidence interval, TMA: Tissue microarrays, WS: Whole-slides, CNN: Convolutional neural networks
Figure 2Multivariate Cox Proportional Hazards analysis of deep learning models together with prognostic factors related to the extent of disease in breast cancer, i.e., spread of the cancer to axillary lymph nodes and size of the primary tumor in the FinProg test set. The results indicate that multitask training (b) was an independent predictor of survival as compared to outcome supervised training only (a)
Multivariate Cox proportional hazards regression of deep learning-based outcome predictions adjusted for tumor histological grade on the independent FinHer (n=674) patient series
| “Solo” CNN | Multitask CNN | ||||||
|---|---|---|---|---|---|---|---|
|
|
| ||||||
|
| HR | 95% CI |
| HR | 95% CI |
| |
| CNN risk score | |||||||
| Low risk | 337 | Reference | Reference | ||||
| High risk | 337 | 1.70 | 1.10-2.50 | 0.009 | 1.50 | 1.00-2.30 | 0.033 |
| Histological grade (WS) | |||||||
| Low (I and II) | 371 | Reference | Reference | ||||
| High (III) | 303 | 1.60 | 1.10-2.30 | 0.022 | 1.50 | 1.00-2.20 | 0.037 |
| c-index, Log-rank | 0.60, <0.001 | 0.59, 0.001 | |||||
WS: Whole-slides, CNN: Convolutional neural networks, HR: Hazard ratio, c-index: Concordance index, CI: Confidence interval