| Literature DB >> 31576001 |
Shazia Akbar1,2,3, Mohammad Peikari4, Sherine Salama5, Azadeh Yazdan Panah5, Sharon Nofech-Mozes5, Anne L Martel6,4,7.
Abstract
The residual cancer burden index is an important quantitative measure used for assessing treatment response following neoadjuvant therapy for breast cancer. It has shown to be predictive of overall survival and is composed of two key metrics: qualitative assessment of lymph nodes and the percentage of invasive or in situ tumour cellularity (TC) in the tumour bed (TB). Currently, TC is assessed through eye-balling of routine histopathology slides estimating the proportion of tumour cells within the TB. With the advances in production of digitized slides and increasing availability of slide scanners in pathology laboratories, there is potential to measure TC using automated algorithms with greater precision and accuracy. We describe two methods for automated TC scoring: 1) a traditional approach to image analysis development whereby we mimic the pathologists' workflow, and 2) a recent development in artificial intelligence in which features are learned automatically in deep neural networks using image data alone. We show strong agreements between automated and manual analysis of digital slides. Agreements between our trained deep neural networks and experts in this study (0.82) approach the inter-rater agreements between pathologists (0.89). We also reveal properties that are captured when we apply deep neural network to whole slide images, and discuss the potential of using such visualisations to improve upon TC assessment in the future.Entities:
Mesh:
Year: 2019 PMID: 31576001 PMCID: PMC6773948 DOI: 10.1038/s41598-019-50568-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1TB outlined in black in a digital slide scanned at 20X magnification (displayed at lower resolution). Regions of interest are shown in a higher magnification on the right alongside TC scores provided by an expert pathologist.
Figure 2Overview of two methods for generating automated TC score. Hand-engineered feature approach is shown above and a cascade approach using two deep convolutional neural networks below.
Two-way intra-class correlation (ICC) coefficients between two pathologists, and two automated methods for predicting TC scores.
| ICC Coefficient (95% CI) | ||||
|---|---|---|---|---|
| Pathologist A | Hand-engineered[ | DCNN[ | Combined | |
| Pathologist A | — | 0.74 [0.70, 0.77] | 0.76 [0.74, 0.79] | |
| Pathologist B | 0.89 [0.70, 0.95] | 0.75 [0.71, 0.79] | 0.79 [0.76, 0.81] | |
Upper and lower bounds are given in square brackets.
Figure 3TC scores between 0% and 100% predicted by a hand-engineered approach (top) and deep neural networks (bottom) against scores provided by both expert pathologists (Pathologist A, Pathologist B).
Figure 4Boxplot of distribution of scores within low (0-30%), medium (30-70%) and high (>70%) ranges of TC.
Figure 5Subset of results from TC test dataset for healthy tissue, and low/medium/high TC categories (top to bottom). Scores are given for both automated systems (H = hand-engineered features, D = deep convolutional neural networks) and Pathologist A (P).
Figure 6TC scores produced by a trained deep neural network overlaid on whole slide images. Scores are provided on a patch-by-patch level, where blue denotes healthy (0% TC) and red denotes 100% TC. Some close-up results of cellularity scores are provided to the right of each whole slide image.
Clinical pathology characteristics of patients in reported study.
| Criteria | Total | |
|---|---|---|
| Age at Diagnosis | 30–39 | 12 |
| 40–49 | 17 | |
| 50–59 | 11 | |
| 60–72 | 13 | |
| Histology | Invasive ductal carcinoma (IDC) | 50 |
| Invasive lobular carcinoma (ILC) | 4 | |
| Invasive mammary carcinoma (IMC) | 1 | |
| Grade | 1 | 8 |
| 2 | 29 | |
| 3 | 15 | |
| ER | Positive | 37 |
| Negative | 16 | |
| PR | Positive | 30 |
| Negative | 22 | |
| HER2 | Positive | 11 |
| Negative | 42 | |
Note that multiple WSIs were prepared per patient, and therefore one patient may share multiple characteristics.
Number of patches in training and test set which fall into each TC score range.
| TC Score Range | ||||
|---|---|---|---|---|
| 0% | 1–30% | 31–70% | >70% | |
| Train (Pathologist A) | 701 | 840 | 665 | 373 |
| Test (Pathologist A) | 242 | 225 | 301 | 353 |
| Test (Pathologist B) | 237 | 312 | 375 | 197 |