| Literature DB >> 35740648 |
Nikita Shvetsov1, Morten Grønnesby2, Edvard Pedersen1, Kajsa Møllersen3, Lill-Tove Rasmussen Busund2,4, Ruth Schwienbacher2,4, Lars Ailo Bongo1, Thomas Karsten Kilvaer5,6.
Abstract
Increased levels of tumor-infiltrating lymphocytes (TILs) indicate favorable outcomes in many types of cancer. The manual quantification of immune cells is inaccurate and time-consuming for pathologists. Our aim is to leverage a computational solution to automatically quantify TILs in standard diagnostic hematoxylin and eosin-stained sections (H&E slides) from lung cancer patients. Our approach is to transfer an open-source machine learning method for the segmentation and classification of nuclei in H&E slides trained on public data to TIL quantification without manual labeling of the data. Our results show that the resulting TIL quantification correlates to the patient prognosis and compares favorably to the current state-of-the-art method for immune cell detection in non-small cell lung cancer (current standard CD8 cells in DAB-stained TMAs HR 0.34, 95% CI 0.17-0.68 vs. TILs in HE WSIs: HoVer-Net PanNuke Aug Model HR 0.30, 95% CI 0.15-0.60 and HoVer-Net MoNuSAC Aug model HR 0.27, 95% CI 0.14-0.53). Our approach bridges the gap between machine learning research, translational clinical research and clinical implementation. However, further validation is warranted before implementation in a clinical setting.Entities:
Keywords: NSCLC; deep learning; digital pathology; tumor-infiltrating lymphocytes
Year: 2022 PMID: 35740648 PMCID: PMC9221016 DOI: 10.3390/cancers14122974
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Characteristics of the datasets providing annotations and/or classifications publicly available for the training and validation of models for instant cell segmentation and classification.
| Dataset | CoNSeP | PanNuke | MoNuSAC | CRCHisto * | TNBC | MoNuSeg | CPM-15 | CPM-17 |
|---|---|---|---|---|---|---|---|---|
| Author | Graham | Gamper | Verma | Sirinukunwattana | Naylor | Kumar | Vu | Vu |
| Year | 2019 | 2020 | 2020 | 2019 | 2017 | 2017 | 2019 | 2019 |
| Origin | UHCW | UHCW/TCGA | TCGA | UHCW | Curie Institute | TCGA | TCGA | TCGA |
| Tissue types | CRC | Various (19) | Various (4) | CRC | TNBC | Various (8) | Various (2) | Various (4) |
| Unique patients | ||||||||
| Number of patches | 41 | 7901 | 294 | 100 | 50 | 30 | 15 | 64 |
| Training | 27 § | 2722 | 209 | NA | NA | 16 # | NA | 32 |
| Validation | § | 2656 | NA | NA | NA | # | NA | NA |
| Testing | 14 | 2523 | 85 | NA | NA | 14 # | NA | 32 |
| Patch size | 1000 × 1000 | 256 × 256 | Various | 500 × 500 | 512 × 512 | 1024 × 1024 | 400 × 400 up to 600 × 1000 | 500 × 500 up to 600 × 600 |
| Scanner(s) | Omnyx VL120 | Various | Various | Omnyx VL120 | Philips Ultra-Fast Scanner 1.6RA | Various | Various | Various |
| Magnification | 40× | 40× | 40× | 20× | 40× | 40× | 40× and 20× | 40× and 20× |
| Resolution | 0.275 µm/px | Various | NA | 0.55 µm/px | 0.245 µm/px | NA | NA | NA |
| Annotation | NC | NC | NC | CoN | NC | NC | NC | NC |
| Cells | 24,319 | 205,343 | 46,000 | 29,756 | 4022 | 21,623 | 2905 | 7570 |
| Labeled cells | 24,319 | 205,343 | 46,000 | 22,444 | NA | NA | NA | NA |
| Cell types | 7 | 5 | 4 (5) | 4 | NA | NA | NA | NA |
Abbreviations: CoNSeP, CRCHisto, colorectal histology; PanNuke, MoNuSAC, TNBC, triple-negative breast cancer; MoNuSeg, CPM, NC, nucleus contour; UHCW, University Hospital; TCGA, The Cancer Genome Atlas; CoN, center of nucleus; NG, not given. * CRCHisto only provides center of the nucleus annotations. § Graham et al. 2019 suggested splitting the 27 training images into a training and validation set. However, the paper does not provide information on the split conducted in their published paper. # Kumar et al. 2017 suggested splitting into a training/validation set of 16 and 14 images. They also provided the split conducted in their published paper.
Figure 1For each dataset, the original patches are divided into nonoverlapping (CoNSeP) and overlapping (PanNuke/MoNuSAC) mini-patches. A sub-patch is generated by adding padding using image information from adjacent tissue in the original image patch and/or by mirroring in edge cases. The central part of the sub-patch is cropped and used in the training procedure. The output of the network (segmented/classified cells) is equal to the mini-patch. Each model is subsequently trained for 100 epochs.
Figure 2Overview of cloud deployment of the web application. WSIs are uploaded into Azure Blob storage and mapped as a file system in the web application. The web application is built by an Azure Pipeline from a docker file contained in the histology git repository. We use the Kubernetes platform in Azure to train and validate our implementation of HoVer-Net. The exported models are implemented as inference containers in the histology application service and served using Tensorflow Serving.
Figure 3Overview of the slide viewer front end, back end and inference container. Selected regions of the front end are cropped out at the back end and transferred to the inference container. The inference container returns a contour matrix that is converted into a contour image and overlaid on top of the front end WSI.
A summary of the performance of four deep learning models trained using the CoNSeP (A I and II and B I and II) and the PanNuke (A III and IV and B III and IV) datasets using the original training pipeline as published by Graham et al. without (A I and III and B I and III) and with (A II and IV and B II and IV) enhanced augmentation. The best results for each parameter (1) within each dataset are in bold, and (2) models trained on another dataset are in italics. Separate comparisons of the segmentation and classification steps are provided in Table S3.
| Test Data | CoNSeP | PanNuke | MoNuSAC | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | CoNSeP | PanNuke | MoNuSAC | CoNSeP | PanNuke | MoNuSAC | CoNSeP | PanNuke | MoNuSAC | |||||||||
| Augmentation | HoVer | Aug | HoVer | Aug | HoVer | Aug | HoVer | Aug | HoVer | Aug | HoVer | Aug | HoVer | Aug | HoVer | Aug | HoVer | Aug |
| Numbering | AI | AII | AIII | AIV | AV | AVI | BI | BII | BIII | BIV | BV | BVI | CI | CII | CIII | CIV | CV | CVI |
| Integrated segmentation and classification | ||||||||||||||||||
|
| 0.78 |
| 0.71 | 0.71 |
| 0.55 | 0.73 |
| 0.71 | 0.72 | 0.47 | 0.46 | 0.59 | 0.70 | 0.74 |
| 0.73 | 0.73 |
|
| 0.81 | 0.77 | 0.60 | 0.70 |
| 0.79 |
| 0.52 | 0.61 |
| 0.48 | 0.45 | 0.86 | 0.79 | 0.78 |
| 0.77 | 0.77 |
|
| 0.58 | 0.66 | 0.79 | 0.74 |
| 0.60 | 0.28 | 0.36 |
| 0.61 | 0.53 |
| 0.26 | 0.49 | 0.78 |
| 0.86 |
|
|
| 0.68 | 0.71 | 0.68 | 0.72 |
| 0.68 | 0.38 | 0.42 | 0.63 |
|
|
| 0.40 | 0.60 | 0.78 |
| 0.81 | 0.82 |
|
| 0.57 |
|
| 0.51 | 0.38 | 0.38 | 0.42 |
| 0.57 |
|
|
| 0.53 |
|
| 0.62 | 0.68 |
|
|
| 0.59 | 0.61 | 0.61 | 0.62 | 0.68 |
| 0.53 | 0.50 | 0.64 | 0.63 | 0.70 |
|
| 0.62 | 0.58 | 0.57 | 0.71 |
|
|
|
| 0.73 |
| 0.66 | 0.45 | 0.44 | 0.26 | 0.34 | 0.68 |
|
| 0.50 | 0.10 | 0.42 | 0.92 |
| 0.84 | 0.85 |
|
|
| 0.66 |
|
| 0.54 | 0.55 | 0.35 | 0.40 | 0.66 |
|
|
| 0.18 | 0.50 |
|
| 0.77 |
|
Figure 4(A) All possible dichotomized cut-offs for the TILs identified by CD8 IHC in TMAs or using the HeIm, PanNuke or CoNSeP models plotted against p-values indicating the significance of the DSS for all included patients (n = 87). (B–G) Disease-specific survival curves for high- and low-TIL scores for (B) the semi-quantitative model proposed by Rakaee et al. 2018. (C) The CD8 model proposed by Kilvaer et al. 2020. (D–F) High and low numbers of TILs identified using the baseline rule-based HeIm algorithm (described in the Supplementary Materials) or DL models trained on the CoNSeP, MoNuSAC and PanNuke datasets, respectively.
Figure 5The top, middle and bottom rows represent the best, worst and largest range of F1 scores for the immune cell classifications from our manual validation, respectively. The first column is an overview of the entire image patch, while the second to fifth columns represent a focused region with and without detection/classification overlays.