| Literature DB >> 34568816 |
Kyubum Lee1, John H Lockhart2, Mengyu Xie1, Ritu Chaudhary3, Robbert J C Slebos3, Elsa R Flores2,4, Christine H Chung3,5, Aik Choon Tan1,5.
Abstract
The tumor immune microenvironment (TIME) encompasses many heterogeneous cell types that engage in extensive crosstalk among the cancer, immune, and stromal components. The spatial organization of these different cell types in TIME could be used as biomarkers for predicting drug responses, prognosis and metastasis. Recently, deep learning approaches have been widely used for digital histopathology images for cancer diagnoses and prognoses. Furthermore, some recent approaches have attempted to integrate spatial and molecular omics data to better characterize the TIME. In this review we focus on machine learning-based digital histopathology image analysis methods for characterizing tumor ecosystem. In this review, we will consider three different scales of histopathological analyses that machine learning can operate within: whole slide image (WSI)-level, region of interest (ROI)-level, and cell-level. We will systematically review the various machine learning methods in these three scales with a focus on cell-level analysis. We will provide a perspective of workflow on generating cell-level training data sets using immunohistochemistry markers to "weakly-label" the cell types. We will describe some common steps in the workflow of preparing the data, as well as some limitations of this approach. Finally, we will discuss future opportunities of integrating molecular omics data with digital histopathology images for characterizing tumor ecosystem.Entities:
Keywords: cell type classification; deep learning; histopathology image analysis; image data labeling; tumor heterogeneity; tumor immune microenvironment
Year: 2021 PMID: 34568816 PMCID: PMC8461055 DOI: 10.3389/frai.2021.754641
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
FIGURE 1Slide-, Region-, and Cell-level analyses on tumor histopathology images and its characteristics.
FIGURE 2An overview of histology slide preparation and machine learning model construction process.
Selected deep learning-based histopathology image analysis studies.
| Publication | Input image type | Training annotations | Deep learning architecture | Prediction output | Other functions | Training dataset size | Level |
|---|---|---|---|---|---|---|---|
|
| Tiled images from whole slide (H&E) | Pathologist marking tumor and normal areas, input description text | CNN | Probability of being tumor at Pixel level | Text query of images using pathological terms | 913 whole slides | WSI |
| Attention module | Attention area | ||||||
| RNN | Text description | ||||||
| TOAD | Whole slide | Tumor of origin | CNN | Primary or metastatic | 22,833 whole slides from 18 cancer types +6,499 test slides | WSI | |
| Sex | |||||||
| Tumor of origin | |||||||
| Primary or metastatic | Attention module | ||||||
| Attention areas | |||||||
|
| Whole slide (H&E, frozen section) | Primary diagnosis | DenseNet | Relevant images ranked by similarity | Images features extracted as barcodes | 29,120 whole slides from 32 cancer types (TCGA) | WSI |
| HE2RNA | Whole slide images | RNA expression for training | multilayer perceptron | RNA expression | Spatial mapping of gene expression | 8,725 samples from 28 cancer types | WSI |
|
| Whole slide images (H&E) | Pathologist marking regions of lymphocytes and necrosis | CNN | TIL maps (Computational Staining) | 5,455 images from 13 cancer types (TCGA) | ROI | |
|
| Whole slide images (H&E) | Pathologist marking tumor regions, TIL annotations from Saltz et al. study | 34-layer ResNet, 16-layer VGG, and Inception v4 | Tumor probability and TIL probability heatmaps | Cancer detection: 393 breast cancer images from SEER and TCGA | ROI | |
| Lymphocyte detection: 1090 invasive breast cancer from TCGA | |||||||
|
| Whole slide images (H&E) | Pathologist marking normal vs. tumor region (grade 1–4) | CNN(ResNet18) | Normal lung tissue/airways | In house images from mouse models | ROI | |
| Lung adenocarcinoma of different grades (1–4) | |||||||
| ConvPath | Whole slide images (H&E) | Pathologist labeled ROI (tumor, stroma, lymphocytes) | CNN | “spatial map” of tumor cells, stromal cells and lymphocytes (limited to lung adenocarcinoma) | TCGA-LUAD(1337) | ROI/Cell level | |
| NLST(345) | |||||||
| Beijing(102) | |||||||
| SPORE(130) | |||||||
| CRImage | Whole slide images (H&E) | Single cell annotations, OS, omics data | Topological tumor graphs (TTG), unsupervised deep learning framework (CNx) | Cell level classified and mapped slices for further analysis and hypothesis generating | 400 SKCM (TCGA) | Cell level |
FIGURE 3Example of the IHC-stained histopathology images with different antibodies.
FIGURE 4An example of the deep learning-based cell subtype prediction results.