| Literature DB >> 35269607 |
Ji Hyun Park1, Eun Young Kim2, Claudio Luchini3,4, Albino Eccher5, Kalthoum Tizaoui6, Jae Il Shin7, Beom Jin Lim8.
Abstract
Microsatellite instability (MSI)/defective DNA mismatch repair (dMMR) is receiving more attention as a biomarker for eligibility for immune checkpoint inhibitors in advanced diseases. However, due to high costs and resource limitations, MSI/dMMR testing is not widely performed. Some attempts are in progress to predict MSI/dMMR status through histomorphological features on H&E slides using artificial intelligence (AI) technology. In this study, the potential predictive role of this new methodology was reviewed through a systematic review. Studies up to September 2021 were searched through PubMed and Embase database searches. The design and results of each study were summarized, and the risk of bias for each study was evaluated. For colorectal cancer, AI-based systems showed excellent performance with the highest standard of 0.972; for gastric and endometrial cancers they showed a relatively low but satisfactory performance, with the highest standard of 0.81 and 0.82, respectively. However, analyzing the risk of bias, most studies were evaluated at high-risk. AI-based systems showed a high potential in predicting the MSI/dMMR status of different cancer types, and particularly of colorectal cancers. Therefore, a confirmation test should be required only for the results that are positive in the AI test.Entities:
Keywords: DNA mismatch repair; artificial intelligence; deep learning; digital pathology; microsatellite instability
Mesh:
Year: 2022 PMID: 35269607 PMCID: PMC8910565 DOI: 10.3390/ijms23052462
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1A simplified version of convolutional neural networks (CNNs) workflow in digital pathology. The scanned H&E slide image passes through filters that detect specific features (e.g., lines, edges). Pooling layers summarize features from convolution layers. After a series of convolution and pooling layers, fully connected layers (classification layer) are generated, and through this layer output is created.
Comparison of studies regarding detection of MSI/dMMR from histology slides using deep learning.
| Author/Year | Organ | Neural Network | Training Cohort | Type of Internal Validation | External Validation | Test Cohort(s) with AUC (95% CI) or Accuracy | Methodology for MSI Analysis |
|---|---|---|---|---|---|---|---|
| Zhang et al., * | Colorectum | Inception-V3 without adversarial training | TCGA CRC ( | Random split | Yes | TCGA CRC Accuracy: 98.3% | - |
| TCGA CRC ( | TCGA CRC Accuracy: 72.3% | ||||||
| Inception-V3 | TCGA CRC Accuracy: 85.0% | ||||||
| Klaiman et al., * 2019 [ | Colorectum | N/A | Roche internal CRC80 dataset ( | Random split | No | Roche internal CRC80 dataset: 0.9 | - |
| Kater et al., | Stomach | ResNet-18 | TCGA STAD FFPE | Random split | Yes | TCGA STAD FFPE ( | TCGA: PCR |
| Colorectum | TCGA CRC FFPE | TCGA CRC FFPE ( | |||||
| Colorectum | TCGA CRC Frozen | TCGA CRC Frozen ( | |||||
| UCEC | TCGA UCEC FFPE | No | TCGA UCEC FFPE ( | ||||
| Pressman et al., * 2020 [ | Colorectum | ResNet18 | TCGA ( | - | Yes | TCGA: 0.79 | - |
| Schmauch et al., 2020 [ | Colorectum | HE2RNA with ResNet50 | TCGA CRC FFPE | Three-fold cross validation | No | TCGA CRC FFPE: 0.82 | PCR |
| Stomach | TCGA STAD FFPE | TCGA STAD FFPE: 0.76 | |||||
| Kather et al., 2020 [ | Colorectum | ShuffleNet | TCGA CRC FFPE | Three-fold cross-validation | Yes | DACHS FFPE ( | TCGA: PCR |
| Cao et al., | Colon | ResNet-18 | TCGA-COAD Frozen | Random split | Yes | TCGA-COAD: 0.8848 (0.8185–0.9512) | TCGA-COAD: NGS 2
|
| TCGA-COAD Frozen (90%) + | - | No | Asian-CRC FFPE ( | ||||
| TCGA-COAD Frozen (30%) + | - | No | Asian-CRC FFPE ( | ||||
| Echle et al., | Colorectum | ShuffleNet | MSIDETECT CRC | Random split | Yes | MSIDETECT CRC: 0.92 (0.90–0.93) | DACHS: PCR |
| Three-fold cross validation | MSIDETECT CRC: 0.92 (0.91–0.93) | ||||||
| YCR-BCIP-BIOPSY | Three-fold cross validation | No | YCR-BCIP-BIOPSY: 0.89 (0.88–0.91) | ||||
| Valieris et al., 2020 [ | Stomach | Resnet-34 | TCGA-STAD FFPE | Random split | No | TCGA-STAD FFPE: 0.81 (0.689–0.928) | NGS 4 |
| Yamashita et al., 2021 [ | Colorectum | MSInet | Stanford dataset ( | Random split | No | Stanford dataset ( | Stanford dataset: IHC/PCR |
| Four-fold | Yes | Stanford dataset ( | |||||
| Krause et al., 2021 [ | Colorectum | ShuffleNet | TCGA FFPE ( | Random split | No | TCGA FFPE ( | PCR |
| Lee et al., | Colorectum | Inception-V3 | TCGA FFPE | 10-fold cross validation | No | TCGA FFPE: 0.892 (0.855–0.929) | TCGA: PCR |
| TCGA FFPE | Yes | TCGA FFPE: 0.861 (0.819–0.903) | |||||
| TCGA Frozen | No | TCGA Frozen: 0.942 (0.925–0.959) | |||||
| Hong et al., | UCEC | InceptionResNetV1 | TCGA and CPTAC | Random split | Yes | TCGA and CPTAC: 0.827 (0.705–0.948) | TCGA: PCR |
AUC, Area Under the Curve; UCEC, Uterine Corpus Endometrial Carcinoma; TCGA, The Cancer Genome Atlas study; CRC, ColoRectal Cancer; WSI, Whole Slide Images; STAD, STomach ADenocarcinoma; pts, patients; FFPE, Formalin-Fixed Paraffin-Embedded; DACHS, Darmkrebs: Chancen der Verhütung durch Screening (CRC prevention through screening study abbreviation in German); KCCH, Kangawa Cancer Center Hospital (Japan); Stanford dataset, Stanford University Medical Center (USA) Gangnam sev, Gangnam Severance Hospital (South Korea); COAD, COlonic ADenocarcinoma; MSIDETECT: A consortium composed of TCGA, DACHS, the United Kingdom-based Quick and Simple and Reliable trial (QUASAR), and the Netherlands Cohort Study (NLCS); YCR-BCIP: Yorkshire Cancer Research Bowel Center Improvement Programme; SMH, Saint Mary’s Hospital (South Korea); CPTAC, Clinical Proteomic Tumor Analysis Consortium; NYU, New York University hospital. * Conference paper or abstract not officially published. † The stomach cancer cohort was used only in test cohort. 1 3-plex PCR (BAT25, BAT26, CAT25) 2 MSI sensor algorithm 3 2-plex IHC 4 Mutation signature 5 Mutation load, MMR gene mutation status, MSI sensor score, MSMuTect score, and MLH1methylation.
Figure 2Flow chart of literature search.
Figure 3Representative workflow of the digitalization of a case of colon cancer with microsatellite instability is provided here (provided by Claudio Luchini, co-author). (A) Cancer area: tumor cells and peri-tumor cells, including stromal cells and immune cells, are here shown. This is the point of the start of the analysis on a slide stained with hematoxylin-eosin (original magnification: 10×). (B) The digitalized system is able to separate cancer cells (here colored in blue) from non-cancer cells (red). (C) The immunohistochemistry for mismatch-repair proteins can be also taken into account in this process. This figure represents MSL-staining, showing the loss of the protein into the neoplastic component, while its expression in retained in non-tumor cells (original magnification: 10×, same field of hematoxylin-eosin). (D) The digitalized system is able to interpret the results of immunohistochemistry, based on a deep learning approach. In this step, the system shows its ability to separate cancer cells (here colored in blue) from non-cancer cells (brown). (E,F) Since the immunohistochemistry for mismatch-repair proteins is a nuclear staining, for finalizing its interpretation the system here shows its ability in the detection and analysis of only cell nuclei, with tumor cells in blue and non-tumor cells in red (E,F; different resolution of analysis, which can be adapted based on staining patterns and the difficulty of their interpretation). Scale bar represents 200 μm.