| Literature DB >> 34135556 |
Hiroshi Yoshida1, Tomoharu Kiyuna2.
Abstract
Tremendous advances in artificial intelligence (AI) in medical image analysis have been achieved in recent years. The integration of AI is expected to cause a revolution in various areas of medicine, including gastrointestinal (GI) pathology. Currently, deep learning algorithms have shown promising benefits in areas of diagnostic histopathology, such as tumor identification, classification, prognosis prediction, and biomarker/genetic alteration prediction. While AI cannot substitute pathologists, carefully constructed AI applications may increase workforce productivity and diagnostic accuracy in pathology practice. Regardless of these promising advances, unlike the areas of radiology or cardiology imaging, no histopathology-based AI application has been approved by a regulatory authority or for public reimbursement. Thus, implying that there are still some obstacles to be overcome before AI applications can be safely and effectively implemented in real-life pathology practice. The challenges have been identified at different stages of the development process, such as needs identification, data curation, model development, validation, regulation, modification of daily workflow, and cost-effectiveness balance. The aim of this review is to present challenges in the process of AI development, validation, and regulation that should be overcome for its implementation in real-life GI pathology practice. ©The Author(s) 2021. Published by Baishideng Publishing Group Inc. All rights reserved.Entities:
Keywords: Artificial intelligence; Clinical implementation; Deep learning; Digital image analysis; Digital pathology; Gastrointestinal cancer
Year: 2021 PMID: 34135556 PMCID: PMC8173389 DOI: 10.3748/wjg.v27.i21.2818
Source DB: PubMed Journal: World J Gastroenterol ISSN: 1007-9327 Impact factor: 5.742
Figure 1General workflow of construction of artificial intelligence model in pathology. Stained slides are converted to digital input images by a slide scanner. Both (a) hand-crafted feature engineering and (b) deep learning approach generate outputs of classification, which are applied to various clinically relevant predictions.
Figure 2Challenges for implementation in the development process of an artificial intelligence application. The process of development and implementation of an artificial intelligence (AI) application is composed of multiple steps from needs identification to use in real-life (left). In each step, various challenges keep AI applications from being implemented into clinical practice (right). AI: Artificial intelligence; IT: Information technology.
Artificial intelligence applications in gastric cancer pathology
|
|
|
|
|
|
| Bollschweiler | Prognosis prediction | 135 cases | ANN | Accuracy (93%) |
| Duraipandian | Tumor classification | 700 slides | GastricNet | Accuracy (100%) |
| Cosatto | Tumor classification | > 12000 WSIs | MIL | AUC (0.96) |
| Sharma | Tumor classification | 454 cases | CNN | Accuracy (69% for cancer classification), accuracy (81% for necrosis detection) |
| Jiang | Prognosis prediction | 786 cases | SVM classifier | AUCs (up to 0.83) |
| Qu | Tumor classification | 9720 images | DL | AUCs (up to 0.97) |
| Yoshida | Tumor classification | 3062 gastric biopsy specimens | ML | Overall concordance rate (55.6%) |
| Kather | Prediction of microsatellite instability | 1147 cases (gastric and colorectal cancer) | Deep residual learning | AUC (0.81 for gastric cancer; 0.84 for colorectal cancer) |
| Garcia | Tumor classification | 3257 images | CNN | Accuracy (96.9%) |
| León | Tumor classification | 40 images | CNN | Accuracy (up to 89.7%) |
| Fu | Prediction of genomic alterations, gene expression profiling, and immune infiltration | > 1000 cases (gastric, colorectal, esophageal, and liver cancers) | Neural networks. | AUC (0.9) for BRAF mutations prediction in thyroid cancers |
| Liang | Tumor classification | 1900 images | DL | Accuracy (91.1%) |
| Sun | Tumor classification | 500 images | DL | Accuracy (91.6%) |
| Tomita | Tumor classification | 502 cases (esophageal adenocarcinoma and Barret esophagus) | Attention-based deep learning | Accuracy (83%) |
| Wang | Tumor classification | 608 images | Recalibrated multi-instance deep learning | Accuracy (86.5%) |
| Iizuka | Tumor classification | 1746 biopsy WSIs | CNN, RNN | AUCs (up to 0.98), accuracy (95.6%) |
| Kather | Prediction of genetic alterations and gene expression signatures | > 1000 cases (gastric, colorectal, and pancreatic cancer) | Neural networks | AUC (up to 0.8) |
ANN: Artificial neural network; GastricNet: The deep learning framework; WSIs: Whole slide images; MIL: Multi-instance learning; AUC: Area under the curve; CNN: Convolutional neural networks; SVM: Support vector machine; DL: Deep learning; ML: Machine learning; RNN: Recurrent neural networks.
Artificial intelligence applications in colorectal cancer pathology
|
|
|
|
|
|
| Xu | Tumor classification: 6 classes (NL/ADC/MC/SC/PC/CCTA) | 717 patches | AlexNet | Accuracy (97.5%) |
| Awan | Tumor classification: Normal/Low-grade cancer/High-grade cancer | 454 cases | Neural networks | Accuracy (97%, for 2-class; 91%, for 3-class) |
| Haj-Hassan | Tumor classification: 3 classes (NL/AD/ADC) | 30 multispectral image patches | CNN | Accuracy (99.2%) |
| Kainz | Tumor classification: Benign/Malignant | 165 images | CNN (LeNet-5) | Accuracy (95%-98%) |
| Korbar | Tumor classification: 6 classes (NL/HP/SSP/TSA/TA/TVA-VA) | 697 cases | ResNet | Accuracy (93.0%) |
| Yoshida | Tumor classification | 1328 colorectal biopsy WSIs | ML | Accuracy (90.1%, adenoma) |
| Alom | Tumor microenvironment analysis: Classification, Segmentation and Detection | 21135 patches | DCRN/R2U-Net | Accuracy (91.1%, classification) |
| Bychkov | Prediction of colorectal cancer outcome (5-yr disease-specific survival). | 420 cases | Recurrent neural networks | HR of 2.3, AUC (0.69) |
| Weis | Evaluation of tumor budding | 401 cases | CNN | Correlation R (0.86) |
| Ponzio | Tumor classification: 3 classes (NL/AD/ADC) | 27 WSIs (13500 patches) | VGG16 | Accuracy (96 %) |
| Kather | Tumor classification: 2 classes (NL/Tumor) | 94 WSIs | ResNet18 | AUC (> 0.99) |
| Kather | Prediction of microsatellite instability | 360 TCGA- DX (93408 patches), 378 TCGA- KR (60894 patches) | ResNet18 | AUC: TCGA-DX—(0.77, TCGA-DX; 0.84, TCGA-KR) |
| Kather | Tumor microenvironment analysis: classification of 9 cell types | 86 WSIs (100000) | VGG19 | Accuracy (94%-99%) |
| Kather | Prognosis predictions | 1296 WSIs | VGG19 | Accuracy (94%-99%) |
| Kather | Prognosis prediction | 934 cases | Deep learning (comparison of 5 networks) | HR for overall survival of 1.99 (training set) and 1.63 (test set) |
| Geessink | Prognosis prediction, quantification of intratumoral stroma | 129 cases | Neural networks | HRs of 2.04 for disease-free survival |
| Sena | Tumor classification: 4 classes (NL/HP/AD/ADC) | 393 WSIs (12,565 patches) | CNN | Accuracy (80%) |
| Shapcott | Tumor microenvironment analysis: detection and classification | 853 patches and 142 TCGA images | CNN with a grid-based attention network | Accuracy (84%, training set; 65%, test set) |
| Sirinukunwattana | Prediction of consensus molecular subtypes of colorectal cancer | 1206 cases | Neural networks with domain-adversarial learning | AUC (0.84 and 0.95 in the two validation sets) |
| Swiderska-Chadaj | Tumor Microenvironment Analysis: Detection of immune cell, CD3+, CD8+ | 28 WSIs | FCN/LSM/U-Net | Sensitivity (74.0%) |
| Yoon | Tumor classification: 2 classes (NL/Tumor) | 57 WSIs (10280 patches) | VGG | Accuracy (93.5%) |
| Echle | Prediction of microsatellite instability | 8836 cases | ShuffleNet Deep learning | AUC (0.92 in development cohort; 0.96 in validation cohort) |
| Iizuka | Tumor classification: 3 classes (NL/AD/ADC) | 4036 WSIs | CNN/RNN | AUCs (0.96, ADC; 0.99, AD) |
| Skrede | Prognosis predictions | 2022 cases | Neural networks with multiple instance learning | HR (3.04 after adjusting for established prognostic markers) |
NL: Normal mucosa; ADC: Adenocarcinoma; MC: Mucinous carcinoma; SC: Serrated carcinoma; PC: Papillary carcinoma; CCTA: Cribriform comedo-type adenocarcinoma; AD: Adenoma; CNN: Convolutional neural network; HP: Hyperplastic polyp; SSP: Sessile serrated polyp; TSA: Traditional serrated adenoma; TA: Tubular adenoma; TVA: Tubulovillous adenoma; VA: Villous adenoma; WSI: Whole slide images; ML: Machine learning; DCRN: Densely connected recurrent convolutional network; R2U-Net: Recurrent residual U-Net; HR: Hazard ratio; AUC: Area under the curve; TCGA: The Cancer Genome Atlas; ResNet: Residual network; VGG: Visual geometry group; RNN: Recurrent neural network; FCN: Fully convolutional networks; LSM: Locality-sensitive method.
Advantages and disadvantages of representative machine-learning methods in the development of artificial intelligence-models for gastrointestinal pathology
|
|
|
|
| Conventional ML (supervised) | User can reflect domain knowledge to features | Requires hand-crafted features; Accuracy depends heavily on the quality of feature extraction |
| Conventional ML (unsupervised) | Executable without labels | Results are often unstable; Interpretability of the results |
| Deep neural networks (CNN) | Automatic feature extraction; High accuracy | Requires a large dataset; Low explainability (Black box) |
| Multi-instance learning | Executable without detailed labels | Requires a large dataset; High computational cost |
| Semantic segmentation (FCN, U-Net) | Pixel-level detection gives the position, size, and shape of the target | High labeling cost |
| Recurrent neural networks | Learn sequential data | High computational cost |
| Generative adversarial networks | Learn to synthesize new realistic data | Complexity and instability in training |
AI: Artificial intelligence; ML: Machine learning; CNN: Convolutional neural network; FCN: Fully convolutional network