| Literature DB >> 31788542 |
Alanna Ebigbo1, Christoph Palm2,3, Andreas Probst1, Robert Mendel2,3, Johannes Manzeneder1, Friederike Prinz1, Luis A de Souza2,4, João P Papa5, Peter Siersema6, Helmut Messmann1.
Abstract
Background and aim The growing number of publications on the application of artificial intelligence (AI) in medicine underlines the enormous importance and potential of this emerging field of research. In gastrointestinal endoscopy, AI has been applied to all segments of the gastrointestinal tract most importantly in the detection and characterization of colorectal polyps. However, AI research has been published also in the stomach and esophagus for both neoplastic and non-neoplastic disorders. The various technical as well as medical aspects of AI, however, remain confusing especially for non-expert physicians. This physician-engineer co-authored review explains the basic technical aspects of AI and provides a comprehensive overview of recent publications on AI in gastrointestinal endoscopy. Finally, a basic insight is offered into understanding publications on AI in gastrointestinal endoscopy.Entities:
Year: 2019 PMID: 31788542 PMCID: PMC6882682 DOI: 10.1055/a-1010-5705
Source DB: PubMed Journal: Endosc Int Open ISSN: 2196-9736
Fig. 1 Overview of artificial intelligence (AI), machine learning (ML) and deep learning (DL) 7 .
Fig. 2 Deep learning (DL) based on convolutional neural networks (CNN) showing the input layer with raw data of the image, the hidden layer with a series of convolutions computed for each layer and the classification of the image in the output layer.
Brief summary of AI applications.
| AI tasks | Comments |
| Frame detection task | Frames are individual pictures in a sequence of images; in this task, AI detects frames with suspicious objects which need closer examination; for example, during colonoscopy, the detection of frames bearing an adenoma or polyp. |
| Object detection task | AI recognizes and identifies a region of interest (ROI) (such as a dysplastic lesion in BE) during an endoscopic examination. |
| Classification task | AI categorizes detected lesions into classes such as neoplastic vs. non-neoplastic or adenomatous vs. hyperplastic |
| Segmentation task | AI delineates the outer margin or border of a detected lesion and correctly differentiates between pathological and normal at the interface between the lesion and the healthy tissue. |
| Task combinations | AI can ultimately combine these tasks described above in one work-flow, for example the detection and classification of a colorectal polyp followed by the delineation of the outer margin of the lesion. |
BE, Barrett’s esophagus.
Fig. 3 Automatic tumor classification and segmentation on two endoscopic images ( a, c ) are shown by colored contours ( c, d ) overlaid on the original images as so-called heat maps.
Selected studies of use of AI in the gastrointestinal tract.
| Reference/year | Organ/disease | AI application task | ML- modality | Outcome |
|
Ebigbo A, et al; 2018
| Barrett’s esophagus | Classification: cancer vs. non-cancer | DL/CNN | Sensitivity 97 % and Specificity 88 %; outperformed human endoscopists |
|
Horie Y, et al; 2018
| Esophageal SCC | Detection of cancer and classification into superficial and advanced cancer | DL/CNN | Sensitivity of 98 % in the detection of cancer and a diagnostic accuracy of 98 % in the differentiation between superficial and advanced cancer |
|
Kanesaka, et al; 2018
| Gastric cancer | Identification of cancer on NBI images; delineation task | CNN | Accuracy of 96 % and 73,8 % respectively in the identification and delineation tasks. |
|
Zhu Y, et al; 2019
| Gastric cancer | Evaluation of the invasion depth of gastric cancer | CNN | Overall accuracy of 89.16 % which was significantly higher than that of human endoscopists |
|
Nakashima, et al; 2018
|
| Optical diagnosis of H. pylori gastritis | CNN | Sensitivity/specificity > 96 % |
|
Wang P, et al; 2019
| Colonic polyps | Real-time automatic polyp detection | CNN |
Significant increase in detection of diminutive adenomas and hyperplastic polyps (29.1 % vs 20.3 %,
|
|
Mori Y, et al; 2018
| Colonic polyps | Detection task; Real-Time identification of diminutive polyps | CNN | Pathologic prediction rate of 98,1 % |
DL, deep learning; CNN, convolutional neural network; SCC, squamous cell carcinoma.
Understanding AI research: characteristics of publications.
| Characteristics | Comments |
| Origin of images Self-acquired vs. open access database | Images generated by clinicians specifically for an AI study rather than images taken from an open-access data base may provide more accurate answers to the study hypothesis. However, an open-access database could have the advantage of improved comparability when other AI methods or studies are used on images from the same open access data base. |
| Quantity of images for training | Generally, the more images used in an AI study, the more accurate the results may be. However, it is not possible to make a blanket statement about the number of images needed for a high-quality research paper. To increase the quantity of training data AI researchers sometimes make use of many small subsegments of the original image. Additionally, the number of training images may be increased due to augmentation. For this, small variations of the original images are computed to simulate variations of the real-world. Standard augmentation procedures are rotation, translation and mirroring along the horizontal and vertical axis. Also, changes in contrast, brightness, hue and saturation may be applied in a randomized fashion, while the original images remain the same. |
| Validation and cross-validation | The true performance of an AI system has to be proven on data of the daily routine in a clinic over a long period without data selection. Since these long-term evaluations are not available yet, a fixed number of image data have to be used for training and validation. But images used for validation should never be used for training. However, testing the AI system on one validation data set only might lead to an over- or underestimation of the true performance, depending on the data separation. |
| Real-time analysis of real-life images | The analysis of real-life images in real-time comes closer to the clinical reality than the analysis of optimally collected images. The latter may lead to an over estimation of the performance ability of an AI system. |
| Comparison with the human expert | Controlled trials comparing the AI system in real-time with the human expert on the same set of test images may provide useful information on the performance ability of the AI system since the human expert remains the gold standard of the computer vision diagnosis. |
| Deep learning (DL) | AI research using DL seems to have higher potential than systems which rely on hand-crafted features only. Therefore, most recent AI studies have made use of DL algorithms. |