| Literature DB >> 34307315 |
Guihua Chen1, Jun Shen1.
Abstract
Inflammatory bowel disease (IBD), which includes ulcerative colitis (UC) and Crohn's disease (CD), is an idiopathic condition related to a dysregulated immune response to commensal intestinal microflora in a genetically susceptible host. As a global disease, the morbidity of IBD reached a rate of 84.3 per 100,000 persons and reflected a continued gradual upward trajectory. The medical cost of IBD is also notably extremely high. For example, in Europe, it has €3,500 in CD and €2,000 in UC per patient per year, respectively. In addition, taking into account the work productivity loss and the reduced quality of life, the indirect costs are incalculable. In modern times, the diagnosis of IBD is still a subjective judgment based on laboratory tests and medical images. Its early diagnosis and intervention is therefore a challenging goal and also the key to control its progression. Artificial intelligence (AI)-assisted diagnosis and prognosis prediction has proven effective in many fields including gastroenterology. In this study, support vector machines were utilized to distinguish the significant features in IBD. As a result, the reliability of IBD diagnosis due to its impressive performance in classifying and addressing region problems was improved. Convolutional neural networks are advanced image processing algorithms that are currently in existence. Digestive endoscopic images can therefore be better understood by automatically detecting and classifying lesions. This study aims to summarize AI application in the area of IBD, objectively evaluate the performance of these methods, and ultimately understand the algorithm-dataset combination in the studies.Entities:
Keywords: Crohn’s disease; artificial intelligence; inflammatory bowel disease; machine learning; ulcerative colitis
Year: 2021 PMID: 34307315 PMCID: PMC8297505 DOI: 10.3389/fbioe.2021.635764
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
FIGURE 1The requirement of artificial intelligence (AI) application and the enhancement for inflammatory bowel disease (IBD) studies. The high-quality data at a certain volume are the basis of building and training the machine learning (ML) models. Making the appropriate match between the dataset and analysis methods is also an essential step, which can influence both accuracy and interpretability of results. The AI application in IBD study was summarized and sorted by the research purpose. In the field of IBD etiology, diagnosis, and treatment, AI methods played distinct roles.
FIGURE 2Artificial intelligence terminology. AI is a general term concerning the computer science that enables the machine applying the human intelligence such as “learning” and “problem-solving” to perform the practical task. ML, multi-agent systems, expert systems, knowledge representation, recommender systems, robotics, and perception are the subset of AI. ML is a subset of AI, which automatically detects patterns in data to make predictions or decisions without explicitly being programmed. (A) Support vector machine (SVM) is a discriminative classifier that has an excellent performance in classification due to its strength of regularization and convex optimization. With the application of “kernel trick,” SVM can handle non-linear problems. (B) Naïve Bayes is a probabilistic ML algorithm based on the Bayes’ theorem and assumes the independence between features. (C) Random forest is an ensemble algorithm constructing numerous decision trees at training to increase the overall result by combining learning models. (D) Artificial neural network (ANN) is a regression and classification algorithm composed of artificial neurons applying simple classifiers to output decision signals based on the weighted sum of evidence. The basic structure was the combination of an input, hidden connection, and output layer. (E) Deep learning is a class of ANNs with several hidden layers which can learn complex hierarchical representations from the data for feature extraction and transformation, and for pattern analysis and classification. (F) Convolutional neural network (CNN) is a specific subset of ANNs that imitate the organization of the animal visual cortex for image processing tasks. The convolutional layer of CNN is the essential part.
Summary of studies using artificial intelligence in IBD etiology.
| References | Published year | Aim of study | Type of AI | Number of subjects | Input variables (number/type) | Outcomes |
| 2017 | Prioritization of IBD risk genes to detect candidate novel IBD-associated genes | Four different machine-learning classification models: rf, svmPoly, xgbTree, and glmnet | 180 CD, 149 UC, 94 colorectal neoplasms, and 90 normal tissue | 309/expression data from both array and RNA-seq data sets, GO, KEGG, and the Pathway Interactions Database terms | 67 novel candidate IBD-risk genes | |
| 2017 | Screening for differential expressing genes among different clusters based on an IBD database to identify genes related to IBD | SMO | 59 CD, 26 UC, and 42 normal samples | 12,754/expression levels of 12,754 genes | 21 candidate genes related to IBD | |
| 2016 | Assessment of the predictive power of three BMLTs as classifiers for EIM in CD patients | 3 BMLTs: NB, BART, and BN | 152 patients with CD | 12/disease characteristics, risk factors, and genetic variables | Accuracy: 89% (BN achieved the best performance) | |
| 2019 | Determination of whether BMLT could improve EIM prediction | 3 BMLTs: NB, BART, and BN | 152 patients with CD | 12/disease characteristics, risk factors, and genetic variables | Sensitivity: 66.0%, specificity: 69.0% (BART achieved the best performance) | |
| 2015 | Prediction of IBD onset and relapse frequency with meteorological data | ANN | 569 UC and 332 CD patients | 5/meteorological data | Accuracy in predicting the frequency of IBD relapse (mean square error = 0.009, mean absolute percentage error = 17.1%) |
FIGURE 3Computer-aided diagnosis for endocytoscopy to identify histological inflammation. (A) Three-hundred and twelve features collected by endocytoscopic and texture analysis. (B) SVM based on the 312 features classifying 2-class diagnoses (“active” or “healing”). (C) The status and diagnostic probability were published. Adapted with permission from Maeda et al. (2019), Elsevier.
FIGURE 4Model construction and data processing. The whole project was composed of model construction, validation, and inflammatory bowel disease unclassified reclassification. The study recruited 239 pediatric patients from the Genetics of Pediatric Inflammatory Bowel Disease study at Southampton Children’s Hospital. The clinical data were used to search the best parameters for classification and train the model. The linear SVM was applied to construct the optimal model, allowing for assessing the relevance with the disease of the selected variable. Then, the optimal penalty parameter (C) tuning and fivefold cross-validation scheme (RFE-CV) help maximizing the classification accuracy by avoiding overfitting. Adapted with permission from Mossotto et al. (2017), Nature Publishing Group.
Summary of studies using artificial intelligence in IBD diagnosis.
| References | Published year | Aim of study | Type of AI | Number of subjects | Outcomes |
| 2019 | Prediction of persistent histologic inflammation associated with UC | SVM | Training set: 12,900 images from 87 patients. Test set: 9935 images from 100 patients | Sensitivity: 74%, specificity: 97%, accuracy: 91% | |
| 2016 | Segmentation of CD from abdominal MRI | Active learning framework combined with semi-supervised learning | 70 patients (fivefold cross validation) | Dice Metric: 92.4%, Hausdorff distance: 7.0 mm | |
| 2015 | Diagnostics of IBD | SVM | 114 patients | AUROC: 0.75–1.00 | |
| 2017 | Classification of Pediatric Inflammatory Bowel Disease | SVM | 239 patients | Accuracy: 82.7% (model utilizing combined endoscopic/histological data achieved the best performance) | |
| 2018 | Classification of disease state and treatment outcome in pediatric Crohn’s disease | RF | Intestinal biopsies of 20 treatment-naïve CD and 20 control pediatric patients | Accuracy: 84.2% (model utilizing 16S taxonomic datasets achieved the best performance) |
Summary of studies using artificial intelligence in IBD treatment.
| References | Published year | Aim of study | Type of AI | Number of subjects | Input variables (number/type) | Outcomes |
| 2017 | Prediction of IBD flares | RF | 20,368 patients | 6/demographic data, lab data, and clinical variables | AUROC: 0.87 (RF longitudinal model utilizing previous hospitalization or steroid use achieved the best performance) | |
| 2019 | Prediction and explanation of inflammation in CD | Gradient boosting machines | 82 patients | 40/demographic data, lab data, and clinical variables | AUROC: 0.93 | |
| 2020 | Detection of intestinal strictures in CD patients | Two-class decision forest algorithm | 67 patients | 15/serum elafin level and 14 clinical variables | AUROC: 0.92 (model utilizing serum elafin levels and commonly available clinical data achieved the best performance) | |
| 2019 | Prediction of responses to therapy of patients with Acute Severe UC | Neural networks | 47 patients | 14/9 microRNAs and five clinical variables | Accuracy: 93%, AUROC: 0.91 | |
| 2019 | Identification of CD patients likely to be durable responders to ustekinumab | RF | 401 patients | 12/5 demographic data and seven laboratory test results | AUROC: 0.78 (model utilizing data through week 8 achieved the best performance) | |
| 2019 | Prediction of CD recurrence risk after ileocolic resection | RF | 60 patients (five patients with extreme variability on whole transcriptome analysis had been excluded) | 30/expression levels of 30 transcripts | Accuracy: 91.67% (in anti-TNFα-naïve patients), 92.86% (in patients receiving anti-TNFα therapy) | |
| 2020 | Prediction of sustained remission after exclusive enteral nutrition in pediatric CD | RF | 22 patients | 34/demographic data, clinical variables, and 27 microbial data | AUROC: 0.9 (model utilizing microbial abundances, species richness, and Paris disease classification achieved the best performance) |