| Literature DB >> 31099869 |
Ryan W Stidham1,2, Wenshuo Liu1, Shrinivas Bishu2, Michael D Rice2, Peter D R Higgins2, Ji Zhu1,3, Brahmajee K Nallamothu1,4,5,6, Akbar K Waljee1,2,5,6.
Abstract
Importance: Assessing endoscopic disease severity in ulcerative colitis (UC) is a key element in determining therapeutic response, but its use in clinical practice is limited by the requirement for experienced human reviewers. Objective: To determine whether deep learning models can grade the endoscopic severity of UC as well as experienced human reviewers. Design, Setting, and Participants: In this diagnostic study, retrospective grading of endoscopic images using the 4-level Mayo subscore was performed by 2 independent reviewers with score discrepancies adjudicated by a third reviewer. Using 16 514 images from 3082 patients with UC who underwent colonoscopy at a single tertiary care referral center in the United States between January 1, 2007, and December 31, 2017, a 159-layer convolutional neural network (CNN) was constructed as a deep learning model to train and categorize images into 2 clinically relevant groups: remission (Mayo subscore 0 or 1) and moderate to severe disease (Mayo subscore, 2 or 3). Ninety percent of the cohort was used to build the model and 10% was used to test it; the process was repeated 10 times. A set of 30 full-motion colonoscopy videos, unseen by the model, was then used for external validation to mimic real-world application. Main Outcomes and Measures: Model performance was assessed using area under the receiver operating curve (AUROC), sensitivity and specificity, positive predictive value (PPV), and negative predictive value (NPV). Kappa statistics (κ) were used to measure agreement of the CNN relative to adjudicated human reference cores.Entities:
Mesh:
Year: 2019 PMID: 31099869 PMCID: PMC6537821 DOI: 10.1001/jamanetworkopen.2019.3963
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Figure 1. Mayo Endoscopic Subscore Descriptors and Representative Images
Endoscopic image features including degree of erythema, visible vascular pattern, friability, ulceration, and spontaneous bleeding are used to categorize Mayo subscore. A, Mayo 0: no friability or granularity; intact vascular pattern. B, Mayo 1: erythema; Decreased vascular pattern; mild friability. C, Mayo 2: marked erythema; absent vascular pattern; friability; erosions. D, Mayo 3: marked erythema; absent vascular pattern; friability; granularity; spontaneous bleeding; ulcerations.
Figure 2. Schematic of Deep Learning Model for Predicting Mayo Endoscopic Score
Archived still images from earlier colonoscopies of patients with ulcerative colitis (UC) who met selection criteria were independently scored (labeled) for Mayo endoscopic score by 2 independent gastroenterologists specializing in inflammatory bowel disease (BD). Scored still images were randomly split (by patients) into a model development set and a test set. Resulting deep learning models were applied to the test set and a separate validation set of still images collected from 30 colonoscopy videos not used in model building. CNN indicates convolutional neural networks.
Figure 3. Convolutional Neural Networks (CNNs) for Automated Identification of Endoscopic Remission of Ulcerative Colitis
A, A CNN was trained on reference colonoscopy images scored by 2 independent reviewers, with adjudication of disagreements by a third reviewer. CNN discrimination between endoscopic remission (Mayo 0 or 1) from moderate to severe activity (Mayo 2 or 3) had an area under the receiver operating curve (AUROC) of 0.97. B, The CNN had similar performance differentiating remission from moderate to severe disease in a separate set of images from colonoscopy videos not used in model building, with an AUROC of 0.97. Dashed lines represent a nondiscriminatory AUROC. Plus sign indicates optimal sensitivity and specificity.
Agreement Between Adjudicated Human Reviewer Scores and Automated Mayo Subscore Within an Endoscopic Still-Image Testing Data Set
| Human Mayo Score | Predicted Mayo Score, % | Human Total No. of Images Reviewed | |||
|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | ||
| 0 | 89.6 | 8.2 | 2.0 | 0.2 | 922 |
| 1 | 30.1 | 54.5 | 12.7 | 1.6 | 299 |
| 2 | 5.0 | 16.4 | 69.9 | 8.6 | 256 |
| 3 | 0.5 | 0.6 | 24.6 | 74.3 | 175 |
| Predicted total, No. | 942 | 282 | 270 | 158 | 1652 |
Increasing Mayo endoscopic scores denote increasing mucosal inflammation in the colon, where a score of 0 indicates normal-appearing colonic mucosa and 3 indicates severe inflammatory changes.
Agreement Between Individual Human Reviewers Within an Endoscopic Still-Image Data Set
| Reviewer A Mayo Score | Reviewer B Mayo Score, % | Reviewer B, No. of Images Reviewed | |||
|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | ||
| 0 | 77.1 | 22.4 | 0.6 | 0.0 | 9160 |
| 1 | 14.6 | 54.7 | 30.3 | 0.4 | 3430 |
| 2 | 0.2 | 6.0 | 69.3 | 24.5 | 2405 |
| 3 | 0.0 | 0.1 | 14.3 | 85.7 | 1519 |
| Reviewer A, No. | 7563 | 4069 | 2976 | 1906 | 16 514 |
Increasing Mayo endoscopic scores denote increasing mucosal inflammation in the colon, where a score of 0 indicates normal-appearing colonic mucosa and 3 indicates severe inflammatory changes.