| Literature DB >> 32859981 |
Hanna Borgli1,2, Vajira Thambawita1,3, Pia H Smedsrud1,2,4, Steven Hicks1,3, Debesh Jha1,5, Sigrun L Eskeland6, Kristin Ranheim Randel2,7, Konstantin Pogorelov8, Mathias Lux9, Duc Tien Dang Nguyen10, Dag Johansen5, Carsten Griwodz2, Håkon K Stensland2,8, Enrique Garcia-Ceja11, Peter T Schmidt12,13, Hugo L Hammer1,3, Michael A Riegler1, Pål Halvorsen14,15, Thomas de Lange6,4,16.
Abstract
Artificial intelligence is currently a hot topic in medicine. However, medical data is often sparse and hard to obtain due to legal restrictions and lack of medical personnel for the cumbersome and tedious process to manually label training data. These constraints make it difficult to develop systems for automatic analysis, like detecting disease or other lesions. In this respect, this article presents HyperKvasir, the largest image and video dataset of the gastrointestinal tract available today. The data is collected during real gastro- and colonoscopy examinations at Bærum Hospital in Norway and partly labeled by experienced gastrointestinal endoscopists. The dataset contains 110,079 images and 374 videos, and represents anatomical landmarks as well as pathological and normal findings. The total number of images and video frames together is around 1 million. Initial experiments demonstrate the potential benefits of artificial intelligence-based computer-assisted diagnosis systems. The HyperKvasir dataset can play a valuable role in developing better algorithms and computer-assisted examination systems not only for gastro- and colonoscopy, but also for other fields in medicine.Entities:
Mesh:
Year: 2020 PMID: 32859981 PMCID: PMC7455694 DOI: 10.1038/s41597-020-00622-y
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
An overview of existing GI datasets.
| Dataset | Findings | Size | Availability |
|---|---|---|---|
| CVC-356[ | Polyps | 356 images† | by request● |
| CVC-ClinicDB[ | Polyps | 612 images† | open academic |
| CVC-VideoClinicDB[ | Polyps | 11954 images† | by request● |
| CVC-ColonDB[ | Polyps | 380 images† | by request● |
| Endoscopy Artifact detection 2019[ | Endoscopic Artifacts | 5,138 images | open academic |
| ASU-Mayo polyp database[ | Polyps | 18,781 images† | by request● |
| ETIS-Larib Polyp DB[ | Polyps | 196 images† | open academic |
| KID[ | Angiectasia, bleeding, inflammations, polyps | 2371 images and 47 videos | open academic● |
| GIANA 2017[ | Polyps & Angiodysplasia | 3462 images and 38 videos | by request |
| GIANA 2018[ | Polyps & Small bowel lesions | 8262 images and 38 videos | by request |
| GASTROLAB[ | GI lesions | Some 100s of images and few videos | open academic♣ |
| WEO Clinical Endoscopy Atlas[ | GI lesions | 152 images | by request♣ |
| GI Lesions in Regular Colonoscopy Data Set[ | GI lesions | 76 images† | by request |
| Atlas of Gastrointestinal Endoscope[ | GI lesions | 1295 images | unknown● |
El salvador atlas of gastrointestinal video endoscopy[ | GI lesions | 5071 video clips | open academic♣ |
| Kvasir[ | Polyps, esophagitis, ulcerative colitis, Z-line, pylorus, cecum, dyed polyp, dyed resection margins, stool | 8000 images | open academic |
| Kvasir-SEG[ | Polyps | 1000 images† | open academic |
| Nerthus[ | Stool - categorization of bowel cleanliness | 21 videos | open academic |
†Including ground truth segmentation masks. ◊Video capsule endoscopy. ●Not available anymore. Contour.
♣Not really a dataset usable for machine learning. It is more a medical atlas or database for education with a several low-quality samples of various findings in the GI tract.
Fig. 1Image examples of the various labeled classes for images and/or videos.
Overview of the data records in the HyperKvasir dataset.
| Data Record | # Files | Description | Size (MB) |
|---|---|---|---|
| Labeled images | 10,662 images | 23 classes of findings | 3,960 |
| Segmented Images | 1,000 images | Segmentation masks for polyp findings | 57 |
| Unlabeled Images | 99,417 images | Unlabeled | 29,940 |
| Videos | 374 videos | 30 different classes | 32,539 |
Fig. 2Resolution of the 110,079 images in HyperKvasir.
Fig. 3Statistics of the 374 videos in HyperKvasir.
Fig. 4The number of images in the various HyperKvasir labeled image classes according to the file folders.
Fig. 5The various image classes structured under position and type, also the structure of the stored images.
Fig. 6The number of videos in the various HyperKvasir labeled video classes according to the file folders.
Fig. 7The various video classes structured under position and type, which is also the structure of the video folders.
Average results for the five tested classification approaches, i.e., average of the results for the two splits.
| Method | Macro Average | Micro Average | |||||
|---|---|---|---|---|---|---|---|
| Precision | Recall | F1-score | Precision | Recall | F1-score | MCC ( | |
| Pre-Trained ResNet-50 | 0.589 | 0.536 | 0.530 | 0.839 | 0.839 | 0.839 | 0.826 |
| Pre-Trained ResNet-152 | 0.639 | 0.605 | 0.606 | 0.906 | 0.906 | 0.906 | 0.898 |
| 0.640 | 0.616 | 0.619 | 0.907 | 0.907 | 0.907 | 0.899 | |
| ResNet-152 + DenseNet-161 + MLP | 0.612 | 0.606 | 0.605 | 0.909 | 0.909 | 0.909 | 0.902 |
| Random Guessing | 0.044 | 0.038 | 0.034 | 0.044 | 0.044 | 0.044 | 0.000 |
| Majority Class | 0.004 | 0.043 | 0.008 | 0.108 | 0.108 | 0.108 | N/A |
Fig. 8Confusion matrices for Averaged ResNet-152 + DenseNet-161 and Pre-Trained DenseNet-161 including both splits. These confusion matrices were selected based on their performance. Averaged ResNet-152 + DenseNet-161 achieved the best micro-averaged results while the Pre-Trained DenseNet-161 achieved the best macro-averaged result. The color codes represent the percentages of the total number of images within each class. The labeling of the classes is as follows: (A) Barrett’s; (B) bbps-0-1; (C) bbps-2-3; (D) dyed lifted polyps; (E) dyed resection margins; (F) hemorrhoids; (G) ileum; (H) impacted stool; (I) normal cecum; (J) normal pylorus; (K) normal Z-line; (L) oesophagitis-a; (M) oesophagitis-b-d; (N) polyp; (O) retroflex rectum; (P) retroflex stomach; (Q) short segment Barrett’s; (R) ulcerative colitis grade 0-1; (S) ulcerative colitis grade 1-2; (T) ulcerative colitis grade 2-3; (U) ulcerative colitis grade 1; (V) ulcerative colitis grade 2; (W) ulcerative colitis grade 3.
Fig. 9Unlabeled image data predictions for Averaged ResNet-152 + DenseNet-161 and Pre-Trained DenseNet-161.
| Measurement(s) | lumen of digestive tract • lumen of colon |
| Technology Type(s) | Gastrointestinal Endoscopy • Colonoscopy |
| Sample Characteristic - Organism | Homo sapiens |