| Literature DB >> 36202840 |
Amber Charoen1, Averill Guo2, Panisara Fangsaard3, Supakorn Taweechainaruemitr4, Nuwee Wiwatwattana4, Theekapun Charoenpong5, Harlan G Rich2.
Abstract
Complete endoscopic evaluation of the small bowel is challenging due to its length and anatomy. Although several advances have been made to achieve diagnostic and therapeutic goals, including double-balloon enteroscopy, single-balloon enteroscopy, and spiral enteroscopy, video capsule endoscopy (VCE) remains the least invasive tool for complete visualization of the small bowel and is the preferred method for initial diagnostic evaluation. At present, interpretation of VCE data requires manual annotation of landmarks and abnormalities in recorded videos, which can be time consuming. Computer-assisted diagnostic systems using artificial intelligence may help to optimize VCE reading efficiency by reducing the need for manual annotation. Here we present a large VCE data set compiled from studies performed at two United States hospitals in Providence, Rhode Island, including 424 VCE studies and 5,247,588 total labeled images. In conjunction with existing published data sets, these files may aid in the development of algorithms to further improve VCE.Entities:
Mesh:
Year: 2022 PMID: 36202840 PMCID: PMC9537421 DOI: 10.1038/s41597-022-01726-3
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1The box plot of number of images per anatomical organ for a study.
The descriptive statistics of the data set (number of images per anatomical organ).
| Esophagus | Stomach | Small bowel | Colon | |
|---|---|---|---|---|
| Minimum | 1 | 1 | 1,737 | 1 |
| 1st quartile | 6 | 372 | 6,091 | 558 |
| Median | 12 | 796 | 8,537 | 825 |
| 3rd quartile | 21 | 1,657 | 12,249 | 1,445 |
| Maximum | 3,152 | 25,081 | 37,240 | 31,184 |
| Mean | 32 | 1,314 | 9,698 | 1,332 |
| Total | 13,715 | 557,049 | 4,111,865 | 564,959 |
Fig. 2The sample images of four anatomical organs: (a) esophagus, (b) stomach, (c) small bowel, and (d) colon.
The number of images per data set.
| Data set | Esophagus | Stomach | Small bowel | Colon | Total images |
|---|---|---|---|---|---|
| Training/validation | 9,061 | 466,562 | 3,242,639 | 474,776 | 4,193,038 |
| - Down-sampling | 9,061 | 11,508 | 32,252 | 11,707 | 64,528 |
| - Training | 6,795 | 8,631 | 24,189 | 8,780 | 48,395 |
| - Validation | 2,266 | 2,877 | 8,063 | 2,927 | 16,133 |
| Testing | 4,654 | 90,487 | 869,226 | 90,783 | 1,054,550 |
The testing performance of the trained model.
| Precision | Recall | F1 score | Support (images) | |
|---|---|---|---|---|
| Esophagus | 0.7975 | 0.9951 | 0.8854 | 4,654 |
| Stomach | 0.9280 | 0.8968 | 0.9122 | 90,487 |
| Small bowel | 0.9939 | 0.9793 | 0.9865 | 869,226 |
| Colon | 0.8265 | 0.9609 | 0.8886 | 90,183 |
| Accuracy | 0.9707 | 1,054,550 | ||
| Macro average | 0.8865 | 0.9580 | 0.9182 | 1,054,550 |
| Weighted average | 0.9731 | 0.9707 | 0.9713 | 1,054,550 |
Fig. 3The unnormalized confusion matrix.
| Measurement(s) | organ |
| Technology Type(s) | Videocapsule Endoscopy |
| Sample Characteristic - Organism | Homo sapiens |
| Sample Characteristic - Location | State of Rhode Island |