| Literature DB >> 33553586 |
Rajesh N Keswani1, Daniel Byrd2, Florencia Garcia Vicente2, J Alex Heller2, Matthew Klug2, Nikhilesh R Mazumder1, Jordan Wood1, Anthony D Yang3, Mozziyar Etemadi2,4.
Abstract
Background and study aims Storage of full-length endoscopic procedures is becoming increasingly popular. To facilitate large-scale machine learning (ML) focused on clinical outcomes, these videos must be merged with the patient-level data in the electronic health record (EHR). Our aim was to present a method of accurately linking patient-level EHR data with cloud stored colonoscopy videos. Methods This study was conducted at a single academic medical center. Most procedure videos are automatically uploaded to the cloud server but are identified only by procedure time and procedure room. We developed and then tested an algorithm to match recorded videos with corresponding exams in the EHR based upon procedure time and room and subsequently extract frames of interest. Results Among 28,611 total colonoscopies performed over the study period, 21,170 colonoscopy videos in 20,420 unique patients (54.2 % male, median age 58) were matched to EHR data. Of 100 randomly sampled videos, appropriate matching was manually confirmed in all. In total, these videos represented 489,721 minutes of colonoscopy performed by 50 endoscopists (median 214 colonoscopies per endoscopist). The most common procedure indications were polyp screening (47.3 %), surveillance (28.9 %) and inflammatory bowel disease (9.4 %). From these videos, we extracted procedure highlights (identified by image capture; mean 8.5 per colonoscopy) and surrounding frames. Conclusions We report the successful merging of a large database of endoscopy videos stored with limited identifiers to rich patient-level data in a highly accurate manner. This technique facilitates the development of ML algorithms based upon relevant patient outcomes. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commecial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/).Entities:
Year: 2021 PMID: 33553586 PMCID: PMC7857968 DOI: 10.1055/a-1326-1289
Source DB: PubMed Journal: Endosc Int Open ISSN: 2196-9736
Fig. 1Representation of data amalgamation.
Details of colonoscopy videos merged with patient-level EHR data (n = 21,170).
| Number of endoscopists performing colonoscopy | 50 |
| Number of colonoscopies per endoscopist, median (IQR) | 214 (51–758) |
|
Colonoscopy Indications
| |
Screening | 10,018 (47.3 %) |
Surveillance | 6,112 (28.9 %) |
Diagnostic | 3,451(16.3 %) |
Inflammatory bowel disease | 1,998 (9.4 %) |
Positive stool test | 312 (14.7 %) |
| Inflammatory bowel disease indications (n = 1998) | |
Crohnʼs disease | 930 (46.5 %) |
Ulcerative colitis | 824 (41.2 %) |
Indeterminate/not specified | 244 (12.2 %) |
HER, electronic health record; IQR, interquartile range.
Some cases with indications fitting into ≥ 2 categories.
Fig. 2Example of frame sampling around a highlighted point of interest. The central image is the picture captured by the endoscopist. Sampling frames around the highlight displays different (and sometimes more subtle) representations of the same highlight (in this case, a small polyp in the cecum).