| Literature DB >> 34345375 |
Liu Xin1,2, Zheng Bin2, Duan Xiaoqin3,2, He Wenjing4, Li Yuandong5, Zhao Jinyu2, Zhao Chen1,6, Wang Lin2.
Abstract
Eye-tracking can help decode the intricate control mechanism in human performance. In healthcare, physicians-in-training require extensive practice to improve their healthcare skills. When a trainee encounters any difficulty in the practice, they will need feedback from experts to improve their performance. Personal feedback is time-consuming and subjected to bias. In this study, we tracked the eye movements of trainees during their colonoscopic performance in simulation. We examined changes in eye movement behavior during the moments of navigation loss (MNL), a signature sign for task difficulty during colonoscopy, and tested whether deep learning algorithms can detect the MNL by feeding data from eye-tracking. Human eye gaze and pupil characteristics were learned and verified by the deep convolutional generative adversarial networks (DCGANs); the generated data were fed to the Long Short-Term Memory (LSTM) networks with three different data feeding strategies to classify MNLs from the entire colonoscopic procedure. Outputs from deep learning were compared to the expert's judgment on the MNLs based on colonoscopic videos. The best classification outcome was achieved when we fed human eye data with 1000 synthesized eye data, where accuracy (91.80%), sensitivity (90.91%), and specificity (94.12%) were optimized. This study built an important foundation for our work of developing an education system for training healthcare skills using simulation.Entities:
Keywords: Deep Convolutional Generative Adversarial Networks (DCGANs); Long Short-Term Memory (LSTM); colonoscopy; eye-tracking; navigation; simulation
Year: 2021 PMID: 34345375 PMCID: PMC8327395 DOI: 10.16910/jemr.14.2.5
Source DB: PubMed Journal: J Eye Mov Res ISSN: 1995-8692 Impact factor: 0.957
Figure 1.Experimental setup. A Tobii X2-60 was installed under the monitor of the Accutouch VR Endoscopic Simulator. Throughout a colonoscopic procedure, a moment of navigation loss (MNL) might occur several times (bottom panel, highlighted in light yellow). During a MNL, the lumen of the colon disappeared; the participant’s eye was scanning the wall of the colon to search for the correct direction.
Time and frequency of saccade and fixation compared between MNL and non-MNL.
| phase duration (s) | 12.53 ± 1.45 | 22.97 ± 2.01 | |
| saccade duration (s) | 5.49 ± 1.00 | 10.78 ± 1.44 | |
| saccade number | 52.06 ± 7.91 | 101.19 ± 9.36 | |
| fixation number | 23.69 ± 2.77 | 40.94 ± 3.89 | |
| saccade frequency (#/s) | 4.29 ± 0.26 | 4.60 ± 0.18 | 0.329 |
| fixation frequency (#/s) | 2.04 ± 0.08 | 1.95 ± 0.08 | 0.406 |
| gaze event frequency (#/s) | 6.34 ± 0.28 | 6.55 ± 0.20 | 0.530 |
| mean duration of saccade for each time (s) | 0.10 ± 0.00 | 0.09 ± 0.00 | 0.474 |
| mean duration of fixation for each time (s) | 0.30 ± 0.02 | 0.28 ± 0.01 | 0.490 |
| saccade duration percent (%) | 41.19 ± 3.07 | 44.10 ± 2.91 | 0.506 |
| saccade number percent (%) | 65.36 ± 1.51 | 68.91 ± 1.31 | 0.082 |
| fixation number percent (%) | 34.64 ± 1.51 | 31.09 ± 1.31 | 0.082 |
Saccade amplitude and fixation distance compared over MNL and non-MNL.
| FixDis in [0,25] pixels (%) | 29.05 ± 2.61 | 43.22 ± 2.33 | -14.18 | |
| FixDis in [0,50] pixels (%) | 48.70 ± 3.28 | 66.67 ± 2.22 | -17.97 | |
| FixDis in [0,75] pixels (%) | 63.60 ± 3.02 | 82.53 ± 1.68 | ||
| FixDis in [0,100] pixels (%) | 75.80 ± 2.89 | 90.35 ± 1.22 | -14.56 | |
| SacAmp (degrees) | 2.48 ± 0.19 | 1.45 ± 0.08 | ||
| SacAmp > 1.5° (%) | 61.38 ± 2.98 | 44.13 ± 2.29 | 17.26 | |
| SacAmp > 2.0° (%) | 54.22 ± 2.96 | 34.41 ± 2.17 | 19.82 | |
| SacAmp > 2.5° (%) | 46.60 ± 2.96 | 25.12 ± 1.83 | ||
| SacAmp > 3.0° (%) | 39.26 ± 2.84 | 19.33 ± 1.62 | 19.93 | |
| SacAmp > 3.5° (%) | 32.17 ± 2.66 | 14.08 ± 1.36 | 18.09 | |
| SacAmp > 4.0° (%) | 25.33 ± 2.71 | 11.14 ± 1.16 | 14.19 | |
| SacAmp > 4.5° (%) | 21.66 ± 2.37 | 8.36 ± 1.11 | 13.29 | |
| SacAmp > 5.0° (%) | 18.78 ± 2.27 | 6.48 ± 0.95 | 12.30 | |
| SacAmp > 5.5° (%) | 16.17 ± 2.01 | 5.01 ± 0.83 | 11.15 | |
| SacAmp > 6.0° (%) | 12.99 ± 1.86 | 4.15 ± 0.81 | 8.84 | |
| SacAmp > 6.5° (%) | 11.91 ± 1.73 | 3.65 ± 0.78 | 8.25 | |
| SacAmp > 7.0° (%) | 11.03 ± 1.69 | 2.69 ± 0.64 | 8.33 | |
| SacAmp > 7.5° (%) | 9.84 ± 1.62 | 2.03 ± 0.54 | 7.81 |
Pupil size compared between MNL and non-MNL.
| APS of left eye in trial (%) | 46.96 ± 1.54 | 64.38 ± 1.43 | |
| APS of right eye in trial (%) | 46.99 ± 1.45 | 62.52 ± 1.60 | |
| APS of left eye in saccade (%) | 46.22 ± 1.52 | 63.50 ± 1.49 | |
| APS of right eye in saccade (%) | 46.80 ± 1.41 | 61.75 ± 1.63 | |
| APS of left eye in fixation (%) | 47.61 ± 1.58 | 65.16 ± 1.39 | |
| APS of right eye in fixation (%) | 47.28 ± 1.47 | 63.31 ± 1.60 | |
| cumulative frequency of APS in [55%,100%] (left eye) (%) | 25.88 ± 4.03 | 72.07 ± 3.21 | |
| cumulative frequency of APS in [55%,100%] (right eye) (%) | 26.53 ± 3.80 | 68.01 ± 3.45 | |
| index of maximum cumulative frequency of APS (left eye) | 10.05 ± 0.33 | 13.71 ± 0.31 | |
| index of maximum cumulative frequency of APS (right eye) | 9.93 ± 0.33 | 13.43 ± 0.36 |
Figure 2.Illustration of AI architecture for detecting MNLs during colonoscopy. Top Panel: DCGANs-based generator for synthesizing data for MNL and non-MNL phases independently. Bottom Panel: flowchart of feeding data to LSTM model to detect and classify MNLs in a colonoscopic procedure.
Figure 3.Fixation distance and adjusted pupil size in MNL and non-MNL. A-B): Subject 1’s fixation trajectory in MNL and non-MNL during a colonoscopy; C) Subject 5’s cumulative frequency of APS in the range of [55%, 100%]. From 1 to 26 (horizontal axis) are the cumulative frequency in 13 MNL phases; from 27 to 52 (horizontal axis) are the cumulative frequency in 13 non-MNL phases. Here, an odd number in the horizontal axis represents the left eye; an even number represents the right eye.
Figure 4.Verification of synthesized to the real human eye data by t-SNE-based visualization.
MNL and non-MNL classification results. R represents real eye data; S represents synthesized eye data.
| Training Set | Test Set | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|
| MNL: 51 (R) non-MNL: 77(R) | MNL: 17(R) non-MNL: 44(R) | 81.96% | 79.55% | 88.24% |
| MNL:51(R)+200(S) non-MNL: 77(R)+200(S) | 83.61% | 79.55% | 94.12% | |
| MNL: 51(R)+1000(S) non-MNL: 77(R)+1000(S) | 91.80% | 90.91% | 94.12% | |
| MNL: 51(R)+1600(S) non-MNL: 77(R)+1600(S) | 88.52% | 95.45% | 70.59% | |
| MNL: 51(R)+2000(S) non-MNL: 77(R)+2000(S) | 83.61% | 86.36% | 76.47% |
Figure 5.ROC curves for MNL and non-MNL classification.