| Literature DB >> 33709240 |
Masahiro Nishihori1, Takashi Izumi2, Yoshitaka Nagano3, Masaki Sato2, Tetsuya Tsukada2, Asuka Elisabeth Kropp2, Toshihiko Wakabayashi2.
Abstract
PURPOSE: In endovascular neurosurgery, the operator often acquires three-dimensional (3D) images of the cerebral vessels. Although workstation reoperation is required in some situations during treatment, it leads to time loss because a sterile condition cannot be maintained and treatment must be temporarily interrupted. Therefore, a workstation reoperating system is required while maintaining the desired sterility.Entities:
Keywords: Angiography; Contactless interface; Endovascular neurosurgery; Endovascular treatment; Kinect
Mesh:
Year: 2021 PMID: 33709240 PMCID: PMC7951120 DOI: 10.1007/s11548-021-02330-3
Source DB: PubMed Journal: Int J Comput Assist Radiol Surg ISSN: 1861-6410 Impact factor: 2.924
Fig. 1A layout with Kinect and sub-display attached to the existing angiography system
Fig. 2Scheme showing the developed network of operation interfaces in this study. The solid line indicates wired connection, whereas the broken line indicates wireless connection
A summary of the development and implementation environment for this system
| Details | |
|---|---|
| System development environment | Computer: Intel Core i5-4690 CPU at 3.50 GHz and 8.0 GB of RAM |
| OS: Windows 8.1 Pro | |
| Development platform: Visual Studio 2012 | |
| Programming language: Visual C++2012 | |
| Libraries: Opencv 2.4.10, Kinect for Windows SDK 2.0 | |
| Voice recognition software | Julius, an open-source “A large vocabulary continuous speech recognition engine” (Version.4.1.3) |
| Gesture judgment | Kinect recognizes three different shapes of both hands (rock, paper, scissors) and the 3D position of both hands |
| Measures to prevent malfunction | The right-hand movement operation is activated when the hand shape remains the same for more than 0.3 s |
| Voice command is activated when the left hand is in the shape of scissors for more than 0.5 s | |
| Microphone | Headsets with microphones (Jabra) (The microphone and PC were connected via Bluetooth). |
| Mouse and keyboard control | Microcomputer: Arduino Mirco |
| Development platform: Arduino Integrated Development Environment (IDE) 1.6.7. | |
| (The microcomputer generated the keyboard and mouse signals). |
Fig. 3A table showing the shape/direction/movement of both hands and the corresponding accompanying navigation functions. In the rightmost column, the reply sound and display indicating the confirmed operation are described. N/A: not applicable
The correspondence table between voice commands and assigned functions
| Voice command (operator pronounces in Japanese) | Function | Reply sound to operator's headset |
|---|---|---|
| Kaishi | Allow gesture operation | Start |
| Rokku | Stop gesture operation | Lock |
| Ueue, Shitashita, Ushiro, Maemae, Migimigi, Hidari, Uraura | Change to the defined viewpoint | Head, feet, posterior, anterior, right, left, back |
| Keisoku | Switch to measurement mode | Ruler |
| Hozon | Save to Bookmark & Save Snapshot | Save |
| Yomikomi | Load bookmark | Load |
| Alphagamen/betagamen | Switching the front/lateral screen mode of biplane | “A” or “B” |
| Etasubi | Switching to dual screen mode | Dual |
Kaishi, Rokku, Keisoku, Ue, Shita, Ushiro, Mae, Migi, Hidari, Ura, Hozon, and Yomikomi are all pronunciations in Japanese and have the same meaning as English words. Gamen means display and Etasubi means A plus B. Some words are repeated to increase the voice recognition rate (e.g., Ue is intentionally repeated twice with Ueue)
Fig. 4An operator performing the gesture and voice operation and a 3D volume rendering technique (VRT) image moving on the workstation in synchronization with the operator’s action. a Picking stone with his right hand allows observation of the rotation movement of the 3D VRT image just like using a mouse. b When the operator plays scissors with his hand and says “Hidari,” the VRT image instantly switches to the image seen on the left side. “Hidari” means left in Japanese
A table shows the time taken for each operation at the workstation through a contactless navigation system during endovascular surgery
| (1) AVM | (2) Aneurysm coil embolization | ||
|---|---|---|---|
| Procedure | Seconds | Procedure | Seconds |
| Adjust window level | 11 | Adjust window level | 4 |
| Rotation | 9 | Adjust window level | 3 |
| Panning | 11 | Rotation | 12 |
| Zoom | 4 | Rotation | 6 |
| Rotation | 26 | Zoom | 7 |
| Save to bookmark & save snapshot (voice) | 4 | Switch to measurement mode (Voice) | 4 |
| Prohibit gesture operation (voice) | 7 | Panning | 6 |
| Click action | 1 | ||
| Panning | 5 | ||
| Click action | 2 | ||
| Switch to measurement mode (voice) | 4 | ||
| Panning | 6 | ||
| Click action | 1 | ||
| Panning | 4 | ||
| Click action | 2 | ||
| Switch to measurement mode (voice) | 8 | ||
| Panning | 3 | ||
| Click action | 1 | ||
| Panning | 8 | ||
| Click action | 1 | ||
| Save to bookmark & save snapshot (voice) | 4 | ||
| Prohibit gesture operation (voice) | 3 | ||
| Total time | 72 | Total time | 95 |
Seconds are rounded off to the nearest whole number. The time required for voice commands includes the time required for recognition failure
Fig. 5Comparison of configurations between the conventional and newly developed interfaces