Objective assessment of laparoscopic targeting skills using a Short-Time Power of Difference (STPOD) method.

Shinji Ohtake¹, Kazuhide Makiyama², Daisuke Yamashita²,³, Tomoyuki Tatenuma², Masahiro Yao².

Abstract

PURPOSE: To ensure that the use of surgical training tools results in improvement of surgical skills, it is necessary to be able to measure and assess surgeons' skills. We established the Short-Time Power of Difference (STPOD) method as an evaluation tool for evaluating targeting technique. The STPOD method evaluates the distance from the actual movement of the forceps to the shortest linear path between two points in a short time period. We examined the effectiveness of the STPOD method as a new forceps kinematic analysis.
METHODS: Six residents were categorized as novices and six urologists as experts. All participants performed box trainer training and LapPASS® Simulator training. During the procedure, objective scores (time, distance, and STPOD) were recorded. STPOD (Power) evaluated motion smoothness and STPOD (Stop) evaluated the stop time of the forceps.
RESULTS: In the box trainer, STPOD (Stop) on the right side was significantly lower for the experts than for the novices. In the simulator, there were significant differences between the experts and the novices in left-side distance and in STPOD (Power). In the correlation of parameters between the box trainer and the simulator, time showed the strongest correlation, while STPOD (Power) and distance showed mild correlations.
CONCLUSION: We showed the construct validity of STPOD (Power) and STPOD (Stop) using both the box trainer and the simulator. This method is a good evaluation tool for assessing a physician's skill; however, there are much more complex motions that are performed in actual surgery. Future studies are needed to focus on evaluation in an environment closer to actual surgery and comparing with other existing methods.

Keywords: Laparoscopic surgery; Motion smoothness; Objective assessment; Short-Time Power of Difference (STPOD); Surgical skills; Surgical training

Year: 2022 PMID: 35426565 PMCID: PMC9124661 DOI: 10.1007/s11548-022-02622-2

Source DB: PubMed Journal: Int J Comput Assist Radiol Surg ISSN: 1861-6410 Impact factor: 3.421

Introduction

To ensure patient safety, it is desirable for surgeons to practice surgical procedures before performing them. In addition, to improve their skill levels and shorten learning curves and procedure times, surgeons are required to practice procedures outside of the operating room. Simulation tools meet such demands. To ensure that such training results in improvement of surgical skills, it is necessary to be able to measure and assess surgeons’ skills. Thus, it is important to understand the performance differences between experienced and novice surgeons. Previous studies distinguished experienced surgeons from novices using performance scores based on performance time, bleeding volume, and number of errors made during an operation. Some studies evaluated laparoscopic skills using evaluation methods such as the Global Operative Assessment of Laparoscopic Skills (GOALS) [1] and Global Evaluation Assessment of Robotic Skills (GEARS) [2]. In laparoscopic surgery in particular, some studies examined psychomotor skills. Using electromagnetic position-tracking sensors, kinematic analyses of motion, involving parameters such as time, path length, and speed, have been performed [3, 4]. Compared with novices, experienced surgeons are expected to handle manipulators more smoothly during laparoscopic surgery and, because of their experience, to spend less time planning their next action, which would result in a shorter time without moving the forceps. However, quantitative parameters, such as motion smoothness (MS) and the blank time when the forceps are not moved, are controversial. There is no established objective method for evaluating targeting technique (applying forceps to an object), which is the basic movement of laparoscopic surgery. Most existing methods for evaluating motion smoothness use acceleration in the form of three-dimensional vectors. We initially used the same method; however, it was difficult to distinguish between novices and experts.
Therefore, we established the Short-Time Power of Difference (STPOD) method as an evaluation tool for assessing targeting technique. Herein we examined the effectiveness (construct validation) of the STPOD method as a new forceps kinematic analysis method.

Materials and methods

This study was approved by the Institutional Review Board of Yokohama City University. All participants gave signed informed consent to participate in the study. The data gathered were coded, and all reporting was confidential and did not affect the participants' official evaluation. Participants could withdraw at any point during the study and were made explicitly aware of this at the time of informed consent. Six residents were categorized as laparoscopic novices (no experience in laparoscopic surgery) and six urologists as laparoscopic experts (> 20 laparoscopic procedures completed and holding a surgical skill qualification issued by the Japanese Society of Endourology) [5]. All participants were right-handed. All participants were oriented to the box trainer and the training simulator LapPASS® (Mitsubishi Precision, Japan, https://www.mpcnet.co.jp/product/lappass/) [6-9], and were given a demonstration of both. The subjects in each category were then randomized into two groups using a randomizing program (https://en.calc-site.com/randoms/grouping). Group 1 underwent training with the box trainer followed by LapPASS®, and Group 2 with LapPASS® followed by the box trainer (Fig. 1a and b).

Fig. 1

A graphical representation of the experiment. a A box trainer with five numbered pins on a cork board. This task requires touching the pins with the tuppel in order. b Targeting training image of LapPASS®. This task requires touching the red balloon with the right hand and the white balloon with the left in order

The task in the box trainer was to touch the targets in order with the forceps, five times with each hand, so that each participant performed the task 10 times in total. The electromagnetic tracking system TrakSTAR® (Mikimoto Beans, Japan, https://tracklab.com.au/products/brands/ndi/ascension-trakstar/) was used to acquire the position of the forceps. The sensor was attached to the tip of the tuppel to obtain position information. The task in LapPASS® consisted of participants performing hand-eye training six times. This hand-eye training consisted of applying the forceps to the targets in order.

The STPOD method is entirely different from existing methods for kinematic analysis. STPOD evaluates and quantifies “how far the position of the tip of the tuppel is either from the shortest path between the starting point and the end point, or from the average of all positions visited during a short time period.” The value obtained is denoted as $P^m$. We calculate STPOD (Power) and STPOD (Stop) by plotting $P^m$ on the y-axis against time on the x-axis. When $P^m$ is less than the threshold, the surgical tool is considered as “not moving.” When $P^m$ is bigger than the threshold, the surgical tool is considered as “not linear nor smooth.” $P^m$ is calculated using a simple product-sum operation and is therefore suitable for real-time evaluation. A schematic diagram of the STPOD method is shown in Fig. 2a–c.

Fig. 2

a A schematic diagram of the STPOD method. b A schematic diagram of power. Left image represents the calculation of $P^m$. The box means the short time ‘m’ and $\hat{x}$ is the regression line ‘$at + b$’. We calculate the Power ‘$P^m$’ from $x_i^m$ and $\hat{x}_i^m$. Right image is a chronological representation of ‘$P^m$’. Time over threshold means ‘time not smooth.’ c A schematic diagram of blank time. In left image, $\hat{x}$ is the average ‘$\bar{x}$’. In right image, time under threshold means ‘time not moving’ (i runs from 1 to n)

The following is an explanation of the STPOD method. In the time series data $(t_i, x_i)$, extract a short time period ‘m’ and define it as $(t_i^m, x_i^m)$ using superscript ‘m’. Here $t$ means time and $x$ means the position of the tip of the tuppel. The position $\mathbf{x}$ is a three-dimensional vector, but here it is simplified to $x$ for clarity. Time period m means 0.5 [s] in the box trainer, and 1 [s] in the simulator. Next, we defined Power ‘$P^m$’ in this period as

$$P^m = \sum_{i=1}^{n} \left(x_i^m - \hat{x}_i^m\right)^2 w_i \qquad (1)$$

Here, $\hat{x}$ is a standard formula and is defined as the average ‘$\bar{x}^m$’ or the regression line ‘$a t_i^m + b$’. a and b were defined as:

$$a = \frac{\sum_{i=1}^{n} \left(t_i^m - \bar{t}^m\right)\left(x_i^m - \bar{x}^m\right)}{\sum_{i=1}^{n} \left(t_i^m - \bar{t}^m\right)^2}, \qquad b = \bar{x}^m - a\,\bar{t}^m$$

where $\bar{t}^m$ and $\bar{x}^m$ were the averages of $t_i^m$ and $x_i^m$, respectively. The latter part of formula (1), $w_i$, is the Hamming window function. In conclusion, formula (1) represents the sum of the squares of the distance from the position of the tip of the tuppel either to the shortest path between the starting point and the end point, or to the average of all positions visited during time period m, weighted by the window function. There are two evaluation indexes in the STPOD method: ‘power,’ which indicates wasteful movement, and ‘blank time,’ which indicates when movement is stopped.
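The windowed sum of squares in formula (1) can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the authors' implementation: the function names (`hamming`, `reference_values`, `window_power`) are hypothetical, the window is assumed to be the standard Hamming window, and the three position coordinates are assumed to be handled independently and summed.

```python
import math


def hamming(n):
    """Standard Hamming window of length n (assumed form of the window)."""
    if n == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def reference_values(t, x, standard):
    """Reference values for one coordinate within one short time window.

    standard="average"    -> the window mean of x (used for blank time)
    standard="regression" -> the least-squares line a*t + b (used for power)
    """
    n = len(t)
    t_bar = sum(t) / n
    x_bar = sum(x) / n
    if standard == "average":
        return [x_bar] * n
    if standard != "regression":
        raise ValueError("standard must be 'average' or 'regression'")
    # Ordinary least-squares slope and intercept; needs at least two distinct times.
    s_tt = sum((ti - t_bar) ** 2 for ti in t)
    a = sum((ti - t_bar) * (xi - x_bar) for ti, xi in zip(t, x)) / s_tt
    b = x_bar - a * t_bar
    return [a * ti + b for ti in t]


def window_power(t, positions, standard="regression"):
    """Weighted sum of squared deviations from the reference in one window.

    t         : sample times in the window
    positions : list of (x, y, z) tip positions, one per sample
    Assumption: each coordinate is compared with its own reference and the
    squared deviations are summed over the three coordinates.
    """
    w = hamming(len(t))
    total = 0.0
    for axis in range(3):
        coord = [p[axis] for p in positions]
        ref = reference_values(t, coord, standard)
        total += sum(wi * (ci - ri) ** 2 for wi, ci, ri in zip(w, coord, ref))
    return total


# Toy data (not from the study): constant-speed motion along a straight line.
t = [0.0, 0.1, 0.2, 0.3, 0.4]
line = [(10.0 * ti, 2.0 * ti, 0.0) for ti in t]
still = [(1.0, 2.0, 3.0)] * 5

power_line = window_power(t, line, "regression")   # close to zero: linear motion
spread_line = window_power(t, line, "average")     # positive: the tip is moving
spread_still = window_power(t, still, "average")   # zero: the tip is not moving
```

With the regression line as the reference, straight constant-speed motion gives a value near zero, whereas with the average as the reference a value near zero indicates a stationary tip, which matches the two uses described in the text.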

[Evaluation item 1: power]

We used the regression line as the standard. When ‘$P^m$’ is bigger than the threshold, the surgical tool is considered to be “not linear.” We calculated the sum of the time sections above the threshold. A smaller value means that the surgical instrument moves linearly and smoothly (Fig. 2b).
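The step of summing the time sections above the threshold can be illustrated as follows. This is a minimal sketch with a hypothetical helper, `time_beyond_threshold`; how a sampling interval is attributed to the value at its start is an assumption made here, and the numbers are toy values rather than study data. The same helper with `above=False` covers the blank-time index, which sums the time below a threshold.

```python
def time_beyond_threshold(times, values, threshold, above=True):
    """Total duration of sampling intervals whose value lies beyond the threshold.

    times  : time stamp of each value (ascending)
    values : windowed value at each time stamp
    above  : True  -> count intervals with value > threshold (power index)
             False -> count intervals with value < threshold (blank-time index)
    Assumption: the interval [times[k], times[k+1]) is attributed to values[k],
    so the last value contributes no duration.
    """
    total = 0.0
    for k in range(len(values) - 1):
        hit = values[k] > threshold if above else values[k] < threshold
        if hit:
            total += times[k + 1] - times[k]
    return total


# Toy series (not from the study).
times = [0.0, 0.5, 1.0, 1.5, 2.0]
values = [1.0, 5.0, 6.0, 2.0, 1.0]

time_not_smooth = time_beyond_threshold(times, values, threshold=3.0, above=True)
time_not_moving = time_beyond_threshold(times, values, threshold=3.0, above=False)
```

In practice the two indexes use different reference standards and may use different thresholds; a single threshold is used above only to keep the example short.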

[Evaluation item 2: blank time]

We used the average as the standard. When ‘$P^m$’ was less than the threshold value, the surgical tool was considered to be “stopped.” We calculated the sum of the time sections below the threshold. A smaller value means that there was shorter stagnation of movement (Fig. 2c).

We evaluated “time” and “distance” as conventional comparative values.

Statistical analysis was performed using EZR [10], which is a graphical user interface for R (The R Foundation for Statistical Computing, Vienna, Austria). The Mann–Whitney U test was performed for each pair of groups. To correct for multiplicity, we applied the Bonferroni correction. As a result, the significance level for the box trainer (n = 4 parameters) changed from p < 0.05 to p < 0.0125 (0.05/4 = 0.0125), and that for the simulator (n = 5 parameters) changed to p < 0.01 (0.05/5 = 0.01).
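The Bonferroni arithmetic above is simple enough to check directly. The short sketch below uses hypothetical helper names and takes its example p values from trial 1 of the right-hand rows of Table 1.

```python
def bonferroni_threshold(alpha, n_comparisons):
    """Per-comparison significance level after Bonferroni correction."""
    return alpha / n_comparisons


def is_significant(p_value, alpha, n_comparisons):
    """True if p_value is below the Bonferroni-corrected level."""
    return p_value < bonferroni_threshold(alpha, n_comparisons)


box_level = bonferroni_threshold(0.05, 4)  # box trainer: 4 parameters
sim_level = bonferroni_threshold(0.05, 5)  # simulator: 5 parameters

# Right hand, trial 1 in the box trainer (p values as listed in Table 1).
stop_significant = is_significant(0.009, 0.05, 4)      # STPOD (Stop)
playtime_significant = is_significant(0.026, 0.05, 4)  # playtime
```

Under the corrected level for the box trainer, the STPOD (Stop) comparison in this row remains significant, whereas the playtime comparison, which would pass at p < 0.05, does not.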

Results

In the box trainer, all the parameters tended to improve in both groups as the number of repetitions increased (Fig. 3). For all repetitions together, STPOD (Stop) on the right side and time on the left side of the experts were significantly lower than those of the novices. This shows the construct validity of STPOD (Stop) in the box trainer (Fig. 4) (Table 1).

Fig. 3

Learning curve of box trainer between the expert and novice groups. “rt” stands for right hand and “lt” for left hand

Fig. 4

Comparison of the box trainer at all times between the expert and novice groups. The results are presented as box and whisker plots, in which every box has a line at the lower quartile, median, and upper quartile values. The whiskers are presented as lines that extend from each end of the box to show the extent of the remaining data. Outliers are plotted with circles

Table 1

Results of the Box trainer task between Expert and Novice groups

Trial	Playtime (s)			Distance (mm)			Power			Stop		
	Expert	Novice	p	Expert	Novice	p	Expert	Novice	p	Expert	Novice	p
Right hand
1	17.2 ± 2.94	26.3 ± 9.68	0.026	1495 ± 274	1422 ± 182	0.240	43.0 ± 6.02	44.5 ± 5.35	0.629	190 ± 60	464 ± 286	0.009
2	13.1 ± 2.28	17.7 ± 4.3	0.180	1417 ± 267	1362 ± 168	0.589	36.2 ± 3.48	39.5 ± 5.28	0.329	106 ± 75	212 ± 103	0.065
3	11.3 ± 2.09	15.2 ± 3.88	0.132	1362 ± 226	1339 ± 214	0.589	33.8 ± 6.81	37.7 ± 6.94	0.520	72 ± 55	161 ± 66	0.065
4	10.6 ± 2.15	14.6 ± 3.57	0.132	1328 ± 255	1316 ± 134	0.818	31.3 ± 5.52	38.2 ± 5.43	0.107	67 ± 60	140 ± 60	0.132
5	10.8 ± 2.73	13.3 ± 2.95	0.240	1317 ± 233	1293 ± 179	0.310	32.1 ± 5.58	36.3 ± 4.82	0.297	79 ± 73	118 ± 61	0.378
Average	12.6 ± 2.12	17.4 ± 7.17	0.132	1384 ± 240	1346 ± 183	0.589	35.3 ± 5.02	39.2 ± 6.28	0.336	103 ± 51	219 ± 192	0.015
Left hand
1	13.8 ± 3.40	15.9 ± 3.82	0.394	1368 ± 258	1432 ± 87	0.937	42.0 ± 9.7	44.5 ± 5.7	0.936	114 ± 55	157 ± 82	0.423
2	11.9 ± 2.66	15.0 ± 3.86	0.240	1283 ± 205	1335 ± 211	0.937	36.7 ± 7.6	41 ± 8.3	0.575	104 ± 49	166 ± 88	0.240
3	11.4 ± 2.35	14.0 ± 3.48	0.180	1286 ± 218	1322 ± 192	0.818	32.2 ± 4.9	37.8 ± 4.7	0.146	103 ± 52	138 ± 74	0.589
4	11.8 ± 2.55	13.3 ± 3.42	0.180	1254 ± 263	1364 ± 316	0.818	33.1 ± 6.3	36.1 ± 6.4	0.465	87 ± 52	119 ± 58	0.336
5	10.9 ± 1.86	13.2 ± 3.62	0.394	1217 ± 215	1317 ± 214	0.818	34 ± 5.7	37.2 ± 6.1	0.520	95 ± 42	119 ± 64	0.699
Average	11.7 ± 2.84	14.3 ± 3.79	0.394	1282 ± 238	1354 ± 221	1.000	35.6 ± 7.9	39.3 ± 7.1	0.310	101 ± 51	140 ± 77	0.485

In the simulator, both groups improved their score as the number of times increased (Fig. 5).

Fig. 5

Learning curve of the simulator between the expert and novice groups. “rt” stands for right hand and “lt” for left hand

For all repetitions together, there was a significant difference in distance on the left side and STPOD (Power) between the experts and the novices. This shows the construct validity of STPOD (Power) in the simulator (Fig. 6) (Table 2).

Fig. 6

Comparison of box trainer at all times between the expert and novice groups. The results are presented as box and whisker plots, in which every box has a line at the lower quartile, median, and upper quartile values. The whiskers are presented as lines that extend from each end of the box to show the extent of the remaining data. Outliers are plotted with circles

Table 2

Results of the simulator task between Expert and Novice groups

Trial	Playtime (s)			rt Distance (mm)			lt Distance (mm)			Power			Stop		
	Expert	Novice	p	Expert	Novice	p	Expert	Novice	p	Expert	Novice	p	Expert	Novice	p
1	51.5 ± 14.5	66.1 ± 25.5	0.485	1498 ± 150	1469 ± 262	0.699	1068 ± 61.2	932 ± 197	0.240	4.83 ± 2.4	10.8 ± 11	0.580	62 ± 18	68 ± 15	0.810
2	40 ± 8.35	51.7 ± 20.3	0.375	1410 ± 156	1352 ± 128	0.699	1053 ± 92.3	891 ± 155	0.093	2.17 ± 0.90	5.5 ± 7.4	1.000	52 ± 12	64 ± 15	0.297
3	36.7 ± 9.14	42.8 ± 11.7	0.377	1372 ± 149	1282 ± 134	0.394	981 ± 137	828 ± 117	0.093	1.67 ± 1.1	2.8 ± 2.5	0.564	50 ± 14	79 ± 31	0.170
4	32.3 ± 6.21	40.5 ± 11.8	0.394	1343 ± 151	1281 ± 136	0.589	946 ± 82.1	849 ± 152	0.240	1.17 ± 0.90	2.2 ± 1.2	0.168	43 ± 6.3	69 ± 37	0.198
5	33.2 ± 6.89	40.8 ± 13.0	0.422	1234 ± 125	1225 ± 79	0.937	935 ± 150	805 ± 110	0.394	1.83 ± 0.90	2 ± 1	0.738	56 ± 16	55 ± 9.6	0.810
6	30.7 ± 7.06	37 ± 10.3	0.261	1298 ± 156	1117 ± 60	0.026	877 ± 114	748 ± 70	0.132	1.17 ± 0.90	2 ± 1.2	0.214	46 ± 9.6	53 ± 10	0.394
Average	37.4 ± 11.5	46.5 ± 19.1	0.240	1359 ± 170	1288 ± 183	0.240	977 ± 129	842 ± 151	0.132	2.14 ± 1.81	4.2 ± 6.5	0.147	51 ± 15	65 ± 24	0.065

In the correlation of parameters between the box trainer and the simulator, time showed the strongest correlation (γ = 0.719), followed by STPOD (Power) and distance, which showed mild correlations (γ = 0.396 and γ = 0.347, respectively) (Fig. 7).

Fig. 7

Pearson correlation plot of box trainer versus simulator in the expert and novice groups

Discussion

Conventionally, simulator performance has been evaluated using time, the amount of bleeding, and rating scales such as GOALS and GEARS. When the surgeon performs the procedure carefully and spends a lot of time, the amount of bleeding is small and the results are often good; however, the burden on the patient is expected to be large. Previous studies reported that a shorter operation time led to a reduction in postoperative complications [11-14]. Ideally, surgery should be both smooth and short; however, it has been difficult to evaluate smoothness. In general, more experienced surgeons are predicted to have smoother motions and to obtain lower MS values than less experienced ones. Furthermore, experts are predicted to need less time to think about what they should do next, which results in a shorter blank time.

Scoring systems such as GOALS and GEARS are used as evaluation methods; however, they depend on the subjective decisions of the expert rater. For this reason, studies have attempted to distinguish MS between experts and novices using objective methods; however, this has been challenging. Maithel et al. evaluated MS using the Computer-enhanced Laparoscopic Training System [15] and found no significant difference. Hofstad et al. evaluated psychomotor skills using the D-Box Basic Simulator [16]. The Aurora Electromagnetic Measurement System was used to track the forceps, and MS was defined as the total change in acceleration of the tip of the instrument. There was a significant difference only in the non-dominant hand when comparing novices with experts and intermediates. A further study assessed cholecystectomy performed in a porcine liver box model [17], with MS defined in the same way as by Hofstad et al.; however, there was no significant difference between the groups. On the other hand, Escamirosa et al. evaluated surgical skills by 13 motion analysis parameters using the EndoViS Training System [18]. The motion of the forceps was recorded by a video-tracking system. They defined MS as abrupt changes in acceleration resulting in jerky movements of the instrument (m/s³). There was a significant difference between the experts and novices in all three tasks: peg transfer, pattern cutting, and intracorporeal knot suture.

We initially used acceleration to evaluate MS using existing methods; however, it could not distinguish between the novices and experts. On the other hand, STPOD was able to distinguish the two groups. Thus, this new method is an alternative way to evaluate MS. In the present study, the right-hand distance in the box trainer was longer in the expert group than in the novice group. This may reflect the habits of each surgeon. In actual laparoscopic surgery, when moving forceps between two points, the forceps are not moved in a straight line and are often moved after they have been pulled back toward the hand. Because of their real-world experience, experts may take a longer path. This study was conducted in the order of right hand then left hand; thus, the participants recognized the position of the pins and became familiar with the movement in the right-hand session, and this may have resulted in the lack of a significant difference between the two groups in the left-hand session.

The most suitable objective method to evaluate blank time is controversial. Uemura et al. evaluated forceps movements using a labeling system based on predefined terminology. They concluded that skilled participants had a shorter blank time (time without forceps movement) than novices. They proposed that the time spent holding the forceps but not moving them was time spent thinking, and that the shorter blank time for the skilled users was due to their experience and stable manipulation movements [19]. In addition, Hofstad et al. reported that there was a significant difference in the idle percentage (percentage of total time the instrument is moved at speed < 2 mm/s) [17]. On the other hand, some reports concluded that there were no significant differences between experts and novices when defining idle time as the percentage of time where the instrument is considered to be still [4, 18].

By using the STPOD method, we evaluated the degree of flicker of the forceps and the blank time when the forceps were not moved. We predicted that there would be little flicker and little blank time in the experts’ surgery. In this study, we found that the experts had less flicker and a shorter blank time in both the box trainer and the simulator. Furthermore, although it is inferior to “time,” the objective evaluation measure that has been used for a long time, we showed a correlation of STPOD between the box trainer and the simulator. Thus, the STPOD method is effective for evaluating forceps dynamics.

There are some limitations in this study. First, the number of subjects was limited. Second, we focused only on targeting, and the task that the participants performed was just moving the forceps between fixed targets. In actual surgery, there are much more complex motions. For this reason, we did not use GOALS evaluation. In the future, evaluation in an environment closer to actual surgery and comparison of the STPOD method with other methods, such as GOALS, are necessary. Third, a comparison between our proposal and existing motion analysis parameters was not performed. Comparing STPOD with existing methods for evaluating motion smoothness is necessary.

References (19 in total)

1. Development of a patient-specific simulator for laparoscopic renal surgery.

Authors: Kazuhide Makiyama; Manabu Nagasaka; Toru Inuiya; Kentaro Takanami; Masato Ogata; Yoshinobu Kubota
Journal: Int J Urol Date: 2012-05-28 Impact factor: 3.369

2. How slow is too slow? Correlation of operative time to complications: an analysis from the Tennessee Surgical Quality Collaborative.

Authors: Brian J Daley; William Cecil; P Chris Clarke; Joseph B Cofer; Oscar D Guillamondegui
Journal: J Am Coll Surg Date: 2015-01-09 Impact factor: 6.113

3. Simulated laparoscopy using a head-mounted display vs traditional video monitor: an assessment of performance and muscle fatigue.

Authors: S K Maithel; L Villegas; N Stylopoulos; S Dawson; D B Jones
Journal: Surg Endosc Date: 2004-12-23 Impact factor: 4.584

4. Psychomotor skills assessment by motion analysis in minimally invasive surgery on an animal organ.

Authors: Erlend Fagertun Hofstad; Cecilie Våpenstad; Lars Eirik Bø; Thomas Langø; Esther Kuhry; Ronald Mårvik
Journal: Minim Invasive Ther Allied Technol Date: 2017-02-08 Impact factor: 2.442

5. Analysis of hand motion differentiates expert and novice surgeons.

Authors: Munenori Uemura; Morimasa Tomikawa; Ryuichi Kumashiro; Tiejun Miao; Ryota Souzaki; Satoshi Ieiri; Kenoki Ohuchida; Alan T Lefor; Makoto Hashizume
Journal: J Surg Res Date: 2013-12-19 Impact factor: 2.192

6. A study of psychomotor skills in minimally invasive surgery: what differentiates expert and nonexpert performance.

Authors: Erlend Fagertun Hofstad; Cecilie Våpenstad; Magdalena Karolina Chmarra; Thomas Langø; Esther Kuhry; Ronald Mårvik
Journal: Surg Endosc Date: 2012-10-06 Impact factor: 4.584

7. EVA: laparoscopic instrument tracking based on Endoscopic Video Analysis for psychomotor skills assessment.

Authors: Ignacio Oropesa; Patricia Sánchez-González; Magdalena K Chmarra; Pablo Lamata; Alvaro Fernández; Juan A Sánchez-Margallo; Frank Willem Jansen; Jenny Dankelman; Francisco M Sánchez-Margallo; Enrique J Gómez
Journal: Surg Endosc Date: 2012-10-06 Impact factor: 4.584

8. Objective classification of psychomotor laparoscopic skills of surgeons based on three different approaches.

Authors: Fernando Pérez-Escamirosa; Antonio Alarcón-Paredes; Gustavo Adolfo Alonso-Silverio; Ignacio Oropesa; Oscar Camacho-Nieto; Daniel Lorias-Espinoza; Arturo Minor-Martínez
Journal: Int J Comput Assist Radiol Surg Date: 2019-10-11 Impact factor: 2.924

9. Clinical use of a patient-specific simulator for patients who were scheduled for robot-assisted laparoscopic partial nephrectomy.

Authors: Kazuhide Makiyama; Tomoyuki Tatenuma; Shinji Ohtake; Atsuto Suzuki; Kentaro Muraoka; Masahiro Yao
Journal: Int J Urol Date: 2020-11-02 Impact factor: 3.369

10. Prolonged Operative Duration Increases Risk of Surgical Site Infections: A Systematic Review.

Authors: Hang Cheng; Brian Po-Han Chen; Ireena M Soleas; Nicole C Ferko; Chris G Cameron; Piet Hinoul
Journal: Surg Infect (Larchmt) Date: 2017 Aug/Sep Impact factor: 2.150