Karl-Friedrich Kowalewski1, Carly R Garrow1, Mona W Schmidt1, Laura Benner2, Beat P Müller-Stich1, Felix Nickel3. 1. Department of General, Visceral, and Transplantation Surgery, University of Heidelberg, Im Neuenheimer Feld 110, 69120, Heidelberg, Germany. 2. Department of Medical Biometry and Informatics, University of Heidelberg, Im Neuenheimer Feld 130.3, 69120, Heidelberg, Germany. 3. Department of General, Visceral, and Transplantation Surgery, University of Heidelberg, Im Neuenheimer Feld 110, 69120, Heidelberg, Germany. felix.nickel@med.uni-heidelberg.de.
Abstract
INTRODUCTION: The most common way of assessing surgical performance is by expert raters to view a surgical task and rate a trainee's performance. However, there is huge potential for automated skill assessment and workflow analysis using modern technology. The aim of the present study was to evaluate machine learning (ML) algorithms using the data of a Myo armband as a sensor device for skills level assessment and phase detection in laparoscopic training. MATERIALS AND METHODS: Participants of three experience levels in laparoscopy performed a suturing and knot tying task on silicon models. Experts rated performance using Objective Structured Assessment of Surgical Skills (OSATS). Participants wore Myo armbands (Thalmic Labs™, Ontario, Canada) to record acceleration, angular velocity, orientation, and Euler orientation. ML algorithms (decision forest, neural networks, boosted decision tree) were compared for skill level assessment and phase detection. RESULTS: 28 participants (8 beginner, 10 intermediate, 10 expert) were included, and 99 knots were available for analysis. A neural network regression model had the lowest mean absolute error in predicting OSATS score (3.7 ± 0.6 points, r2 = 0.03 ± 0.81; OSATS min.-max.: 4-37 points). An ensemble of binary-class neural networks yielded the highest accuracy in predicting skill level (beginners: 82.2% correctly identified, intermediate: 3.0%, experts: 79.5%) whereas standard statistical analysis failed to discriminate between skill levels. Phase detection on raw data showed the best results with a multi-class decision jungle (average 16% correctly identified), but improved to 43% average accuracy with two-class boosted decision trees after Dynamic time warping (DTW) application. CONCLUSION: Modern machine learning algorithms aid in interpreting complex surgical motion data, even when standard analysis fails. Dynamic time warping offers the potential to process and compare surgical motion data in order to allow automated surgical workflow detection. However, further research is needed to interpret and standardize available data and improve sensor accuracy.
INTRODUCTION: The most common way of assessing surgical performance is by expert raters to view a surgical task and rate a trainee's performance. However, there is huge potential for automated skill assessment and workflow analysis using modern technology. The aim of the present study was to evaluate machine learning (ML) algorithms using the data of a Myo armband as a sensor device for skills level assessment and phase detection in laparoscopic training. MATERIALS AND METHODS:Participants of three experience levels in laparoscopy performed a suturing and knot tying task on silicon models. Experts rated performance using Objective Structured Assessment of Surgical Skills (OSATS). Participants wore Myo armbands (Thalmic Labs™, Ontario, Canada) to record acceleration, angular velocity, orientation, and Euler orientation. ML algorithms (decision forest, neural networks, boosted decision tree) were compared for skill level assessment and phase detection. RESULTS: 28 participants (8 beginner, 10 intermediate, 10 expert) were included, and 99 knots were available for analysis. A neural network regression model had the lowest mean absolute error in predicting OSATS score (3.7 ± 0.6 points, r2 = 0.03 ± 0.81; OSATS min.-max.: 4-37 points). An ensemble of binary-class neural networks yielded the highest accuracy in predicting skill level (beginners: 82.2% correctly identified, intermediate: 3.0%, experts: 79.5%) whereas standard statistical analysis failed to discriminate between skill levels. Phase detection on raw data showed the best results with a multi-class decision jungle (average 16% correctly identified), but improved to 43% average accuracy with two-class boosted decision trees after Dynamic time warping (DTW) application. CONCLUSION: Modern machine learning algorithms aid in interpreting complex surgical motion data, even when standard analysis fails. Dynamic time warping offers the potential to process and compare surgical motion data in order to allow automated surgical workflow detection. However, further research is needed to interpret and standardize available data and improve sensor accuracy.
Authors: Yaron Munz; Alex M Almoudaris; Krishna Moorthy; Aristotelis Dosis; Alexander D Liddle; Ara W Darzi Journal: Am J Surg Date: 2007-06 Impact factor: 2.565
Authors: Felix Nickel; Jonathan D Hendrie; Karl-Friedrich Kowalewski; Thomas Bruckner; Carly R Garrow; Maisha Mantel; Hannes G Kenngott; Philipp Romero; Lars Fischer; Beat P Müller-Stich Journal: Langenbecks Arch Surg Date: 2016-04-07 Impact factor: 3.445
Authors: Ignacio Oropesa; Patricia Sánchez-González; Magdalena K Chmarra; Pablo Lamata; Alvaro Fernández; Juan A Sánchez-Margallo; Frank Willem Jansen; Jenny Dankelman; Francisco M Sánchez-Margallo; Enrique J Gómez Journal: Surg Endosc Date: 2012-10-06 Impact factor: 4.584
Authors: Sandeep Ganni; Sanne M B I Botden; Magdalena Chmarra; Richard H M Goossens; Jack J Jakimowicz Journal: Surg Endosc Date: 2018-01-16 Impact factor: 4.584
Authors: Sandeep Ganni; Sanne M B I Botden; Magdalena Chmarra; Meng Li; Richard H M Goossens; Jack J Jakimowicz Journal: J Med Syst Date: 2020-01-24 Impact factor: 4.460
Authors: Karl-Friedrich Kowalewski; Luisa Egen; Chanel E Fischetti; Stefano Puliatti; Gomez Rivas Juan; Mark Taratkin; Rivero Belenchon Ines; Marie Angela Sidoti Abate; Julia Mühlbauer; Frederik Wessels; Enrico Checcucci; Giovanni Cacciamani Journal: Asian J Urol Date: 2022-06-18
Authors: Reinhard Fuchs; Karel M Van Praet; Richard Bieck; Jörg Kempfert; David Holzhey; Markus Kofler; Michael A Borger; Stephan Jacobs; Volkmar Falk; Thomas Neumuth Journal: Int J Comput Assist Radiol Surg Date: 2022-03-16 Impact factor: 3.421
Authors: E Willuth; S F Hardon; F Lang; C M Haney; E A Felinska; K F Kowalewski; B P Müller-Stich; T Horeman; F Nickel Journal: Surg Endosc Date: 2021-02-26 Impact factor: 4.584