Literature DB >> 33000365

Bidirectional long short-term memory for surgical skill classification of temporally segmented tasks.

Jason D Kelly1, Ashley Petersen2, Thomas S Lendvay3, Timothy M Kowalewski4.   

Abstract

PURPOSE: The majority of historical surgical skill research typically analyzes holistic summary task-level metrics to create a skill classification for a performance. Recent advances in machine learning allow time series classification at the sub-task level, allowing predictions on segments of tasks, which could improve task-level technical skill assessment.
METHODS: A bidirectional long short-term memory (LSTM) network was used with 8-s windows of multidimensional time-series data from the Basic Laparoscopic Urologic Skills dataset. The network was trained on experts and novices from four common surgical tasks. Stratified cross-validation with regularization was used to avoid overfitting. The misclassified cases were re-submitted for surgical technical skill assessment to crowds using Amazon Mechanical Turk to re-evaluate and to analyze the level of agreement with previous scores.
RESULTS: Performance was best for the suturing task, with 96.88% accuracy at predicting whether a performance was an expert or novice, with 1 misclassification, when compared to previously obtained crowd evaluations. When compared with expert surgeon ratings, the LSTM predictions resulted in a Spearman coefficient of 0.89 for suturing tasks. When crowds re-evaluated misclassified performances, it was found that for all 5 misclassified cases from peg transfer and suturing tasks, the crowds agreed more with our LSTM model than with the previously obtained crowd scores.
CONCLUSION: The technique presented shows results not incomparable with labels which would be obtained from crowd-sourced labels of surgical tasks. However, these results bring about questions of the reliability of crowd sourced labels in videos of surgical tasks. We, as a research community, should take a closer look at crowd labeling with higher scrutiny, systematically look at biases, and quantify label noise.

Entities:  

Keywords:  Bidirectional LSTM; Crowd sourcing; Machine learning; Surgical skill; Surgical technical skill

Mesh:

Year:  2020        PMID: 33000365      PMCID: PMC7677176          DOI: 10.1007/s11548-020-02269-x

Source DB:  PubMed          Journal:  Int J Comput Assist Radiol Surg        ISSN: 1861-6410            Impact factor:   2.924


  21 in total

1.  Development and validation of a comprehensive program of education and assessment of the basic fundamentals of laparoscopic surgery.

Authors:  Jeffrey H Peters; Gerald M Fried; Lee L Swanstrom; Nathaniel J Soper; Lelan F Sillin; Bruce Schirmer; Kaaren Hoffman
Journal:  Surgery       Date:  2004-01       Impact factor: 3.982

2.  Crowd-Sourced Assessment of Technical Skills for Validation of Basic Laparoscopic Urologic Skills Tasks.

Authors:  Timothy M Kowalewski; Bryan Comstock; Robert Sweet; Cory Schaffhausen; Ashleigh Menhadji; Timothy Averch; Geoffrey Box; Timothy Brand; Michael Ferrandino; Jihad Kaouk; Bodo Knudsen; Jaime Landman; Benjamin Lee; Bradley F Schwartz; Elspeth McDougall; Thomas S Lendvay
Journal:  J Urol       Date:  2016-01-14       Impact factor: 7.450

3.  A global assessment tool for evaluation of intraoperative laparoscopic skills.

Authors:  Melina C Vassiliou; Liane S Feldman; Christopher G Andrew; Simon Bergman; Karen Leffondré; Donna Stanbridge; Gerald M Fried
Journal:  Am J Surg       Date:  2005-07       Impact factor: 2.565

4.  Surgical skill and complication rates after bariatric surgery.

Authors:  John D Birkmeyer; Jonathan F Finks; Amanda O'Reilly; Mary Oerline; Arthur M Carlin; Andre R Nunn; Justin Dimick; Mousumi Banerjee; Nancy J O Birkmeyer
Journal:  N Engl J Med       Date:  2013-10-10       Impact factor: 91.245

5.  Development of a model for training and evaluation of laparoscopic skills.

Authors:  A M Derossis; G M Fried; M Abrahamowicz; H H Sigman; J S Barkun; J L Meakins
Journal:  Am J Surg       Date:  1998-06       Impact factor: 2.565

6.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

7.  The minimally acceptable classification criterion for surgical skill: intent vectors and separability of raw motion data.

Authors:  Rodney L Dockter; Thomas S Lendvay; Robert M Sweet; Timothy M Kowalewski
Journal:  Int J Comput Assist Radiol Surg       Date:  2017-05-18       Impact factor: 2.924

Review 8.  Deep learning with convolutional neural network for objective skill evaluation in robot-assisted surgery.

Authors:  Ziheng Wang; Ann Majewicz Fey
Journal:  Int J Comput Assist Radiol Surg       Date:  2018-09-25       Impact factor: 2.924

9.  Surgical motion analysis using discriminative interpretable patterns.

Authors:  Germain Forestier; François Petitjean; Pavel Senin; Fabien Despinoy; Arnaud Huaulmé; Hassan Ismail Fawaz; Jonathan Weber; Lhassane Idoumghar; Pierre-Alain Muller; Pierre Jannin
Journal:  Artif Intell Med       Date:  2018-08-30       Impact factor: 5.326

10.  Beyond task time: automated measurement augments fundamentals of laparoscopic skills methodology.

Authors:  Timothy M Kowalewski; Lee W White; Thomas S Lendvay; Iris S Jiang; Robert Sweet; Andrew Wright; Blake Hannaford; Mika N Sinanan
Journal:  J Surg Res       Date:  2014-06-04       Impact factor: 2.192

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.