Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Bidirectional long short-term memory for surgical skill classification of temporally segmented tasks.

Literature DB >> 33000365

Bidirectional long short-term memory for surgical skill classification of temporally segmented tasks.

Jason D Kelly¹, Ashley Petersen², Thomas S Lendvay³, Timothy M Kowalewski⁴.

Abstract

PURPOSE: The majority of historical surgical skill research typically analyzes holistic summary task-level metrics to create a skill classification for a performance. Recent advances in machine learning allow time series classification at the sub-task level, allowing predictions on segments of tasks, which could improve task-level technical skill assessment.
METHODS: A bidirectional long short-term memory (LSTM) network was used with 8-s windows of multidimensional time-series data from the Basic Laparoscopic Urologic Skills dataset. The network was trained on experts and novices from four common surgical tasks. Stratified cross-validation with regularization was used to avoid overfitting. The misclassified cases were re-submitted for surgical technical skill assessment to crowds using Amazon Mechanical Turk to re-evaluate and to analyze the level of agreement with previous scores.
RESULTS: Performance was best for the suturing task, with 96.88% accuracy at predicting whether a performance was an expert or novice, with 1 misclassification, when compared to previously obtained crowd evaluations. When compared with expert surgeon ratings, the LSTM predictions resulted in a Spearman coefficient of 0.89 for suturing tasks. When crowds re-evaluated misclassified performances, it was found that for all 5 misclassified cases from peg transfer and suturing tasks, the crowds agreed more with our LSTM model than with the previously obtained crowd scores.
CONCLUSION: The technique presented shows results not incomparable with labels which would be obtained from crowd-sourced labels of surgical tasks. However, these results bring about questions of the reliability of crowd sourced labels in videos of surgical tasks. We, as a research community, should take a closer look at crowd labeling with higher scrutiny, systematically look at biases, and quantify label noise.

Entities: Chemical

Keywords: Bidirectional LSTM; Crowd sourcing; Machine learning; Surgical skill; Surgical technical skill

Mesh：

Year: 2020 PMID： 33000365 PMCID： PMC7677176 DOI： 10.1007/s11548-020-02269-x

Source DB: PubMed Journal: Int J Comput Assist Radiol Surg ISSN： 1861-6410 Impact factor: 2.924

Keyword Cloud
References

21 in total

1. Development and validation of a comprehensive program of education and assessment of the basic fundamentals of laparoscopic surgery.

Authors: Jeffrey H Peters; Gerald M Fried; Lee L Swanstrom; Nathaniel J Soper; Lelan F Sillin; Bruce Schirmer; Kaaren Hoffman
Journal: Surgery Date: 2004-01 Impact factor: 3.982

2. Crowd-Sourced Assessment of Technical Skills for Validation of Basic Laparoscopic Urologic Skills Tasks.

Authors: Timothy M Kowalewski; Bryan Comstock; Robert Sweet; Cory Schaffhausen; Ashleigh Menhadji; Timothy Averch; Geoffrey Box; Timothy Brand; Michael Ferrandino; Jihad Kaouk; Bodo Knudsen; Jaime Landman; Benjamin Lee; Bradley F Schwartz; Elspeth McDougall; Thomas S Lendvay
Journal: J Urol Date: 2016-01-14 Impact factor: 7.450

3. A global assessment tool for evaluation of intraoperative laparoscopic skills.

Authors: Melina C Vassiliou; Liane S Feldman; Christopher G Andrew; Simon Bergman; Karen Leffondré; Donna Stanbridge; Gerald M Fried
Journal: Am J Surg Date: 2005-07 Impact factor: 2.565

4. Surgical skill and complication rates after bariatric surgery.

Authors: John D Birkmeyer; Jonathan F Finks; Amanda O'Reilly; Mary Oerline; Arthur M Carlin; Andre R Nunn; Justin Dimick; Mousumi Banerjee; Nancy J O Birkmeyer
Journal: N Engl J Med Date: 2013-10-10 Impact factor: 91.245

5. Development of a model for training and evaluation of laparoscopic skills.

Authors: A M Derossis; G M Fried; M Abrahamowicz; H H Sigman; J S Barkun; J L Meakins
Journal: Am J Surg Date: 1998-06 Impact factor: 2.565

6. Long short-term memory.

Authors: S Hochreiter; J Schmidhuber
Journal: Neural Comput Date: 1997-11-15 Impact factor: 2.026

7. The minimally acceptable classification criterion for surgical skill: intent vectors and separability of raw motion data.

Authors: Rodney L Dockter; Thomas S Lendvay; Robert M Sweet; Timothy M Kowalewski
Journal: Int J Comput Assist Radiol Surg Date: 2017-05-18 Impact factor: 2.924