Literature DB >> 25847670

Data-driven spatio-temporal RGBD feature encoding for action recognition in operating rooms.

Andru P Twinanda1, Emre O Alkan, Afshin Gangi, Michel de Mathelin, Nicolas Padoy.   

Abstract

PURPOSE: Context-aware systems for the operating room (OR) provide the possibility to significantly improve surgical workflow through various applications such as efficient OR scheduling, context-sensitive user interfaces, and automatic transcription of medical procedures. Being an essential element of such a system, surgical action recognition is thus an important research area. In this paper, we tackle the problem of classifying surgical actions from video clips that capture the activities taking place in the OR.
METHODS: We acquire recordings using a multi-view RGBD camera system mounted on the ceiling of a hybrid OR dedicated to X-ray-based procedures and annotate clips of the recordings with the corresponding actions. To recognize the surgical actions from the video clips, we use a classification pipeline based on the bag-of-words (BoW) approach. We propose a novel feature encoding method that extends the classical BoW approach. Instead of using the typical rigid grid layout to divide the space of the feature locations, we propose to learn the layout from the actual 4D spatio-temporal locations of the visual features. This results in a data-driven and non-rigid layout which retains more spatio-temporal information compared to the rigid counterpart.
RESULTS: We classify multi-view video clips from a new dataset generated from 11-day recordings of real operations. This dataset is composed of 1734 video clips of 15 actions. These include generic actions (e.g., moving patient to the OR bed) and actions specific to the vertebroplasty procedure (e.g., hammering). The experiments show that the proposed non-rigid feature encoding method performs better than the rigid encoding one. The classifier's accuracy is increased by over 4 %, from 81.08 to 85.53 %.
CONCLUSION: The combination of both intensity and depth information from the RGBD data provides more discriminative power in carrying out the surgical action recognition task as compared to using either one of them alone. Furthermore, the proposed non-rigid spatio-temporal feature encoding scheme provides more discriminative histogram representations than the rigid counterpart. To the best of our knowledge, this is also the first work that presents action recognition results on multi-view RGBD data recorded in the OR.

Entities:  

Mesh:

Year:  2015        PMID: 25847670     DOI: 10.1007/s11548-015-1186-1

Source DB:  PubMed          Journal:  Int J Comput Assist Radiol Surg        ISSN: 1861-6410            Impact factor:   2.924


  6 in total

1.  A framework for the recognition of high-level surgical tasks from video images for cataract surgeries.

Authors:  F Lalys; L Riffaud; D Bouget; P Jannin
Journal:  IEEE Trans Biomed Eng       Date:  2011-12-23       Impact factor: 4.538

2.  Statistical modeling and recognition of surgical workflow.

Authors:  Nicolas Padoy; Tobias Blum; Seyed-Ahmad Ahmadi; Hubertus Feussner; Marie-Odile Berger; Nassir Navab
Journal:  Med Image Anal       Date:  2010-12-08       Impact factor: 8.545

3.  Modeling and segmentation of surgical workflow from laparoscopic video.

Authors:  Tobias Blum; Hubertus Feussner; Nassir Navab
Journal:  Med Image Comput Comput Assist Interv       Date:  2010

4.  Surgical gesture classification from video and kinematic data.

Authors:  Luca Zappella; Benjamín Béjar; Gregory Hager; René Vidal
Journal:  Med Image Anal       Date:  2013-04-28       Impact factor: 8.545

5.  Seeing is believing: increasing intraoperative awareness to scattered radiation in interventional procedures by combining augmented reality, Monte Carlo simulations and wireless dosimeters.

Authors:  Nicolas Loy Rodas; Nicolas Padoy
Journal:  Int J Comput Assist Radiol Surg       Date:  2015-02-26       Impact factor: 2.924

6.  3D Sensing Algorithms Towards Building an Intelligent Intensive Care Unit.

Authors:  Colin Lea; James Facker; Gregory Hager; Russell Taylor; Suchi Saria
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2013-03-18
  6 in total
  10 in total

1.  CAI4CAI: The Rise of Contextual Artificial Intelligence in Computer Assisted Interventions.

Authors:  Tom Vercauteren; Mathias Unberath; Nicolas Padoy; Nassir Navab
Journal:  Proc IEEE Inst Electr Electron Eng       Date:  2019-10-23       Impact factor: 10.961

Review 2.  Video content analysis of surgical procedures.

Authors:  Constantinos Loukas
Journal:  Surg Endosc       Date:  2017-10-26       Impact factor: 4.584

3.  Measuring Patient Mobility in the ICU Using a Novel Noninvasive Sensor.

Authors:  Andy J Ma; Nishi Rawat; Austin Reiter; Christine Shrock; Andong Zhan; Alex Stone; Anahita Rabiee; Stephanie Griffin; Dale M Needham; Suchi Saria
Journal:  Crit Care Med       Date:  2017-04       Impact factor: 7.598

4.  Real-time medical phase recognition using long-term video understanding and progress gate method.

Authors:  Yanyi Zhang; Ivan Marsic; Randall S Burd
Journal:  Med Image Anal       Date:  2021-09-03       Impact factor: 8.545

5.  Computer Vision in the Operating Room: Opportunities and Caveats.

Authors:  Lauren R Kennedy-Metz; Pietro Mascagni; Antonio Torralba; Roger D Dias; Pietro Perona; Julie A Shah; Nicolas Padoy; Marco A Zenati
Journal:  IEEE Trans Med Robot Bionics       Date:  2020-11-24

Review 6.  Surgical data science - from concepts toward clinical translation.

Authors:  Lena Maier-Hein; Matthias Eisenmann; Duygu Sarikaya; Keno März; Toby Collins; Anand Malpani; Johannes Fallert; Hubertus Feussner; Stamatia Giannarou; Pietro Mascagni; Hirenkumar Nakawala; Adrian Park; Carla Pugh; Danail Stoyanov; Swaroop S Vedula; Kevin Cleary; Gabor Fichtinger; Germain Forestier; Bernard Gibaud; Teodor Grantcharov; Makoto Hashizume; Doreen Heckmann-Nötzel; Hannes G Kenngott; Ron Kikinis; Lars Mündermann; Nassir Navab; Sinan Onogur; Tobias Roß; Raphael Sznitman; Russell H Taylor; Minu D Tizabi; Martin Wagner; Gregory D Hager; Thomas Neumuth; Nicolas Padoy; Justin Collins; Ines Gockel; Jan Goedeke; Daniel A Hashimoto; Luc Joyeux; Kyle Lam; Daniel R Leff; Amin Madani; Hani J Marcus; Ozanan Meireles; Alexander Seitel; Dogu Teber; Frank Ückert; Beat P Müller-Stich; Pierre Jannin; Stefanie Speidel
Journal:  Med Image Anal       Date:  2021-11-18       Impact factor: 13.828

Review 7.  Deep learning-enabled medical computer vision.

Authors:  Andre Esteva; Katherine Chou; Serena Yeung; Nikhil Naik; Ali Madani; Ali Mottaghi; Yun Liu; Eric Topol; Jeff Dean; Richard Socher
Journal:  NPJ Digit Med       Date:  2021-01-08

Review 8.  State-of-the-art of situation recognition systems for intraoperative procedures.

Authors:  D Junger; S M Frommer; O Burgert
Journal:  Med Biol Eng Comput       Date:  2022-02-17       Impact factor: 2.602

9.  Multiple Attention Mechanism Graph Convolution HAR Model Based on Coordination Theory.

Authors:  Kai Hu; Yiwu Ding; Junlan Jin; Min Xia; Huaming Huang
Journal:  Sensors (Basel)       Date:  2022-07-14       Impact factor: 3.847

10.  Video-based fully automatic assessment of open surgery suturing skills.

Authors:  Adam Goldbraikh; Anne-Lise D'Angelo; Carla M Pugh; Shlomi Laufer
Journal:  Int J Comput Assist Radiol Surg       Date:  2022-02-01       Impact factor: 3.421

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.