Literature DB >> 31639622

Multi-task recurrent convolutional network with correlation loss for surgical video analysis.

Yueming Jin1, Huaxia Li1, Qi Dou2, Hao Chen1, Jing Qin3, Chi-Wing Fu1, Pheng-Ann Heng4.   

Abstract

Surgical tool presence detection and surgical phase recognition are two fundamental yet challenging tasks in surgical video analysis as well as very essential components in various applications in modern operating rooms. While these two analysis tasks are highly correlated in clinical practice as the surgical process is typically well-defined, most previous methods tackled them separately, without making full use of their relatedness. In this paper, we present a novel method by developing a multi-task recurrent convolutional network with correlation loss (MTRCNet-CL) to exploit their relatedness to simultaneously boost the performance of both tasks. Specifically, our proposed MTRCNet-CL model has an end-to-end architecture with two branches, which share earlier feature encoders to extract general visual features while holding respective higher layers targeting for specific tasks. Given that temporal information is crucial for phase recognition, long-short term memory (LSTM) is explored to model the sequential dependencies in the phase recognition branch. More importantly, a novel and effective correlation loss is designed to model the relatedness between tool presence and phase identification of each video frame, by minimizing the divergence of predictions from the two branches. Mutually leveraging both low-level feature sharing and high-level prediction correlating, our MTRCNet-CL method can encourage the interactions between the two tasks to a large extent, and hence can bring about benefits to each other. Extensive experiments on a large surgical video dataset (Cholec80) demonstrate outstanding performance of our proposed method, consistently exceeding the state-of-the-art methods by a large margin, e.g., 89.1% v.s. 81.0% for the mAP in tool presence detection and 87.4% v.s. 84.5% for F1 score in phase recognition.
Copyright © 2019 Elsevier B.V. All rights reserved.

Keywords:  Correlation loss; Deep learning; Multi-task learning; Surgical video analysis

Year:  2019        PMID: 31639622     DOI: 10.1016/j.media.2019.101572

Source DB:  PubMed          Journal:  Med Image Anal        ISSN: 1361-8415            Impact factor:   8.545


  10 in total

1.  Real-time medical phase recognition using long-term video understanding and progress gate method.

Authors:  Yanyi Zhang; Ivan Marsic; Randall S Burd
Journal:  Med Image Anal       Date:  2021-09-03       Impact factor: 8.545

2.  Improving situation recognition using endoscopic videos and navigation information for endoscopic sinus surgery.

Authors:  Kazuya Kawamura; Ryu Ebata; Ryoichi Nakamura; Nobuyoshi Otori
Journal:  Int J Comput Assist Radiol Surg       Date:  2022-09-23       Impact factor: 3.421

3.  Data-centric multi-task surgical phase estimation with sparse scene segmentation.

Authors:  Ricardo Sanchez-Matilla; Maria Robu; Maria Grammatikopoulou; Imanol Luengo; Danail Stoyanov
Journal:  Int J Comput Assist Radiol Surg       Date:  2022-05-03       Impact factor: 3.421

4.  Surgical workflow recognition with 3DCNN for Sleeve Gastrectomy.

Authors:  Bokai Zhang; Amer Ghanem; Alexander Simes; Henry Choi; Andrew Yoo
Journal:  Int J Comput Assist Radiol Surg       Date:  2021-08-20       Impact factor: 2.924

Review 5.  State-of-the-art of situation recognition systems for intraoperative procedures.

Authors:  D Junger; S M Frommer; O Burgert
Journal:  Med Biol Eng Comput       Date:  2022-02-17       Impact factor: 2.602

6.  Gauze Detection and Segmentation in Minimally Invasive Surgery Video Using Convolutional Neural Networks.

Authors:  Guillermo Sánchez-Brizuela; Francisco-Javier Santos-Criado; Daniel Sanz-Gobernado; Eusebio de la Fuente-López; Juan-Carlos Fraile; Javier Pérez-Turiel; Ana Cisnal
Journal:  Sensors (Basel)       Date:  2022-07-11       Impact factor: 3.847

7.  Reducing prediction volatility in the surgical workflow recognition of endoscopic pituitary surgery.

Authors:  Adrito Das; Sophia Bano; Francisco Vasconcelos; Danyal Z Khan; Hani J Marcus; Danail Stoyanov
Journal:  Int J Comput Assist Radiol Surg       Date:  2022-04-01       Impact factor: 3.421

8.  Real-Time Tool Detection for Workflow Identification in Open Cranial Vault Remodeling.

Authors:  Alicia Pose Díez de la Lastra; Lucía García-Duarte Sáenz; David García-Mato; Luis Hernández-Álvarez; Santiago Ochandiano; Javier Pascau
Journal:  Entropy (Basel)       Date:  2021-06-26       Impact factor: 2.524

9.  ClipAssistNet: bringing real-time safety feedback to operating rooms.

Authors:  Florian Aspart; Jon L Bolmgren; Joël L Lavanchy; Guido Beldi; Michael S Woods; Nicolas Padoy; Enes Hosgor
Journal:  Int J Comput Assist Radiol Surg       Date:  2021-07-23       Impact factor: 2.924

10.  Object and anatomical feature recognition in surgical video images based on a convolutional neural network.

Authors:  Yoshiko Bamba; Shimpei Ogawa; Michio Itabashi; Hironari Shindo; Shingo Kameoka; Takahiro Okamoto; Masakazu Yamamoto
Journal:  Int J Comput Assist Radiol Surg       Date:  2021-06-24       Impact factor: 2.924

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.