Attention-based spatial–temporal neural network for accurate phase recognition in minimally invasive surgery: feasibility and efficiency verification-Reference-Cited by-同舟云学术

Attention-based spatial–temporal neural network for accurate phase recognition in minimally invasive surgery: feasibility and efficiency verification

Published:2022-02-25 Issue:2 Volume:9 Page:406-416
ISSN:2288-5048
Container-title:Journal of Computational Design and Engineering
language:en
Short-container-title:

Author:

Shi Pan¹,Zhao Zijian¹,Liu Kaidi¹,Li Feng²

Affiliation:

1. School of Control Science and Engineering, Shandong University, Jinan 250061, China

2. Department of General Surgery, Qilu Hospital of Shandong University, Jinan 250012, China

Abstract

Abstract Laparoscopic surgery, as a representative minimally invasive surgery (MIS), is an active research area of clinical practice. Automatic surgical phase recognition of laparoscopic videos is a vital task with the potential to improve surgeons’ efficiency and has gradually become an integral part of computer-assisted intervention systems in MIS. However, the performance of most methods currently employed for surgical phase recognition is deteriorated by optimization difficulties and inefficient computation, which hinders their large-scale practical implementation. This study proposes an efficient and novel surgical phase recognition method using an attention-based spatial–temporal neural network consisting of a spatial model and a temporal model for accurate recognition by end-to-end training. The former subtly incorporates the attention mechanism to enhance the model’s ability to focus on the key regions in video frames and efficiently capture more informative visual features. In the temporal model, we employ independently recurrent long short-term memory (IndyLSTM) and non-local block to extract long-term temporal information of video frames. We evaluated the performance of our method on the publicly available Cholec80 dataset. Our attention-based spatial–temporal neural network purely produces the phase predictions without any post-processing strategies, achieving excellent recognition performance and outperforming other state-of-the-art phase recognition methods.

Funder

National Key Research and Development Program of China

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computer Graphics and Computer-Aided Design,Human-Computer Interaction,Engineering (miscellaneous),Modeling and Simulation,Computational Mechanics

Link

https://academic.oup.com/jcde/article-pdf/9/2/406/42616845/qwac011.pdf

Reference39 articles.

1. Enhancing Arabic aspect-based sentiment analysis using deep learning models;Al-Dabet;Computer Speech & Language,2021

2. Unsupervised temporal context learning using convolutional neural networks for laparoscopic workflow analysis;Bodenstedt,2017

3. Vision-based and marker-less surgical tool detection and tracking: A review of the literature;Bouget;Medical Image Analysis,2017

4. TeCNO: Surgical phase recognition with multi-stage temporal convolutional networks;Czempiel,2020

5. OperA: Attention-regularized transformers for surgical phase recognition;Czempiel,2021

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The development of a deep learning model for automated segmentation of the robotic pancreaticojejunostomy;Surgical Endoscopy;2024-03-15

2. Hierarchical RNNs with graph policy and attention for drone swarm;Journal of Computational Design and Engineering;2024-03-06

3. Event Recognition in Laparoscopic Gynecology Videos with Hybrid Transformers;Lecture Notes in Computer Science;2024

4. Deep Learning in Surgical Workflow Analysis: A Review of Phase and Step Recognition;IEEE Journal of Biomedical and Health Informatics;2023-11

5. A Study of Scoring English Tests Using an Automatic Scoring Model Incorporating Semantics;Automatic Control and Computer Sciences;2023-10