Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Spatio-temporal visual attention modelling of standard biometry plane-finding navigation.

Literature DB >> 32623278

Spatio-temporal visual attention modelling of standard biometry plane-finding navigation.

Yifan Cai¹, Richard Droste², Harshita Sharma², Pierre Chatelain², Lior Drukker³, Aris T Papageorghiou³, J Alison Noble².

Abstract

We present a novel multi-task neural network called Temporal SonoEyeNet (TSEN) with a primary task to describe the visual navigation process of sonographers by learning to generate visual attention maps of ultrasound images around standard biometry planes of the fetal abdomen, head (trans-ventricular plane) and femur. TSEN has three components: a feature extractor, a temporal attention module (TAM), and an auxiliary video classification module (VCM). A soft dynamic time warping (sDTW) loss function is used to improve visual attention modelling. Variants of the model are trained on a dataset of 280 video clips, each containing one of the three biometry planes and lasting 3-7 seconds, with corresponding real-time recorded gaze tracking data of an experienced sonographer. We report the performances of the different variants of TSEN for visual attention prediction at standard biometry plane detection. The best model performance is achieved using bi-directional convolutional long-short term memory (biCLSTM) in both TAM and VCM, and it outperforms a previous spatial model on all static and dynamic saliency metrics. As an auxiliary task to validate the clinical relevance of the visual attention modelling, the predicted visual attention maps were used to guide standard biometry plane detection in consecutive US video frames. All spatio-temporal TSEN models achieve higher scores compared to a spatial-only baseline; the best performing TSEN model achieves F1 scores on these standard biometry planes of 83.7%, 89.9% and 81.1%, respectively.

Entities: Chemical

Keywords: Fetal ultrasound; Gaze tracking; Multi-task learning; Saliency prediction; Standard plane detection

Mesh：

Year: 2020 PMID： 32623278 DOI： 10.1016/j.media.2020.101762

Source DB: PubMed Journal: Med Image Anal ISSN： 1361-8415 Impact factor: 8.545

Keyword Cloud
Cited

4 in total

1. Multi-Modal Learning from Video, Eye Tracking, and Pupillometry for Operator Skill Characterization in Clinical Fetal Ultrasound.

Authors: Harshita Sharma; Lior Drukker; Aris T Papageorghiou; J Alison Noble
Journal: Proc IEEE Int Symp Biomed Imaging Date: 2021-05-25

2. Toward deep observation: A systematic survey on artificial intelligence techniques to monitor fetus via ultrasound images.

Authors: Mahmood Alzubaidi; Marco Agus; Khalid Alyafei; Khaled A Althelaya; Uzair Shah; Alaa Abd-Alrazaq; Mohammed Anbar; Michel Makhlouf; Mowafa Househ
Journal: iScience Date: 2022-07-03

3. Towards Scale and Position Invariant Task Classification using Normalised Visual Scanpaths in Clinical Fetal Ultrasound.

Authors: Clare Teng; Harshita Sharma; Lior Drukker; Aris T Papageorghiou; J Alison Noble
Journal: Simpl Med Ultrasound (2021) Date: 2021-09-21

4. Transforming obstetric ultrasound into data science using eye tracking, voice recording, transducer motion and ultrasound video.

Authors: Lior Drukker; Harshita Sharma; Richard Droste; Mohammad Alsharid; Pierre Chatelain; J Alison Noble; Aris T Papageorghiou
Journal: Sci Rep Date: 2021-07-08 Impact factor: 4.379

4 in total