Literature DB >> 32623278

Spatio-temporal visual attention modelling of standard biometry plane-finding navigation.

Yifan Cai1, Richard Droste2, Harshita Sharma2, Pierre Chatelain2, Lior Drukker3, Aris T Papageorghiou3, J Alison Noble2.   

Abstract

We present a novel multi-task neural network called Temporal SonoEyeNet (TSEN) with a primary task to describe the visual navigation process of sonographers by learning to generate visual attention maps of ultrasound images around standard biometry planes of the fetal abdomen, head (trans-ventricular plane) and femur. TSEN has three components: a feature extractor, a temporal attention module (TAM), and an auxiliary video classification module (VCM). A soft dynamic time warping (sDTW) loss function is used to improve visual attention modelling. Variants of the model are trained on a dataset of 280 video clips, each containing one of the three biometry planes and lasting 3-7 seconds, with corresponding real-time recorded gaze tracking data of an experienced sonographer. We report the performances of the different variants of TSEN for visual attention prediction at standard biometry plane detection. The best model performance is achieved using bi-directional convolutional long-short term memory (biCLSTM) in both TAM and VCM, and it outperforms a previous spatial model on all static and dynamic saliency metrics. As an auxiliary task to validate the clinical relevance of the visual attention modelling, the predicted visual attention maps were used to guide standard biometry plane detection in consecutive US video frames. All spatio-temporal TSEN models achieve higher scores compared to a spatial-only baseline; the best performing TSEN model achieves F1 scores on these standard biometry planes of 83.7%, 89.9% and 81.1%, respectively.
Copyright © 2020. Published by Elsevier B.V.

Entities:  

Keywords:  Fetal ultrasound; Gaze tracking; Multi-task learning; Saliency prediction; Standard plane detection

Mesh:

Year:  2020        PMID: 32623278     DOI: 10.1016/j.media.2020.101762

Source DB:  PubMed          Journal:  Med Image Anal        ISSN: 1361-8415            Impact factor:   8.545


  4 in total

1.  Multi-Modal Learning from Video, Eye Tracking, and Pupillometry for Operator Skill Characterization in Clinical Fetal Ultrasound.

Authors:  Harshita Sharma; Lior Drukker; Aris T Papageorghiou; J Alison Noble
Journal:  Proc IEEE Int Symp Biomed Imaging       Date:  2021-05-25

2.  Toward deep observation: A systematic survey on artificial intelligence techniques to monitor fetus via ultrasound images.

Authors:  Mahmood Alzubaidi; Marco Agus; Khalid Alyafei; Khaled A Althelaya; Uzair Shah; Alaa Abd-Alrazaq; Mohammed Anbar; Michel Makhlouf; Mowafa Househ
Journal:  iScience       Date:  2022-07-03

3.  Towards Scale and Position Invariant Task Classification using Normalised Visual Scanpaths in Clinical Fetal Ultrasound.

Authors:  Clare Teng; Harshita Sharma; Lior Drukker; Aris T Papageorghiou; J Alison Noble
Journal:  Simpl Med Ultrasound (2021)       Date:  2021-09-21

4.  Transforming obstetric ultrasound into data science using eye tracking, voice recording, transducer motion and ultrasound video.

Authors:  Lior Drukker; Harshita Sharma; Richard Droste; Mohammad Alsharid; Pierre Chatelain; J Alison Noble; Aris T Papageorghiou
Journal:  Sci Rep       Date:  2021-07-08       Impact factor: 4.379

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.