Literature DB >> 31976493

Captioning Ultrasound Images Automatically.

Mohammad Alsharid1, Harshita Sharma1, Lior Drukker1, Pierre Chatelain1, Aris T Papageorghiou1, J Alison Noble1.   

Abstract

We describe an automatic natural language processing (NLP)-based image captioning method to describe fetal ultrasound video content by modelling the vocabulary commonly used by sonographers and sonologists. The generated captions are similar to the words spoken by a sonographer when describing the scan experience in terms of visual content and performed scanning actions. Using full-length second-trimester fetal ultrasound videos and text derived from accompanying expert voice-over audio recordings, we train deep learning models consisting of convolutional neural networks and recurrent neural networks in merged configurations to generate captions for ultrasound video frames. We evaluate different model architectures using established general metrics (BLEU, ROUGE-L) and application-specific metrics. Results show that the proposed models can learn joint representations of image and text to generate relevant and descriptive captions for anatomies, such as the spine, the abdomen, the heart, and the head, in clinical fetal ultrasound scans.

Entities:  

Keywords:  Deep Learning; Fetal Ultrasound; Image Captioning; Image Description; Natural Language Processing; Recurrent Neural Networks

Year:  2019        PMID: 31976493      PMCID: PMC6978141          DOI: 10.1007/978-3-030-32251-9_37

Source DB:  PubMed          Journal:  Med Image Comput Comput Assist Interv


  2 in total

1.  MTLD, vocd-D, and HD-D: a validation study of sophisticated approaches to lexical diversity assessment.

Authors:  Philip M McCarthy; Scott Jarvis
Journal:  Behav Res Methods       Date:  2010-05

2.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

  2 in total
  3 in total

1.  A Course-Focused Dual Curriculum For Image Captioning.

Authors:  Mohammad Alsharid; Rasheed El-Bouri; Harshita Sharma; Lior Drukker; Aris T Papageorghiou; J Alison Noble
Journal:  Proc IEEE Int Symp Biomed Imaging       Date:  2021-05-25

2.  Automatic captioning for medical imaging (MIC): a rapid review of literature.

Authors:  Djamila-Romaissa Beddiar; Mourad Oussalah; Tapio Seppänen
Journal:  Artif Intell Rev       Date:  2022-09-17       Impact factor: 9.588

3.  Transforming obstetric ultrasound into data science using eye tracking, voice recording, transducer motion and ultrasound video.

Authors:  Lior Drukker; Harshita Sharma; Richard Droste; Mohammad Alsharid; Pierre Chatelain; J Alison Noble; Aris T Papageorghiou
Journal:  Sci Rep       Date:  2021-07-08       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.