Literature DB >> 36212702

CMRI2SPEC: CINE MRI SEQUENCE TO SPECTROGRAM SYNTHESIS VIA A PAIRWISE HETEROGENEOUS TRANSLATOR.

Xiaofeng Liu1, Fangxu Xing1, Maureen Stone2, Jerry L Prince3, Jangwon Kim4, Georges El Fakhri1, Jonghye Woo1.   

Abstract

Multimodal representation learning using visual movements from cine magnetic resonance imaging (MRI) and their acoustics has shown great potential to learn shared representation and to predict one modality from another. Here, we propose a new synthesis framework to translate from cine MRI sequences to spectrograms with a limited dataset size. Our framework hinges on a novel fully convolutional heterogeneous translator, with a 3D CNN encoder for efficient sequence encoding and a 2D transpose convolution decoder. In addition, a pairwise correlation of the samples with the same speech word is utilized with a latent space representation disentanglement scheme. Furthermore, an adversarial training approach with generative adversarial networks is incorporated to provide enhanced realism on our generated spectrograms. Our experimental results, carried out with a total of 63 cine MRI sequences alongside speech acoustics, show that our framework improves synthesis accuracy, compared with competing methods. Our framework thereby has shown the potential to aid in better understanding the relationship between the two modalities.

Entities:  

Keywords:  Encoder and Decoder; GAN; Magnetic Resonance Imaging; Video to Spectrogram Synthesis

Year:  2022        PMID: 36212702      PMCID: PMC9544268          DOI: 10.1109/icassp43922.2022.9746381

Source DB:  PubMed          Journal:  Proc IEEE Int Conf Acoust Speech Signal Process        ISSN: 1520-6149


  7 in total

1.  3D tongue motion from tagged and cine MR images.

Authors:  Fangxu Xing; Jonghye Woo; Emi Z Murano; Junghoon Lee; Maureen Stone; Jerry L Prince
Journal:  Med Image Comput Comput Assist Interv       Date:  2013

2.  Automated interpretation of congenital heart disease from multi-view echocardiograms.

Authors:  Jing Wang; Xiaofeng Liu; Fangyun Wang; Lin Zheng; Fengqiao Gao; Hanwen Zhang; Xin Zhang; Wanqing Xie; Binbin Wang
Journal:  Med Image Anal       Date:  2020-12-26       Impact factor: 8.545

3.  SEMI-AUTOMATIC SEGMENTATION OF THE TONGUE FOR 3D MOTION ANALYSIS WITH DYNAMIC MRI.

Authors:  Junghoon Lee; Jonghye Woo; Fangxu Xing; Emi Z Murano; Maureen Stone; Jerry L Prince
Journal:  Proc IEEE Int Symp Biomed Imaging       Date:  2013-12-31

4.  Deep 3D-CNN for Depression Diagnosis with Facial Video Recording of Self-Rating Depression Scale Questionnaire.

Authors:  Wanqing Xie; Lizhong Liang; Yao Lu; Hui Luo; Xiaofeng Liu
Journal:  Annu Int Conf IEEE Eng Med Biol Soc       Date:  2021-11

5.  Mutual Information Regularized Feature-Level Frankenstein for Discriminative Recognition.

Authors:  Xiaofeng Liu; Chao Yang; Jane You; C-C Jay Kuo; B V K Vijaya Kumar
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2022-08-04       Impact factor: 9.322

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.