Literature DB >> 26372202

A Deep and Autoregressive Approach for Topic Modeling of Multimodal Data.

Yin Zheng, Yu-Jin Zhang, Hugo Larochelle.   

Abstract

Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to deal with multimodal data, such as in image annotation tasks. Another popular approach to model the multimodal data is through deep neural networks, such as the deep Boltzmann machine (DBM). Recently, a new type of topic model called the Document Neural Autoregressive Distribution Estimator (DocNADE) was proposed and demonstrated state-of-the-art performance for text document modeling. In this work, we show how to successfully apply and extend this model to multimodal data, such as simultaneous image classification and annotation. First, we propose SupDocNADE, a supervised extension of DocNADE, that increases the discriminative power of the learned hidden topic features and show how to employ it to learn a joint representation from image visual words, annotation words and class label information. We test our model on the LabelMe and UIUC-Sports data sets and show that it compares favorably to other topic models. Second, we propose a deep extension of our model and provide an efficient way of training the deep model. Experimental results show that our deep model outperforms its shallow version and reaches state-of-the-art performance on the Multimedia Information Retrieval (MIR) Flickr data set.

Entities:  

Year:  2015        PMID: 26372202     DOI: 10.1109/TPAMI.2015.2476802

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  3 in total

1.  Data Analysis and Visualization of Newspaper Articles on Thirdhand Smoke: A Topic Modeling Approach.

Authors:  Qian Liu; Qiuyi Chen; Jiayi Shen; Huailiang Wu; Yimeng Sun; Wai-Kit Ming
Journal:  JMIR Med Inform       Date:  2019-01-29

2.  Health Communication Through News Media During the Early Stage of the COVID-19 Outbreak in China: Digital Topic Modeling Approach.

Authors:  Zequan Zheng; Jiabin Zheng; Qian Liu; Qiuyi Chen; Guan Liu; Sihan Chen; Bojia Chu; Hongyu Zhu; Babatunde Akinwunmi; Jian Huang; Casper J P Zhang; Wai-Kit Ming
Journal:  J Med Internet Res       Date:  2020-04-28       Impact factor: 5.428

3.  Association between community psychological label and user portrait model based on multimodal neural network.

Authors:  Hao Jiang; Xuehong Yin
Journal:  Front Psychol       Date:  2022-08-24
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.