Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Learning multimodal latent attributes.

Literature DB >> 24356351

Learning multimodal latent attributes.

Yanwei Fu¹, Timothy M Hospedales¹, Tao Xiang¹, Shaogang Gong¹.

Abstract

The rapid development of social media sharing has created a huge demand for automatic media classification and annotation techniques. Attribute learning has emerged as a promising paradigm for bridging the semantic gap and addressing data sparsity via transferring attribute knowledge in object recognition and relatively simple action classification. In this paper, we address the task of attribute learning for understanding multimedia data with sparse and incomplete labels. In particular, we focus on videos of social group activities, which are particularly challenging and topical examples of this task because of their multimodal content and complex and unstructured nature relative to the density of annotations. To solve this problem, we 1) introduce a concept of semilatent attribute space, expressing user-defined and latent attributes in a unified framework, and 2) propose a novel scalable probabilistic topic model for learning multimodal semilatent attributes, which dramatically reduces requirements for an exhaustive accurate attribute ontology and expensive annotation effort. We show that our framework is able to exploit latent attributes to outperform contemporary approaches for addressing a variety of realistic multimedia sparse data learning tasks including: multitask learning, learning with label noise, N-shot transfer learning, and importantly zero-shot learning.

Mesh：

Year: 2014 PMID： 24356351 DOI： 10.1109/TPAMI.2013.128

Source DB: PubMed Journal: IEEE Trans Pattern Anal Mach Intell ISSN： 0098-5589 Impact factor: 6.226

Keyword Cloud
Cited

2 in total

1. Online multi-modal robust non-negative dictionary learning for visual tracking.

Authors: Xiang Zhang; Naiyang Guan; Dacheng Tao; Xiaogang Qiu; Zhigang Luo
Journal: PLoS One Date: 2015-05-11 Impact factor: 3.240

2. Hessian-regularized co-training for social activity recognition.

Authors: Weifeng Liu; Yang Li; Xu Lin; Dacheng Tao; Yanjiang Wang
Journal: PLoS One Date: 2014-09-26 Impact factor: 3.240

2 in total