Literature DB >> 28885168

Generalized Multi-View Embedding for Visual Recognition and Cross-Modal Retrieval.

Guanqun Cao, Alexandros Iosifidis, Ke Chen, Moncef Gabbouj.   

Abstract

In this paper, the problem of multi-view embedding from different visual cues and modalities is considered. We propose a unified solution for subspace learning methods using the Rayleigh quotient, which is extensible for multiple views, supervised learning, and nonlinear embeddings. Numerous methods including canonical correlation analysis, partial least square regression, and linear discriminant analysis are studied using specific intrinsic and penalty graphs within the same framework. Nonlinear extensions based on kernels and (deep) neural networks are derived, achieving better performance than the linear ones. Moreover, a novel multi-view modular discriminant analysis is proposed by taking the view difference into consideration. We demonstrate the effectiveness of the proposed multi-view embedding methods on visual object recognition and cross-modal image retrieval, and obtain superior results in both applications compared to related methods.

Year:  2017        PMID: 28885168     DOI: 10.1109/TCYB.2017.2742705

Source DB:  PubMed          Journal:  IEEE Trans Cybern        ISSN: 2168-2267            Impact factor:   11.448


  2 in total

1.  Multi-View Broad Learning System for Primate Oculomotor Decision Decoding.

Authors:  Zhenhua Shi; Xiaomo Chen; Changming Zhao; He He; Veit Stuphorn; Dongrui Wu
Journal:  IEEE Trans Neural Syst Rehabil Eng       Date:  2020-06-18       Impact factor: 3.802

2.  Student behavior analysis to measure engagement levels in online learning environments.

Authors:  Khawlah Altuwairqi; Salma Kammoun Jarraya; Arwa Allinjawi; Mohamed Hammami
Journal:  Signal Image Video Process       Date:  2021-05-14       Impact factor: 2.157

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.