Literature DB >> 31299625

A multimodal convolutional neuro-fuzzy network for emotion understanding of movie clips.

Tuan-Linh Nguyen1, Swathi Kavuri2, Minho Lee3.   

Abstract

Multimodal emotion understanding enables AI systems to interpret human emotions. With accelerated video surge, emotion understanding remains challenging due to inherent data ambiguity and diversity of video content. Although deep learning has made a considerable progress in big data feature learning, they are viewed as deterministic models used in a "black-box" manner which does not have capabilities to represent inherent ambiguities with data. Since the possibility theory of fuzzy logic focuses on knowledge representation and reasoning under uncertainty, we intend to incorporate the concepts of fuzzy logic into deep learning framework. This paper presents a novel convolutional neuro-fuzzy network, which is an integration of convolutional neural networks in fuzzy logic domain to extract high-level emotion features from text, audio, and visual modalities. The feature sets extracted by fuzzy convolutional layers are compared with those of convolutional neural networks at the same level using t-distributed Stochastic Neighbor Embedding. This paper demonstrates a multimodal emotion understanding framework with an adaptive neural fuzzy inference system that can generate new rules to classify emotions. For emotion understanding of movie clips, we concatenate audio, visual, and text features extracted using the proposed convolutional neuro-fuzzy network to train adaptive neural fuzzy inference system. In this paper, we go one step further to explain how deep learning arrives at a conclusion that can guide us to an interpretable AI. To identify which visual/text/audio aspects are important for emotion understanding, we use direct linear non-Gaussian additive model to explain the relevance in terms of causal relationships between features of deep hidden layers. The critical features extracted are input to the proposed multimodal framework to achieve higher accuracy.
Copyright © 2019 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Convolutional Neural Network (CNN); Convolutional Neuro-Fuzzy Network (CNFN); Deep learning (DL); Fuzzy logic; Interpretable AI; Multimodal emotion understanding

Year:  2019        PMID: 31299625     DOI: 10.1016/j.neunet.2019.06.010

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  3 in total

1.  Particle Swarm Optimized Fuzzy CNN With Quantitative Feature Fusion for Ultrasound Image Quality Identification.

Authors:  Muhammad Minoar Hossain; Md Mahmodul Hasan; Md Abdur Rahim; Mohammad Motiur Rahman; Mohammad Abu Yousuf; Samer Al-Ashhab; Hanan F Akhdar; Salem A Alyami; Akm Azad; Mohammad Ali Moni
Journal:  IEEE J Transl Eng Health Med       Date:  2022-08-10

2.  Deep Neuro-Fuzzy System application trends, challenges, and future perspectives: a systematic survey.

Authors:  Noureen Talpur; Said Jadid Abdulkadir; Hitham Alhussian; Mohd Hilmi Hasan; Norshakirah Aziz; Alwi Bamhdi
Journal:  Artif Intell Rev       Date:  2022-04-13       Impact factor: 8.139

3.  Modeling Subjective Affect Annotations with Multi-Task Learning.

Authors:  Hassan Hayat; Carles Ventura; Agata Lapedriza
Journal:  Sensors (Basel)       Date:  2022-07-13       Impact factor: 3.847

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.