Joint embedding VQA model based on dynamic word vector.

Zhiyang Ma, Wenfeng Zheng, Xiaobing Chen, Lirong Yin.

Abstract

Existing joint embedding Visual Question Answering (VQA) models use different combinations of image characterization, text characterization, and feature fusion methods, but all of them use static word vectors for text characterization. In real language use, however, the same word may carry different meanings in different contexts and may serve as different grammatical components. Static word vectors cannot express these differences effectively, so semantic and grammatical deviations may result. To address this problem, this article constructs a joint embedding model based on dynamic word vectors, the none KB-Specific network (N-KBSN) model, which differs from the commonly used VQA models based on static word vectors. The N-KBSN model consists of three main parts: a question text and image feature extraction module, a self-attention and guided-attention module, and a feature fusion and classifier module. Its key components are image characterization based on Faster R-CNN, text characterization based on ELMo, and feature enhancement based on a multi-head attention mechanism. The experimental results show that the N-KBSN model outperforms both the 2017-winner (GloVe) model and the 2019-winner (GloVe) model. The introduction of dynamic word vectors improves the accuracy of the overall results.
© 2021 Ma et al.
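
Illustrative note: the abstract's contrast between static and dynamic (context-dependent) word vectors can be shown with a toy sketch. It is not the authors' implementation and it does not use ELMo. The vocabulary, the random 8-dimensional vectors, and the function names are hypothetical, and a single pass of scaled dot-product self-attention is used here only as a simple stand-in for a contextual encoder.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy vocabulary with random 8-dimensional static vectors.
VOCAB = ["river", "bank", "money", "the", "by", "in"]
STATIC = {w: rng.normal(size=8) for w in VOCAB}


def static_vectors(tokens):
    """Static lookup: each word maps to one fixed vector, whatever its context."""
    return np.stack([STATIC[t] for t in tokens])


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def contextual_vectors(tokens):
    """Toy context-dependent vectors (an assumption, not ELMo):
    each output row is an attention-weighted mix of all words in the sentence."""
    x = static_vectors(tokens)
    scores = x @ x.T / np.sqrt(x.shape[1])  # scaled dot-product scores
    return softmax(scores) @ x


s1 = ["the", "river", "bank"]
s2 = ["money", "in", "the", "bank"]

# Compare the vector for "bank" (the last token) across the two sentences.
static_same = np.allclose(static_vectors(s1)[-1], static_vectors(s2)[-1])
contextual_same = np.allclose(contextual_vectors(s1)[-1], contextual_vectors(s2)[-1])
print("static vectors identical:", static_same)
print("contextual vectors identical:", contextual_same)
```

In this sketch the static lookup returns the same vector for "bank" in both sentences, while the context-mixed vector depends on the neighboring words, which is the property the abstract attributes to dynamic word vectors.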

Keywords:  ELMo; Faster R-CNN; MA; VQA

Year:  2021        PMID: 33817003      PMCID: PMC7959642          DOI: 10.7717/peerj-cs.353

Source DB:  PubMed          Journal:  PeerJ Comput Sci        ISSN: 2376-5992


