Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 FVQA: Fact-based Visual Question Answering.

Literature DB >> 28945588

FVQA: Fact-based Visual Question Answering.

Peng Wang, Qi Wu, Chunhua Shen, Anthony Dick, Anton van den Hengel.

Abstract

Visual Question Answering (VQA) has attracted much attention in both computer vision and natural language processing communities, not least because it offers insight into the relationships between two important sources of information. Current datasets, and the models built upon them, have focused on questions which are answerable by direct analysis of the question and image alone. The set of such questions that require no external information to answer is interesting, but very limited. It excludes questions which require common sense, or basic factual knowledge to answer, for example. Here we introduce FVQA (Fact-based VQA), a VQA dataset which requires, and supports, much deeper reasoning. FVQA primarily contains questions that require external information to answer. We thus extend a conventional visual question answering dataset, which contains image-question-answer triplets, through additional image-question-answer-supporting fact tuples. Each supporting-fact is represented as a structural triplet, such as <Cat,CapableOf,ClimbingTrees>.

Year: 2017 PMID： 28945588 DOI： 10.1109/TPAMI.2017.2754246

Source DB: PubMed Journal: IEEE Trans Pattern Anal Mach Intell ISSN： 0098-5589 Impact factor: 6.226

Keyword Cloud
Cited

4 in total

FVQA: Fact-based Visual Question Answering.

1. Joint embedding VQA model based on dynamic word vector.

2. Learning to Reason on Tree Structures for Knowledge-Based Visual Question Answering.

3. Deep Modular Bilinear Attention Network for Visual Question Answering.

4. Linguistic issues behind visual question answering.