| Literature DB >> 29903970 |
S M Ali Eslami1, Danilo Jimenez Rezende2, Frederic Besse2, Fabio Viola2, Ari S Morcos2, Marta Garnelo2, Avraham Ruderman2, Andrei A Rusu2, Ivo Danihelka2, Karol Gregor2, David P Reichert2, Lars Buesing2, Theophane Weber2, Oriol Vinyals2, Dan Rosenbaum2, Neil Rabinowitz2, Helen King2, Chloe Hillier2, Matt Botvinick2, Daan Wierstra2, Koray Kavukcuoglu2, Demis Hassabis2.
Abstract
Scene representation-the process of converting visual sensory data into concise descriptions-is a requirement for intelligent behavior. Recent work has shown that neural networks excel at this task when provided with large, labeled datasets. However, removing the reliance on human labeling remains an important open problem. To this end, we introduce the Generative Query Network (GQN), a framework within which machines learn to represent scenes using only their own sensors. The GQN takes as input images of a scene taken from different viewpoints, constructs an internal representation, and uses this representation to predict the appearance of that scene from previously unobserved viewpoints. The GQN demonstrates representation learning without human labels or domain knowledge, paving the way toward machines that autonomously learn to understand the world around them.Entities:
Mesh:
Year: 2018 PMID: 29903970 DOI: 10.1126/science.aar6170
Source DB: PubMed Journal: Science ISSN: 0036-8075 Impact factor: 47.728