Literature DB >> 21162663

Learning a generative model of images by factoring appearance and shape.

Nicolas Le Roux1, Nicolas Heess, Jamie Shotton, John Winn.   

Abstract

Computer vision has grown tremendously in the past two decades. Despite all efforts, existing attempts at matching parts of the human visual system's extraordinary ability to understand visual scenes lack either scope or power. By combining the advantages of general low-level generative models and powerful layer-based and hierarchical models, this work aims at being a first step toward richer, more flexible models of images. After comparing various types of restricted Boltzmann machines (RBMs) able to model continuous-valued data, we introduce our basic model, the masked RBM, which explicitly models occlusion boundaries in image patches by factoring the appearance of any patch region from its shape. We then propose a generative model of larger images using a field of such RBMs. Finally, we discuss how masked RBMs could be stacked to form a deep model able to generate more complicated structures and suitable for various tasks such as segmentation or object recognition.

Entities:  

Year:  2010        PMID: 21162663     DOI: 10.1162/NECO_a_00086

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  2 in total

1.  Gaussian-binary restricted Boltzmann machines for modeling natural image statistics.

Authors:  Jan Melchior; Nan Wang; Laurenz Wiskott
Journal:  PLoS One       Date:  2017-02-02       Impact factor: 3.240

2.  Three learning stages and accuracy-efficiency tradeoff of restricted Boltzmann machines.

Authors:  Lennart Dabelow; Masahito Ueda
Journal:  Nat Commun       Date:  2022-09-17       Impact factor: 17.694

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.