Literature DB >> 25164175

Discovering binary codes for documents by learning deep generative models.

Geoffrey Hinton1, Ruslan Salakhutdinov.   

Abstract

We describe a deep generative model in which the lowest layer represents the word-count vector of a document and the top layer represents a learned binary code for that document. The top two layers of the generative model form an undirected associative memory and the remaining layers form a belief net with directed, top-down connections. We present efficient learning and inference procedures for this type of generative model and show that it allows more accurate and much faster retrieval than latent semantic analysis. By using our method as a filter for a much slower method called TF-IDF we achieve higher accuracy than TF-IDF alone and save several orders of magnitude in retrieval time. By using short binary codes as addresses, we can perform retrieval on very large document sets in a time that is independent of the size of the document set using only one word of memory to describe each document.
Copyright © 2010 Cognitive Science Society, Inc.

Keywords:  Auto-encoders; Binary codes; Deep learning; Document retrieval; Restricted Boltzmann machines; Semantic hashing

Mesh:

Year:  2010        PMID: 25164175     DOI: 10.1111/j.1756-8765.2010.01109.x

Source DB:  PubMed          Journal:  Top Cogn Sci        ISSN: 1756-8757


  3 in total

1.  Temperature based Restricted Boltzmann Machines.

Authors:  Guoqi Li; Lei Deng; Yi Xu; Changyun Wen; Wei Wang; Jing Pei; Luping Shi
Journal:  Sci Rep       Date:  2016-01-13       Impact factor: 4.379

2.  High quality topic extraction from business news explains abnormal financial market volatility.

Authors:  Ryohei Hisano; Didier Sornette; Takayuki Mizuno; Takaaki Ohnishi; Tsutomu Watanabe
Journal:  PLoS One       Date:  2013-06-06       Impact factor: 3.240

3.  DNdisorder: predicting protein disorder using boosting and deep networks.

Authors:  Jesse Eickholt; Jianlin Cheng
Journal:  BMC Bioinformatics       Date:  2013-03-06       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.