Literature DB >> 33526868

Adaptive hyperparameter updating for training restricted Boltzmann machines on quantum annealers.

Guanglei Xu1,2, William S Oates3,4.   

Abstract

Restricted Boltzmann Machines (RBMs) have been proposed for developing neural networks for a variety of unsupervised machine learning applications such as image recognition, drug discovery, and materials design. The Boltzmann probability distribution is used as a model to identify network parameters by optimizing the likelihood of predicting an output given hidden states trained on available data. Training such networks often requires sampling over a large probability space that must be approximated during gradient based optimization. Quantum annealing has been proposed as a means to search this space more efficiently which has been experimentally investigated on D-Wave hardware. D-Wave implementation requires selection of an effective inverse temperature or hyperparameter ([Formula: see text]) within the Boltzmann distribution which can strongly influence optimization. Here, we show how this parameter can be estimated as a hyperparameter applied to D-Wave hardware during neural network training by maximizing the likelihood or minimizing the Shannon entropy. We find both methods improve training RBMs based upon D-Wave hardware experimental validation on an image recognition problem. Neural network image reconstruction errors are evaluated using Bayesian uncertainty analysis which illustrate more than an order magnitude lower image reconstruction error using the maximum likelihood over manually optimizing the hyperparameter. The maximum likelihood method is also shown to out-perform minimizing the Shannon entropy for image reconstruction.

Entities:  

Year:  2021        PMID: 33526868      PMCID: PMC7851404          DOI: 10.1038/s41598-021-82197-1

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


  3 in total

1.  Training products of experts by minimizing contrastive divergence.

Authors:  Geoffrey E Hinton
Journal:  Neural Comput       Date:  2002-08       Impact factor: 2.026

2.  Quantum machine learning.

Authors:  Jacob Biamonte; Peter Wittek; Nicola Pancotti; Patrick Rebentrost; Nathan Wiebe; Seth Lloyd
Journal:  Nature       Date:  2017-09-13       Impact factor: 49.962

3.  Temperature based Restricted Boltzmann Machines.

Authors:  Guoqi Li; Lei Deng; Yi Xu; Changyun Wen; Wei Wang; Jing Pei; Luping Shi
Journal:  Sci Rep       Date:  2016-01-13       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.