Literature DB >> 29735249

An adaptive deep Q-learning strategy for handwritten digit recognition.

Junfei Qiao1, Gongming Wang2, Wenjing Li3, Min Chen4.   

Abstract

Handwritten digits recognition is a challenging problem in recent years. Although many deep learning-based classification algorithms are studied for handwritten digits recognition, the recognition accuracy and running time still need to be further improved. In this paper, an adaptive deep Q-learning strategy is proposed to improve accuracy and shorten running time for handwritten digit recognition. The adaptive deep Q-learning strategy combines the feature-extracting capability of deep learning and the decision-making of reinforcement learning to form an adaptive Q-learning deep belief network (Q-ADBN). First, Q-ADBN extracts the features of original images using an adaptive deep auto-encoder (ADAE), and the extracted features are considered as the current states of Q-learning algorithm. Second, Q-ADBN receives Q-function (reward signal) during recognition of the current states, and the final handwritten digits recognition is implemented by maximizing the Q-function using Q-learning algorithm. Finally, experimental results from the well-known MNIST dataset show that the proposed Q-ADBN has a superiority to other similar methods in terms of accuracy and running time.
Copyright © 2018 Elsevier Ltd. All rights reserved.

Keywords:  Adaptive Q-learning deep belief network; Adaptive deep auto-encoder; Deep learning; Handwritten digits recognition; Reinforcement learning

Mesh:

Year:  2018        PMID: 29735249     DOI: 10.1016/j.neunet.2018.02.010

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  3 in total

1.  AD or Non-AD: A Deep Learning Approach to Detect Advertisements from Magazines.

Authors:  Khaled Almgren; Murali Krishnan; Fatima Aljanobi; Jeongkyu Lee
Journal:  Entropy (Basel)       Date:  2018-12-17       Impact factor: 2.524

2.  Research on predicting 2D-HP protein folding using reinforcement learning with full state space.

Authors:  Hongjie Wu; Ru Yang; Qiming Fu; Jianping Chen; Weizhong Lu; Haiou Li
Journal:  BMC Bioinformatics       Date:  2019-12-24       Impact factor: 3.169

3.  Applications of machine learning in decision analysis for dose management for dofetilide.

Authors:  Andrew E Levy; Minakshi Biswas; Rachel Weber; Khaldoun Tarakji; Mina Chung; Peter A Noseworthy; Christopher Newton-Cheh; Michael A Rosenberg
Journal:  PLoS One       Date:  2019-12-31       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.