
Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to Atari Breakout game.

Authors:  Devdhar Patel; Hananel Hazan; Daniel J Saunders; Hava T Siegelmann; Robert Kozma

Abstract

Deep Reinforcement Learning (RL) demonstrates excellent performance on tasks that can be solved by a trained policy. It plays a dominant role among cutting-edge machine learning approaches using multi-layer neural networks (NNs). At the same time, Deep RL suffers from high sensitivity to noisy, incomplete, and misleading input data. Following biological intuition, we involve Spiking Neural Networks (SNNs) to address some deficiencies of deep RL solutions. Previous studies in the image classification domain demonstrated that standard NNs (with ReLU nonlinearity) trained using supervised learning can be converted to SNNs with negligible deterioration in performance. In this paper, we extend those conversion results to the domain of Q-learning NNs trained using RL. We provide a proof of principle of the conversion of a standard NN to an SNN. In addition, we show that the SNN has improved robustness to occlusion in the input image. Finally, we present results on converting a full-scale Deep Q-network to an SNN, paving the way for future research on robust Deep RL applications.
Copyright © 2019 Elsevier Ltd. All rights reserved.
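The conversion technique the abstract builds on maps each ReLU unit of a trained network to an integrate-and-fire (IF) spiking neuron whose firing rate approximates the ReLU activation. A minimal sketch of that idea, assuming a single layer, constant input currents, and a subtraction-reset IF neuron (the function and parameter names here are illustrative, not from the paper's code):

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def if_layer_rate(w, x, T=1000, v_th=1.0):
    """Simulate one layer of integrate-and-fire neurons driven by the
    constant current w @ x for T timesteps; return per-neuron firing rates.
    For activations in [0, 1], the rate approximates relu(w @ x)."""
    current = w @ x
    v = np.zeros(w.shape[0])       # membrane potentials
    spikes = np.zeros(w.shape[0])  # spike counts
    for _ in range(T):
        v += current
        fired = v >= v_th
        spikes += fired
        v[fired] -= v_th           # reset by subtraction preserves the rate code
    return spikes / T

rng = np.random.default_rng(0)
w = rng.uniform(-1, 1, size=(4, 3))
x = rng.uniform(0, 1, size=3)
ann = relu(w @ x)
snn = if_layer_rate(w, x)
# Rates saturate at one spike per timestep, so compare against the clipped
# ReLU output; the gap shrinks as T grows (roughly O(1/T)).
print(np.max(np.abs(np.clip(ann, 0, 1) - snn)))
```

In full conversions, weight or activation normalization is typically applied so that activations stay below the saturation rate; the paper extends this rate-coding correspondence from supervised classifiers to Q-networks trained with RL.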

Keywords:  Atari; Deep learning; Reinforcement learning; Robustness; Spiking neural networks

Year:  2019        PMID: 31500931     DOI: 10.1016/j.neunet.2019.08.009

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  4 in total

1.  Entropy-Aware Model Initialization for Effective Exploration in Deep Reinforcement Learning.

Authors:  Sooyoung Jang; Hyung-Il Kim
Journal:  Sensors (Basel)       Date:  2022-08-04       Impact factor: 3.847

2.  Solving the spike feature information vanishing problem in spiking deep Q network with potential based normalization.

Authors:  Yinqian Sun; Yi Zeng; Yang Li
Journal:  Front Neurosci       Date:  2022-08-25       Impact factor: 5.152

3.  Training Spiking Neural Networks for Reinforcement Learning Tasks With Temporal Coding Method.

Authors:  Guanlin Wu; Dongchen Liang; Shaotong Luan; Ji Wang
Journal:  Front Neurosci       Date:  2022-08-17       Impact factor: 5.152

4.  Training spiking neuronal networks to perform motor control using reinforcement and evolutionary learning.

Authors:  Daniel Haşegan; Matt Deible; Christopher Earl; David D'Onofrio; Hananel Hazan; Haroon Anwar; Samuel A Neymotin
Journal:  Front Comput Neurosci       Date:  2022-09-30       Impact factor: 3.387

