Sigmoid-weighted linear units for neural network function approximation in reinforcement learning.


Stefan Elfwing, Eiji Uchibe, Kenji Doya.

Abstract

In recent years, neural networks have enjoyed a renaissance as function approximators in reinforcement learning. Two decades after Tesauro's TD-Gammon achieved near top-level human performance in backgammon, the deep reinforcement learning algorithm DQN achieved human-level performance in many Atari 2600 games. The purpose of this study is twofold. First, we propose two activation functions for neural network function approximation in reinforcement learning: the sigmoid-weighted linear unit (SiLU) and its derivative function (dSiLU). The activation of the SiLU is computed by the sigmoid function multiplied by its input. Second, we suggest that the more traditional approach of using on-policy learning with eligibility traces, instead of experience replay, and softmax action selection can be competitive with DQN, without the need for a separate target network. We validate our proposed approach by, first, achieving new state-of-the-art results in both stochastic SZ-Tetris and Tetris with a small 10 × 10 board, using TD(λ) learning and shallow dSiLU network agents, and, then, by outperforming DQN in the Atari 2600 domain by using a deep Sarsa(λ) agent with SiLU and dSiLU hidden units.
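The two activation functions described in the abstract can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the function names `sigmoid`, `silu`, and `dsilu` are chosen here for readability, and the dSiLU expression is obtained by differentiating x·σ(x) with the product rule, which gives σ(x)(1 + x(1 − σ(x))).

```python
import math


def sigmoid(x: float) -> float:
    """Logistic sigmoid, written to avoid overflow for large |x|."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)  # safe: x < 0, so exp(x) is in (0, 1)
    return z / (1.0 + z)


def silu(x: float) -> float:
    """Sigmoid-weighted linear unit: the input multiplied by its sigmoid."""
    return x * sigmoid(x)


def dsilu(x: float) -> float:
    """Derivative of the SiLU with respect to its input.

    d/dx [x * s(x)] = s(x) + x * s(x) * (1 - s(x))
                    = s(x) * (1 + x * (1 - s(x)))
    """
    s = sigmoid(x)
    return s * (1.0 + x * (1.0 - s))


if __name__ == "__main__":
    for x in (-4.0, -1.0, 0.0, 1.0, 4.0):
        print(f"x={x:+.1f}  SiLU={silu(x):+.4f}  dSiLU={dsilu(x):+.4f}")
```

At x = 0 the SiLU is 0 and the dSiLU is 0.5. For large positive inputs the SiLU approaches the identity, and for large negative inputs it approaches zero.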


Keywords: Atari 2600; Deep learning; Function approximation; Reinforcement learning; Sigmoid-weighted linear unit; Tetris


Year: 2018 PMID: 29395652 DOI: 10.1016/j.neunet.2017.12.012

Source DB: PubMed Journal: Neural Netw ISSN: 0893-6080

Cited by

15 in total


1. Air pollution prediction by using an artificial neural network model.

2. Lightweight ViT Model for Micro-Expression Recognition Enhanced by Transfer Learning.

3. NewtonNet: a Newtonian message passing network for deep learning of interatomic potentials and forces.

4. Δ-Quantum machine-learning for medicinal chemistry.

5. Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules.

6. A Generalization Performance Study Using Deep Learning Networks in Embedded Systems.

7. SpookyNet: Learning force fields with electronic degrees of freedom and nonlocal effects.

8. A pavement distresses identification method optimized for YOLOv5s.

9. Effective Face Detector Based on YOLOv5 and Superresolution Reconstruction.

10. Detection of Pine Cones in Natural Environment Using Improved YOLOv4 Deep Learning Algorithm.