Sigmoid-weighted linear units for neural network function approximation in reinforcement learning.

Stefan Elfwing, Eiji Uchibe, Kenji Doya.

Abstract

In recent years, neural networks have enjoyed a renaissance as function approximators in reinforcement learning. Two decades after Tesauro's TD-Gammon achieved near top-level human performance in backgammon, the deep reinforcement learning algorithm DQN achieved human-level performance in many Atari 2600 games. The purpose of this study is twofold. First, we propose two activation functions for neural network function approximation in reinforcement learning: the sigmoid-weighted linear unit (SiLU) and its derivative function (dSiLU). The activation of the SiLU is computed by the sigmoid function multiplied by its input. Second, we suggest that the more traditional approach of using on-policy learning with eligibility traces, instead of experience replay, and softmax action selection can be competitive with DQN, without the need for a separate target network. We validate our proposed approach by, first, achieving new state-of-the-art results in both stochastic SZ-Tetris and Tetris with a small 10 × 10 board, using TD(λ) learning and shallow dSiLU network agents, and, then, by outperforming DQN in the Atari 2600 domain by using a deep Sarsa(λ) agent with SiLU and dSiLU hidden units.
Copyright © 2017 The Author(s). Published by Elsevier Ltd. All rights reserved.
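
The two activation functions named in the abstract can be sketched in a few lines. The SiLU definition (the input multiplied by its sigmoid) is taken from the abstract; the closed form used for the dSiLU is not quoted from it but follows from applying the product rule to x·σ(x). The function names `sigmoid`, `silu`, and `dsilu` are illustrative choices, not identifiers from the paper.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic sigmoid, written to avoid overflow for large |x|."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def silu(x: float) -> float:
    """Sigmoid-weighted linear unit: the input multiplied by its sigmoid."""
    return x * sigmoid(x)


def dsilu(x: float) -> float:
    """Derivative of the SiLU with respect to its input.

    Product rule: d/dx [x * s(x)] = s(x) + x * s(x) * (1 - s(x)),
    which factors to s(x) * (1 + x * (1 - s(x))).
    """
    s = sigmoid(x)
    return s * (1.0 + x * (1.0 - s))


if __name__ == "__main__":
    for x in (-4.0, -1.0, 0.0, 1.0, 4.0):
        print(f"x={x:+.1f}  silu={silu(x):+.4f}  dsilu={dsilu(x):+.4f}")
```

For large positive inputs the SiLU approaches the identity and for large negative inputs it approaches zero, so it behaves like a smooth variant of the ReLU; the dSiLU is a bounded, sigmoid-like curve.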

Keywords:  Atari 2600; Deep learning; Function approximation; Reinforcement learning; Sigmoid-weighted linear unit; Tetris

Year:  2018        PMID: 29395652     DOI: 10.1016/j.neunet.2017.12.012

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  15 in total

1.  Air pollution prediction by using an artificial neural network model.

Authors:  Heidar Maleki; Armin Sorooshian; Gholamreza Goudarzi; Zeynab Baboli; Yaser Tahmasebi Birgani; Mojtaba Rahmati
Journal:  Clean Technol Environ Policy       Date:  2019-05-28       Impact factor: 3.636

2.  Lightweight ViT Model for Micro-Expression Recognition Enhanced by Transfer Learning.

Authors:  Yanju Liu; Yange Li; Xinhai Yi; Zuojin Hu; Huiyu Zhang; Yanzhong Liu
Journal:  Front Neurorobot       Date:  2022-06-30       Impact factor: 3.493

3.  NewtonNet: a Newtonian message passing network for deep learning of interatomic potentials and forces.

Authors:  Mojtaba Haghighatlari; Jie Li; Xingyi Guan; Oufan Zhang; Akshaya Das; Christopher J Stein; Farnaz Heidar-Zadeh; Meili Liu; Martin Head-Gordon; Luke Bertels; Hongxia Hao; Itai Leven; Teresa Head-Gordon
Journal:  Digit Discov       Date:  2022-04-27

4.  Δ-Quantum machine-learning for medicinal chemistry.

Authors:  Kenneth Atz; Clemens Isert; Markus N A Böcker; José Jiménez-Luna; Gisbert Schneider
Journal:  Phys Chem Chem Phys       Date:  2022-05-11       Impact factor: 3.945

5.  Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules.

Authors:  Eiji Uchibe
Journal:  Front Neurorobot       Date:  2018-09-27       Impact factor: 2.650

6.  A Generalization Performance Study Using Deep Learning Networks in Embedded Systems.

Authors:  Joseba Gorospe; Rubén Mulero; Olatz Arbelaitz; Javier Muguerza; Miguel Ángel Antón
Journal:  Sensors (Basel)       Date:  2021-02-03       Impact factor: 3.576

7.  SpookyNet: Learning force fields with electronic degrees of freedom and nonlocal effects.

Authors:  Oliver T Unke; Stefan Chmiela; Michael Gastegger; Kristof T Schütt; Huziel E Sauceda; Klaus-Robert Müller
Journal:  Nat Commun       Date:  2021-12-14       Impact factor: 14.919

8.  A pavement distresses identification method optimized for YOLOv5s.

Authors:  Keyou Guo; Chengbo He; Min Yang; Sudong Wang
Journal:  Sci Rep       Date:  2022-03-03       Impact factor: 4.379

9.  Effective Face Detector Based on YOLOv5 and Superresolution Reconstruction.

Authors:  Qingqing Xu; Zhiyu Zhu; Huilin Ge; Zheqing Zhang; Xu Zang
Journal:  Comput Math Methods Med       Date:  2021-11-16       Impact factor: 2.238

10.  Detection of Pine Cones in Natural Environment Using Improved YOLOv4 Deep Learning Algorithm.

Authors:  Ze Luo; Yizhuo Zhang; Keqi Wang; Liping Sun
Journal:  Comput Intell Neurosci       Date:  2021-12-16