Literature DB >> 26017443

Reinforcement learning improves behaviour from evaluative feedback.

Michael L Littman1.   

Abstract

Reinforcement learning is a branch of machine learning concerned with using experience gained through interacting with the world and evaluative feedback to improve a system's ability to make behavioural decisions. It has been called the artificial intelligence problem in a microcosm because learning algorithms must act autonomously to perform well and achieve their goals. Partly driven by the increasing availability of rich data, recent years have seen exciting advances in the theory and practice of reinforcement learning, including developments in fundamental technical areas such as generalization, planning, exploration and empirical methodology, leading to increasing applicability to real-life problems.

Entities:  

Mesh:

Year:  2015        PMID: 26017443     DOI: 10.1038/nature14540

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


  10 in total

1.  An experimental design for the development of adaptive treatment strategies.

Authors:  S A Murphy
Journal:  Stat Med       Date:  2005-05-30       Impact factor: 2.373

Review 2.  Reinforcement learning: the good, the bad and the ugly.

Authors:  Peter Dayan; Yael Niv
Journal:  Curr Opin Neurobiol       Date:  2008-08-22       Impact factor: 6.627

3.  Action, outcome, and value: a dual-system framework for morality.

Authors:  Fiery Cushman
Journal:  Pers Soc Psychol Rev       Date:  2013-08

4.  Neuroscience: Dopamine ramps up.

Authors:  Yael Niv
Journal:  Nature       Date:  2013-08-29       Impact factor: 49.962

Review 5.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

Review 6.  Rapid learning for precision oncology.

Authors:  Jeff Shrager; Jay M Tenenbaum
Journal:  Nat Rev Clin Oncol       Date:  2014-01-21       Impact factor: 66.675

7.  Toward a modern theory of adaptive networks: expectation and prediction.

Authors:  R S Sutton; A G Barto
Journal:  Psychol Rev       Date:  1981-03       Impact factor: 8.934

8.  Human-level control through deep reinforcement learning.

Authors:  Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A Rusu; Joel Veness; Marc G Bellemare; Alex Graves; Martin Riedmiller; Andreas K Fidjeland; Georg Ostrovski; Stig Petersen; Charles Beattie; Amir Sadik; Ioannis Antonoglou; Helen King; Dharshan Kumaran; Daan Wierstra; Shane Legg; Demis Hassabis
Journal:  Nature       Date:  2015-02-26       Impact factor: 49.962

9.  Is probability matching smart? Associations between probabilistic choices and cognitive ability.

Authors:  Keith E Stanovich
Journal:  Mem Cognit       Date:  2003-03

10.  Stochastic Games.

Authors:  L S Shapley
Journal:  Proc Natl Acad Sci U S A       Date:  1953-10       Impact factor: 11.205

  10 in total
  11 in total

Review 1.  Control of synaptic plasticity in deep cortical networks.

Authors:  Pieter R Roelfsema; Anthony Holtmaat
Journal:  Nat Rev Neurosci       Date:  2018-02-16       Impact factor: 34.870

2.  Interactive machine learning for health informatics: when do we need the human-in-the-loop?

Authors:  Andreas Holzinger
Journal:  Brain Inform       Date:  2016-03-02

3.  Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.

Authors:  Shan Zhong; Quan Liu; QiMing Fu
Journal:  Comput Intell Neurosci       Date:  2016-10-03

4.  Electrophysiological Correlates of Error Monitoring and Feedback Processing in Second Language Learning.

Authors:  Sybrine Bultena; Claudia Danielmeier; Harold Bekkering; Kristin Lemhöfer
Journal:  Front Hum Neurosci       Date:  2017-01-30       Impact factor: 3.169

5.  Disgust and the rubber hand illusion: a registered replication report of Jalal, Krishnakumar, and Ramachandran (2015).

Authors:  Hiroshi Nitta; Haruto Tomita; Yi Zhang; Xinxin Zhou; Yuki Yamada
Journal:  Cogn Res Princ Implic       Date:  2018-05-16

6.  The Convergence of a Cooperation Markov Decision Process System.

Authors:  Xiaoling Mo; Daoyun Xu; Zufeng Fu
Journal:  Entropy (Basel)       Date:  2020-08-30       Impact factor: 2.524

7.  Deep-Reinforcement Learning-Based Co-Evolution in a Predator-Prey System.

Authors:  Xueting Wang; Jun Cheng; Lei Wang
Journal:  Entropy (Basel)       Date:  2019-08-08       Impact factor: 2.524

8.  A cortical circuit for audio-visual predictions.

Authors:  Aleena R Garner; Georg B Keller
Journal:  Nat Neurosci       Date:  2021-12-02       Impact factor: 28.771

9.  Reinforcement Learning of Linking and Tracing Contours in Recurrent Neural Networks.

Authors:  Tobias Brosch; Heiko Neumann; Pieter R Roelfsema
Journal:  PLoS Comput Biol       Date:  2015-10-23       Impact factor: 4.475

10.  Precision Medicine for Hypertension Patients with Type 2 Diabetes via Reinforcement Learning.

Authors:  Sang Ho Oh; Su Jin Lee; Jongyoul Park
Journal:  J Pers Med       Date:  2022-01-11
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.