Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Reinforcement learning improves behaviour from evaluative feedback.

Literature DB >> 26017443

Reinforcement learning improves behaviour from evaluative feedback.

Abstract

Reinforcement learning is a branch of machine learning concerned with using experience gained through interacting with the world and evaluative feedback to improve a system's ability to make behavioural decisions. It has been called the artificial intelligence problem in a microcosm because learning algorithms must act autonomously to perform well and achieve their goals. Partly driven by the increasing availability of rich data, recent years have seen exciting advances in the theory and practice of reinforcement learning, including developments in fundamental technical areas such as generalization, planning, exploration and empirical methodology, leading to increasing applicability to real-life problems.

Entities: Disease

Mesh：

Year: 2015 PMID： 26017443 DOI： 10.1038/nature14540

Source DB: PubMed Journal: Nature ISSN： 0028-0836 Impact factor: 49.962

10 in total

1. An experimental design for the development of adaptive treatment strategies.

Authors: S A Murphy
Journal: Stat Med Date: 2005-05-30 Impact factor: 2.373

Review 2. Reinforcement learning: the good, the bad and the ugly.

Authors: Peter Dayan; Yael Niv
Journal: Curr Opin Neurobiol Date: 2008-08-22 Impact factor: 6.627

3. Action, outcome, and value: a dual-system framework for morality.

Authors: Fiery Cushman
Journal: Pers Soc Psychol Rev Date: 2013-08

4. Neuroscience: Dopamine ramps up.

Authors: Yael Niv
Journal: Nature Date: 2013-08-29 Impact factor: 49.962

Review 5. A neural substrate of prediction and reward.

Authors: W Schultz; P Dayan; P R Montague
Journal: Science Date: 1997-03-14 Impact factor: 47.728

Review 6. Rapid learning for precision oncology.

Authors: Jeff Shrager; Jay M Tenenbaum
Journal: Nat Rev Clin Oncol Date: 2014-01-21 Impact factor: 66.675

7. Toward a modern theory of adaptive networks: expectation and prediction.

Authors: R S Sutton; A G Barto
Journal: Psychol Rev Date: 1981-03 Impact factor: 8.934

8. Human-level control through deep reinforcement learning.

Authors: Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A Rusu; Joel Veness; Marc G Bellemare; Alex Graves; Martin Riedmiller; Andreas K Fidjeland; Georg Ostrovski; Stig Petersen; Charles Beattie; Amir Sadik; Ioannis Antonoglou; Helen King; Dharshan Kumaran; Daan Wierstra; Shane Legg; Demis Hassabis
Journal: Nature Date: 2015-02-26 Impact factor: 49.962

9. Is probability matching smart? Associations between probabilistic choices and cognitive ability.

Authors: Keith E Stanovich
Journal: Mem Cognit Date: 2003-03

10. Stochastic Games.

Authors: L S Shapley
Journal: Proc Natl Acad Sci U S A Date: 1953-10 Impact factor: 11.205

10 in total

11 in total

Review 1. Control of synaptic plasticity in deep cortical networks.

Authors: Pieter R Roelfsema; Anthony Holtmaat
Journal: Nat Rev Neurosci Date: 2018-02-16 Impact factor: 34.870

2. Interactive machine learning for health informatics: when do we need the human-in-the-loop?

Authors: Andreas Holzinger
Journal: Brain Inform Date: 2016-03-02

3. Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.

Authors: Shan Zhong; Quan Liu; QiMing Fu
Journal: Comput Intell Neurosci Date: 2016-10-03

4. Electrophysiological Correlates of Error Monitoring and Feedback Processing in Second Language Learning.

Authors: Sybrine Bultena; Claudia Danielmeier; Harold Bekkering; Kristin Lemhöfer
Journal: Front Hum Neurosci Date: 2017-01-30 Impact factor: 3.169

5. Disgust and the rubber hand illusion: a registered replication report of Jalal, Krishnakumar, and Ramachandran (2015).

Authors: Hiroshi Nitta; Haruto Tomita; Yi Zhang; Xinxin Zhou; Yuki Yamada
Journal: Cogn Res Princ Implic Date: 2018-05-16