Literature DB >> 20399248

Tug-of-war model for the two-bandit problem: nonlocally-correlated parallel exploration via resource conservation.

Song-Ju Kim1, Masashi Aono, Masahiko Hara.   

Abstract

We propose a model - the "tug-of-war (TOW) model" - to conduct unique parallel searches using many nonlocally-correlated search agents. The model is based on the property of a single-celled amoeba, the true slime mold Physarum, which maintains a constant intracellular resource volume while collecting environmental information by concurrently expanding and shrinking its branches. The conservation law entails a "nonlocal correlation" among the branches, i.e., volume increment in one branch is immediately compensated by volume decrement(s) in the other branch(es). This nonlocal correlation was shown to be useful for decision making in the case of a dilemma. The multi-armed bandit problem is to determine the optimal strategy for maximizing the total reward sum with incompatible demands, by either exploiting the rewards obtained using the already collected information or exploring new information for acquiring higher payoffs involving risks. Our model can efficiently manage the "exploration-exploitation dilemma" and exhibits good performances. The average accuracy rate of our model is higher than those of well-known algorithms such as the modified -greedy algorithm and modified softmax algorithm, especially, for solving relatively difficult problems. Moreover, our model flexibly adapts to changing environments, a property essential for living organisms surviving in uncertain environments.

Mesh:

Year:  2010        PMID: 20399248     DOI: 10.1016/j.biosystems.2010.04.002

Source DB:  PubMed          Journal:  Biosystems        ISSN: 0303-2647            Impact factor:   1.973


  6 in total

1.  Decision making for large-scale multi-armed bandit problems using bias control of chaotic temporal waveforms in semiconductor lasers.

Authors:  Kensei Morijiri; Takatomo Mihana; Kazutaka Kanno; Makoto Naruse; Atsushi Uchida
Journal:  Sci Rep       Date:  2022-05-16       Impact factor: 4.996

2.  Single-photon decision maker.

Authors:  Makoto Naruse; Martin Berthel; Aurélien Drezet; Serge Huant; Masashi Aono; Hirokazu Hori; Song-Ju Kim
Journal:  Sci Rep       Date:  2015-08-17       Impact factor: 4.379

3.  Ultrafast photonic reinforcement learning based on laser chaos.

Authors:  Makoto Naruse; Yuta Terashima; Atsushi Uchida; Song-Ju Kim
Journal:  Sci Rep       Date:  2017-08-18       Impact factor: 4.379

4.  On-chip photonic decision maker using spontaneous mode switching in a ring laser.

Authors:  Ryutaro Homma; Satoshi Kochi; Tomoaki Niiyama; Takatomo Mihana; Yusuke Mitsui; Kazutaka Kanno; Atsushi Uchida; Makoto Naruse; Satoshi Sunada
Journal:  Sci Rep       Date:  2019-07-01       Impact factor: 4.379

5.  Decision maker based on nanoscale photo-excitation transfer.

Authors:  Song-Ju Kim; Makoto Naruse; Masashi Aono; Motoichi Ohtsu; Masahiko Hara
Journal:  Sci Rep       Date:  2013       Impact factor: 4.379

6.  Ionic decision-maker created as novel, solid-state devices.

Authors:  Takashi Tsuchiya; Tohru Tsuruoka; Song-Ju Kim; Kazuya Terabe; Masakazu Aono
Journal:  Sci Adv       Date:  2018-09-07       Impact factor: 14.136

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.