Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 MDPs with Non-Deterministic Policies.

Literature DB >> 21625292

MDPs with Non-Deterministic Policies.

Abstract

Markov Decision Processes (MDPs) have been extensively studied and used in the context of planning and decision-making, and many methods exist to find the optimal policy for problems modelled as MDPs. Although finding the optimal policy is sufficient in many domains, in certain applications such as decision support systems where the policy is executed by a human (rather than a machine), finding all possible near-optimal policies might be useful as it provides more flexibility to the person executing the policy. In this paper we introduce the new concept of non-deterministic MDP policies, and address the question of finding near-optimal non-deterministic policies. We propose two solutions to this problem, one based on a Mixed Integer Program and the other one based on a search algorithm. We include experimental results obtained from applying this framework to optimize treatment choices in the context of a medical decision support system.

Entities: Chemical Disease Gene Species

Year: 2009 PMID： 21625292 PMCID： PMC3103230

Source DB: PubMed Journal: Adv Neural Inf Process Syst ISSN： 1049-5258

3 in total

1. Deciding when to intervene: a Markov decision process approach.

Authors: P Magni; S Quaglini; M Marchetti; G Barosi
Journal: Int J Med Inform Date: 2000-12 Impact factor: 4.046

2. Planning treatment of ischemic heart disease with partially observable Markov decision processes.

Authors: M Hauskrecht; H Fraser
Journal: Artif Intell Med Date: 2000-03 Impact factor: 5.326

Review 3. Background and rationale for the sequenced treatment alternatives to relieve depression (STAR*D) study.

Authors: Maurizio Fava; A John Rush; Madhukar H Trivedi; Andrew A Nierenberg; Michael E Thase; Harold A Sackeim; Frederic M Quitkin; Steven Wisniewski; Philip W Lavori; Jerrold F Rosenbaum; David J Kupfer
Journal: Psychiatr Clin North Am Date: 2003-06

3 in total

2 in total

1. Informing sequential clinical decision-making through reinforcement learning: an empirical study.

Authors: Susan M Shortreed; Eric Laber; Daniel J Lizotte; T Scott Stroup; Joelle Pineau; Susan A Murphy
Journal: Mach Learn Date: 2011-07-01 Impact factor: 2.940

2. A P300-based brain-computer interface with stimuli on moving objects: four-session single-trial and triple-trial tests with a game-like task design.

Authors: Ilya P Ganin; Sergei L Shishkin; Alexander Y Kaplan
Journal: PLoS One Date: 2013-10-31 Impact factor: 3.240

2 in total