Markov decision process - Open Access .click

Results 11 to 20 of about 57,303 (303)

A Promising Approach to Optimizing Sequential Treatment Decisions for Depression: Markov Decision Process. [PDF]

Pharmacoeconomics, 2022
The most appropriate next step in depression treatment after the initial treatment fails is unclear. This study explores the suitability of the Markov decision process for optimizing sequential treatment decisions for depression.
Li F, Jörg F, Li X, Feenstra T.
europepmc +2 more sources

Variance-penalized Markov decision process

Mathematics of Operations Research, 1989
We consider a Markov decision process with both the expected limiting average, and the discounted total return criteria, appropriately modified to include a penalty for the variability in the stream of rewards.
Filar, Jerzy A., Kallenberg, L. C. M., Lee, Huey-Miin +2 more
core +4 more sources

Quantile Markov Decision Processes

Operations Research, 2022
Title: Sequential Decision Making Using Quantiles The goal of a traditional Markov decision process (MDP) is to maximize the expectation of cumulative reward over a finite or infinite horizon. In many applications, however, a decision maker may be interested in optimizing a specific quantile of the cumulative reward. For example, a physician may want
Xiaocheng Li, Huaiyang Zhong, Margaret L. Brandeau +2 more
openaire +6 more sources

A Weighted Markov Decision Process [PDF]

Operations Research, 1992
The two most commonly considered reward criteria for Markov decision processes are the discounted reward and the long-term average reward. The first tends to “neglect” the future, concentrating on the short-term rewards, while the second one tends to do the opposite.
Krass, D, Filar, JA, Sinha, SS
openaire +4 more sources

Quantum logic gate synthesis as a Markov decision process

npj Quantum Information, 2023
Reinforcement learning has witnessed recent applications to a variety of tasks in quantum programming. The underlying assumption is that those tasks could be modeled as Markov decision processes (MDPs).
M. Sohaib Alam, Noah F. Berthusen, Peter P. Orth +2 more
doaj +1 more source

Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems Part 2—Applications in Transportation, Industries, Communications and Networking and More Topics

Machine Learning and Knowledge Extraction, 2021
The two-part series of papers provides a survey on recent advances in Deep Reinforcement Learning (DRL) for solving partially observable Markov decision processes (POMDP) problems.
Xuanchen Xiang, Simon Foo, Huanyu Zang
doaj +1 more source

Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems: Part 1—Fundamentals and Applications in Games, Robotics and Natural Language Processing

Machine Learning and Knowledge Extraction, 2021
The first part of a two-part series of papers provides a survey on recent advances in Deep Reinforcement Learning (DRL) applications for solving partially observable Markov decision processes (POMDP) problems.
Xuanchen Xiang, Simon Foo
doaj +1 more source

The Complexity of Markov Decision Processes [PDF]

Mathematics of Operations Research, 1987
We investigate the complexity of the classical problem of optimal policy computation in Markov decision processes. All three variants of the problem (finite horizon, infinite horizon discounted, and infinite horizon average cost) were known to be solvable in polynomial time by dynamic programming (finite horizon problems), linear programming, or ...
Christos H. Papadimitriou, John N. Tsitsiklis +1 more
openaire +1 more source

Robust Markov Decision Processes [PDF]

Mathematics of Operations Research, 2013
Markov decision processes (MDPs) are powerful tools for decision making in uncertain dynamic environments. However, the solutions of MDPs are of limited practical use because of their sensitivity to distributional model parameters, which are typically unknown and have to be estimated by the decision maker.
Wolfram Wiesemann, Daniel Kuhn, Berç Rustem +2 more
openaire +2 more sources

Health Status-Based Predictive Maintenance Decision-Making via LSTM and Markov Decision Process

Mathematics, 2022
Maintenance decision-making is essential to achieve safe and reliable operation with high performance for equipment. To avoid unexpected shutdown and increase machine life as well as system efficiency, it is fundamental to design an effective maintenance
Pan Zheng +4 more
doaj +1 more source

reinforcement learning
q-learning