Continuous-Time Markov Decision Processes with Exponential Utility [PDF]
In this paper, we consider a continuous-time Markov decision process (CTMDP) in Borel spaces, where the certainty equivalent with respect to the exponential utility of the total undiscounted cost is to be minimized. The cost rate is nonnegative.
Zhang, Y
core +7 more sources
Randomized and Relaxed Strategies in Continuous-Time Markov Decision Processes. [PDF]
One of the goals of this article is to describe a wide class of control strategies, which includes the traditional relaxed strategies, as well as the so called randomized strategies which appeared earlier only in the framework of semi-Markov decision ...
Piunovskiy, AB
core +5 more sources
Bisimulations and Logical Characterizations on Continuous-time Markov Decision Processes [PDF]
In this paper we study strong and weak bisimulation equivalences for continuous-time Markov decision processes (CTMDPs) and the logical characterizations of these relations with respect to the continuous-time stochastic logic (CSL).
Godskesen, Jens Chr. +2 more
core +6 more sources
Markov Decision Processes with Applications in Wireless Sensor Networks: A Survey [PDF]
Wireless sensor networks (WSNs) consist of autonomous and resource-limited devices. The devices cooperate to monitor one or more physical phenomena within an area of interest.
Alsheikh, Mohammad Abu +4 more
core +5 more sources
Discounted continuous-time constrained Markov decision processes in Polish spaces
This paper is devoted to studying constrained continuous-time Markov decision processes (MDPs) in the class of randomized policies depending on state histories. The transition rates may be unbounded, the reward and costs are admitted to be unbounded from
Guo, Xianping, Song, Xinyuan
core +4 more sources
Average optimality for continuous-time Markov decision processes in polish spaces
This paper is devoted to studying the average optimality in continuous-time Markov decision processes with fairly general state and action spaces. The criterion to be maximized is expected average rewards.
Guo, Xianping, Rieder, Ulrich
core +3 more sources
Constrained Continuous-Time Markov Decision Processes on the Finite Horizon [PDF]
zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Guo, Xianping, Huang, Yonghui, Zhang, Yi
core +4 more sources
Sufficiency of Markov Policies for Continuous-Time Jump Markov Decision Processes [PDF]
One of the basic facts known for discrete-time Markov decision processes is that, if the probability distribution of an initial state is fixed, then for every policy it is easy to construct a (randomized) Markov policy with the same marginal distributions of state-action pairs as for the original policy. This equality of marginal distributions implies
Eugene A. Feinberg +2 more
openaire +2 more sources
Impulsive Control for Continuous-Time Markov Decision Processes [PDF]
In this paper our objective is to study continuous-time Markov decision processes on a general Borel state space with both impulsive and continuous controls for the infinite time horizon discounted cost. The continuous-time controlled process is shown to be nonexplosive under appropriate hypotheses.
Dufour, François, Piunovskiy, Alexei B.
openaire +5 more sources
Policy Iteration for Continuous-Time Average Reward Markov Decision Processes in Polish Spaces
We study the policy iteration algorithm (PIA) for continuous-time jump Markov decision processes in general state and action spaces. The corresponding transition rates are allowed to be unbounded, and the reward rates may have neither upper nor lower ...
Quanxin Zhu +2 more
doaj +1 more source

