Results 141 to 150 of about 81,989 (164)
Some of the next articles are maybe not open access.
Semi-Markov decision processes with variance minimization criterion
4OR, 2014zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Wei, Qingda, Guo, Xianping
openaire +2 more sources
Time-average optimal constrained semi-Markov decision processes
Advances in Applied Probability, 1986Optimal causal policies maximizing the time-average reward over a semi-Markov decision process (SMDP), subject to a hard constraint on a time-average cost, are considered. Rewards and costs depend on the state and action, and contain running as well as switching components. It is supposed that the state space of the SMDP is finite, and the action space
Beutler, Frederick J., Ross, Keith W.
openaire +2 more sources
Semi-Markov Decision Process With Partial Information for Maintenance Decisions
IEEE Transactions on Reliability, 2014A critical factor that prevents optimal scheduling of maintenance interventions is the uncertainty regarding the current condition of the asset under consideration, as well as the rate at which deterioration takes place. However, current maintenance modeling and optimization techniques assume that the condition of the asset is either known, or assumed ...
Rengarajan Srinivasan +1 more
openaire +1 more source
Mean-Variance Problems for Finite Horizon Semi-Markov Decision Processes
Applied Mathematics & Optimization, 2014zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Huang, Yonghui, Guo, Xianping
openaire +2 more sources
Constrained Semi-Markov decision processes with average rewards
ZOR Zeitschrift f�r Operations Research Mathematical Methods of Opeartions Research, 1994This paper deals with constrained average reward semi-Markov decision processes with finite state and action sets. Two average reward criteria are considered, namely time average and ratio average. The author proved the existence of optimal mixed stationary policies and showed, under the unichain condition, the existence of randomized stationary ...
openaire +2 more sources
Semi-Markov decision processes with polynomial reward
Journal of Applied Probability, 1982A semi-Markov decision process, with a denumerable multidimensional state space, is considered. At any given state only a finite number of actions can be taken to control the process. The immediate reward earned in one transition period is merely assumed to be bounded by a polynomial and a bound is imposed on a weighted moment of the next state reached
openaire +1 more source
Discrete-time equivalence for constrained semi-Markov decision processes
1985 24th IEEE Conference on Decision and Control, 1985A continuous-time average reward Markov decision process problem is most easily solved in terms of an equivalent discrete-time Markov decision process (DMDP); customary hypotheses include that the process is a Markov jump process with denumerable state space and bounded transition rates, that actions are chosen at the jump points of the process, and ...
Frederick Beutler, Keith Ross
openaire +1 more source
Constrained Discounted Semi-Markov Decision Processes
2002This paper reduces problems on the existence and the finding of optimal policies for multiple criterion discounted SMDPs to similar problems for MDPs. We prove this reduction and illustrate it by extending to SMDPs several results for constrained discounted MDPs.
openaire +1 more source
Uniformization for semi-Markov decision processes under stationary policies
Journal of Applied Probability, 1987Uniformization permits the replacement of a semi-Markov decision process (SMDP) by a Markov chain exhibiting the same average rewards for simple (non-randomized) policies. It is shown that various anomalies may occur, especially for stationary (randomized) policies; uniformization introduces virtual jumps with concomitant action changes not present in ...
Beutler, Frederick J., Ross, Keith W.
openaire +2 more sources
Reinforcement learning for semi-Markov decision processes with applications
2023This thesis focuses on semi-Markov decision processes and their connection with Reinforcement Learning via Q-learning technique. We start by discussing some general ideas around Machine Learning, Reinforcement Learning and Hierarchical Reinforcement Learning.
openaire +1 more source

