Results 131 to 140 of about 81,989 (164)
Some of the next articles are maybe not open access.

Semi-Markov Decision Processes

Probability in the Engineering and Informational Sciences, 2007
Considered are semi-Markov decision processes (SMDPs) with finite state and action spaces. We study two criteria: the expected average reward per unit time subject to a sample path constraint on the average cost per unit time and the expected time-average variability.
M. Baykal-Gürsoy, K. Gürsoy
openaire   +2 more sources

Risk-aware semi-Markov decision processes

2017 IEEE 56th Annual Conference on Decision and Control (CDC), 2017
In this work we construct a basic theory of risk-aware continuous-time Markov decision processes, and even more broadly, that of semi-Markov decision processes. Methods that account for the preferences of risk-aware agents have been introduced and studied in the context of discrete time problems, however, there has been virtually no such development ...
Jukka Isohataia, William B. Haskell
openaire   +1 more source

Policy Gradient Semi-markov Decision Process

2008 20th IEEE International Conference on Tools with Artificial Intelligence, 2008
This paper proposes a simulation-based algorithm for optimizing the average reward in a parameterized continuous-time, finite-state semi-Markov decision process (SMDP). Our contributions are twofold: First, we compute the approximate gradient of the average reward with respect to the parameters in SMDP controlled by parameterized stochastic policies ...
Ngo, Vien, Chung, TaeChoong
openaire   +1 more source

Optimum maintenance policy using semi-Markov decision processes

Electric Power Systems Research, 2006
A method is presented to solve for the optimum maintenance policy of repairable power equipment. The approach uses a continuous-time semi-Markov process (SMP) to first find the optimal maintenance rate for maximum availability of the equipment. Then a semi-Markov decision process (SMDP) is utilized to determine whether maintenance should be performed ...
Curtis L. Tomasevicz, Sohrab Asgarpoor
openaire   +1 more source

Average cost semi-markov decision processes

Journal of Applied Probability, 1970
The semi-Markov decision model is considered under the criterion of long-run average cost. A new criterion, which for any policy considers the limit of the expected cost incurred during the first n transitions divided by the expected length of the first n transitions, is considered.
openaire   +2 more sources

Generalized semi-Markov decision processes

Journal of Applied Probability, 1979
Various authors have derived the necessary and sufficient conditions for optimality in semi-Markov decision processes in which the state remains constant between jumps. In this paper similar results are presented for a generalized semi-Markov decision process in which the state varies between jumps according to a Markov process with continuous sample ...
openaire   +2 more sources

Semi-Markov Decision Processes with Unbounded Rewards

Management Science, 1973
We consider a semi-Markov decision process with arbitrary action space; the state space is the nonnegative integers. As in queueing systems, we assume that {0, 1, 2, …, n + N} is the set of states accessible from state n in one transition, where N is finite and independent of n.
openaire   +1 more source

Discounted semi-markov decision process in a semi-markov environment

Optimization, 1997
This paper presents the discounted semi-Markov decision process (SMDP) with Borel state space in a semi-Markov environment. It describes a system which behaves like a SMDP except that the system is influenced by its semi-Markov process environment. Following each state transition of the environment, the parameters of the SMDP changes.
openaire   +1 more source

Application of Semi-Markov Decision Process in Bridge Management

IABSE Reports, 2015
<p>The state-of-the-art Bridge Management Systems (BMSs) feature tightly coupled deterioration and preservation optimization model that enable determining the most cost-effective maintenance strategies at both the project and network levels. In other to improve deterioration model, many authors suggest the application of Weibull distribution for ...
Rade Hajdin   +2 more
openaire   +2 more sources

Denumerable Undiscounted Semi-Markov Decision Processes with Unbounded Rewards

Mathematics of Operations Research, 1983
This paper establishes the existence of a solution to the optimality equations in undis-counted semi-Markov decision models with countable state space, under conditions generalizing the hitherto obtained results. In particular, we merely require the existence of a finite set of states in which every pair of states can reach each other via some ...
Federgruen, A.   +2 more
openaire   +2 more sources

Home - About - Disclaimer - Privacy