Results 11 to 20 of about 67,931 (252)
Optimal maintenance of deteriorating equipment using semi-Markov decision processes and linear programming [PDF]
This paper considers a mathematical model analysing the deterioration of system equipment and available maintenance options. Under specific conditions on costs and transition probabilities of the model, the issue of ideal maintenance of the equipment by ...
Giannis Kechagias +3 more
doaj +1 more source
A Fast-Pivoting Algorithm for Whittle’s Restless Bandit Index
The Whittle index for restless bandits (two-action semi-Markov decision processes) provides an intuitively appealing optimal policy for controlling a single generic project that can be active (engaged) or passive (rested) at each decision epoch, and ...
José Niño-Mora
doaj +1 more source
Reactive Reinforcement Learning in Asynchronous Environments
The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or Semi-Markov Decision
Jaden B. Travnik +6 more
doaj +1 more source
Randomized and Relaxed Strategies in Continuous-Time Markov Decision Processes. [PDF]
One of the goals of this article is to describe a wide class of control strategies, which includes the traditional relaxed strategies, as well as the so called randomized strategies which appeared earlier only in the framework of semi-Markov decision ...
Piunovskiy, AB
core +1 more source
In the article a common semi-Markov mathematical model is considered that allows one to investigate the productivity and reliability of various technological processes of mechanical assembly production.
Rapatskiy Yuri +5 more
doaj +1 more source
A survey on semi-Markov decision processes [PDF]
This paper is a survey on semi-Markov decision processes (SMDPs). We present the background, the signi cance, and the research actuality of the in nite horizon expected discounted reward criterion, the long-run expected average reward criterion, the nite horizon expected reward criterion, the expected rst passage reward criterion, the probability ...
YongHui HUANG, XianPing GUO
openaire +1 more source
Optimal Intervention in Semi-Markov-Based Asynchronous Probabilistic Boolean Networks
Synchronous probabilistic Boolean networks (PBNs) and generalized asynchronous PBNs have received significant attention over the past decade as a tool for modeling complex genetic regulatory networks.
Qiuli Liu +3 more
doaj +1 more source
Cost rate heuristics for semi-Markov decision processes
In response to the computational complexity of the dynamic programming/backwards induction approach to the development of optimal policies for semi-Markov decision processes, we propose a class of heuristics resulting from an inductive process which proceeds forwards in time.
Glazebrook, K.D. +2 more
openaire +3 more sources
Timed Comparisons of Semi-Markov Processes [PDF]
Semi-Markov processes are Markovian processes in which the firing time of the transitions is modelled by probabilistic distributions over positive reals interpreted as the probability of firing a transition at a certain moment in time.
Bacci, Giorgio +4 more
core +2 more sources
Abstract When deploying mobile robots in real‐world scenarios, such as airports, train stations, hospitals, and schools, collisions with pedestrians are intolerable and catastrophic. Motion safety becomes one of the most fundamental requirements for mobile robots.
Zhiqian Zhou +7 more
wiley +1 more source

