Semi-markov decision processes - Open Access .click

Results 11 to 20 of about 67,931 (252)

Optimal maintenance of deteriorating equipment using semi-Markov decision processes and linear programming [PDF]

International Journal of Industrial Engineering and Management
This paper considers a mathematical model analysing the deterioration of system equipment and available maintenance options. Under specific conditions on costs and transition probabilities of the model, the issue of ideal maintenance of the equipment by ...
Giannis Kechagias +3 more
doaj +1 more source

A Fast-Pivoting Algorithm for Whittle’s Restless Bandit Index

Mathematics, 2020
The Whittle index for restless bandits (two-action semi-Markov decision processes) provides an intuitively appealing optimal policy for controlling a single generic project that can be active (engaged) or passive (rested) at each decision epoch, and ...
José Niño-Mora
doaj +1 more source

Reactive Reinforcement Learning in Asynchronous Environments

Frontiers in Robotics and AI, 2018
The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or Semi-Markov Decision
Jaden B. Travnik +6 more
doaj +1 more source

Randomized and Relaxed Strategies in Continuous-Time Markov Decision Processes. [PDF]

, 2015
One of the goals of this article is to describe a wide class of control strategies, which includes the traditional relaxed strategies, as well as the so called randomized strategies which appeared earlier only in the framework of semi-Markov decision ...
Piunovskiy, AB
core +1 more source

Research of reliability and efficiency of technological processes of mechanical assembly production on the basis of the common semi-Markov model

MATEC Web of Conferences, 2018
In the article a common semi-Markov mathematical model is considered that allows one to investigate the productivity and reliability of various technological processes of mechanical assembly production.
Rapatskiy Yuri +5 more
doaj +1 more source

A survey on semi-Markov decision processes [PDF]

SCIENTIA SINICA Mathematica, 2015
This paper is a survey on semi-Markov decision processes (SMDPs). We present the background, the signi cance, and the research actuality of the in nite horizon expected discounted reward criterion, the long-run expected average reward criterion, the nite horizon expected reward criterion, the expected rst passage reward criterion, the probability ...
YongHui HUANG, XianPing GUO
openaire +1 more source

Optimal Intervention in Semi-Markov-Based Asynchronous Probabilistic Boolean Networks

Complexity, 2018
Synchronous probabilistic Boolean networks (PBNs) and generalized asynchronous PBNs have received significant attention over the past decade as a tool for modeling complex genetic regulatory networks.
Qiuli Liu +3 more
doaj +1 more source

Cost rate heuristics for semi-Markov decision processes

Journal of Applied Probability, 1992
In response to the computational complexity of the dynamic programming/backwards induction approach to the development of optimal policies for semi-Markov decision processes, we propose a class of heuristics resulting from an inductive process which proceeds forwards in time.
Glazebrook, K.D., Bailey, Michael P., Whitaker, Lyn R. +2 more
openaire +3 more sources

Timed Comparisons of Semi-Markov Processes [PDF]

, 2017
Semi-Markov processes are Markovian processes in which the firing time of the transitions is modelled by probabilistic distributions over positive reals interpreted as the probability of firing a transition at a certain moment in time.
Bacci, Giorgio +4 more
core +2 more sources

A safe reinforcement learning approach for autonomous navigation of mobile robots in dynamic environments

CAAI Transactions on Intelligence Technology, EarlyView., 2023
Abstract When deploying mobile robots in real‐world scenarios, such as airports, train stations, hospitals, and schools, collisions with pedestrians are intolerable and catastrophic. Motion safety becomes one of the most fundamental requirements for mobile robots.
Zhiqian Zhou +7 more
wiley +1 more source

markov and semi-markov decision processes
16. peace & justice
semi-markov decision process

dynamic programming
markov renewal processes, semi-markov processes
linear programming

4. education
optimal stochastic control
reinforcement learning