Results 1 to 10 of about 45,486 (253)
Optimal maintenance of deteriorating equipment using semi-Markov decision processes and linear programming [PDF]
This paper considers a mathematical model analysing the deterioration of system equipment and available maintenance options. Under specific conditions on costs and transition probabilities of the model, the issue of ideal maintenance of the equipment by ...
Giannis Kechagias +3 more
doaj +3 more sources
A Faster-Than Relation for Semi-Markov Decision Processes [PDF]
When modeling concurrent or cyber-physical systems, non-functional requirements such as time are important to consider. In order to improve the timing aspects of a model, it is necessary to have some notion of what it means for a process to be faster ...
Mathias Ruggaard Pedersen +2 more
doaj +3 more sources
Deterministic policy gradient algorithms for semi‐Markov decision processes
A large class of sequential decision‐making problems under uncertainty, with broad applications from preventive maintenance to event‐triggered control can be modeled in the framework of semi‐Markov decision processes (SMDPs).
A. H. Hosseinloo, M. Dahleh
semanticscholar +1 more source
Some work and some play: microscopic and macroscopic approaches to labor and leisure. [PDF]
Given the option, humans and other animals elect to distribute their time between work and leisure, rather than choosing all of one and none of the other.
Ritwik K Niyogi +2 more
doaj +1 more source
Oil tanker disasters have been a cause of major environmental disasters, with multi-generational impacts. One of the greatest hazards is damage to the propulsion system that causes the ship to turn sideways to a wave and lose stability, which in storm ...
Zbigniew Łosiewicz +4 more
doaj +1 more source
Discounted semi‐Markov decision processes: linear programming and policy iteration [PDF]
AbstractFor semi‐Markov decision processes with discounted rewards we derive the well known results regarding the structure of optimal strategies (nonrandomized, stationary Markov strategies) and the standard algorithms (linear programming, policy iteration). Our analysis is completely based on a primal linear programming formulation of the problem.
Wessels, J., van Nunen, J.A.E.E.
openaire +3 more sources
Adaptive Honeypot Engagement through Reinforcement Learning of Semi-Markov Decision Processes [PDF]
A honeynet is a promising active cyber defense mechanism. It reveals the fundamental Indicators of Compromise (IoCs) by luring attackers to conduct adversarial behaviors in a controlled and monitored environment.
Linan Huang, Quanyan Zhu
semanticscholar +1 more source
Application of the Pareto front for risk control in the transport system [PDF]
The article describes the developed model of controlling the process of means of transport operation, in which the choice of control strategy is carried out using non-deterministic methods.
Sołtysiak Agnieszka, Migawa Klaudiusz
doaj +1 more source
A Fast-Pivoting Algorithm for Whittle’s Restless Bandit Index
The Whittle index for restless bandits (two-action semi-Markov decision processes) provides an intuitively appealing optimal policy for controlling a single generic project that can be active (engaged) or passive (rested) at each decision epoch, and ...
José Niño-Mora
doaj +1 more source
Reactive Reinforcement Learning in Asynchronous Environments
The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or Semi-Markov Decision
Jaden B. Travnik +6 more
doaj +1 more source

