Results 1 to 10 of about 45,486 (253)

Optimal maintenance of deteriorating equipment using semi-Markov decision processes and linear programming [PDF]

open access: yesInternational Journal of Industrial Engineering and Management
This paper considers a mathematical model analysing the deterioration of system equipment and available maintenance options. Under specific conditions on costs and transition probabilities of the model, the issue of ideal maintenance of the equipment by ...
Giannis Kechagias   +3 more
doaj   +3 more sources

A Faster-Than Relation for Semi-Markov Decision Processes [PDF]

open access: yesElectronic Proceedings in Theoretical Computer Science, 2020
When modeling concurrent or cyber-physical systems, non-functional requirements such as time are important to consider. In order to improve the timing aspects of a model, it is necessary to have some notion of what it means for a process to be faster ...
Mathias Ruggaard Pedersen   +2 more
doaj   +3 more sources

Deterministic policy gradient algorithms for semi‐Markov decision processes

open access: yesInternational Journal of Intelligent Systems, 2021
A large class of sequential decision‐making problems under uncertainty, with broad applications from preventive maintenance to event‐triggered control can be modeled in the framework of semi‐Markov decision processes (SMDPs).
A. H. Hosseinloo, M. Dahleh
semanticscholar   +1 more source

Some work and some play: microscopic and macroscopic approaches to labor and leisure. [PDF]

open access: yesPLoS Computational Biology, 2014
Given the option, humans and other animals elect to distribute their time between work and leisure, rather than choosing all of one and none of the other.
Ritwik K Niyogi   +2 more
doaj   +1 more source

Application of Generator-Electric Motor System for Emergency Propulsion of a Vessel in the Event of Loss of the Full Serviceability of the Diesel Main Engine

open access: yesEnergies, 2022
Oil tanker disasters have been a cause of major environmental disasters, with multi-generational impacts. One of the greatest hazards is damage to the propulsion system that causes the ship to turn sideways to a wave and lose stability, which in storm ...
Zbigniew Łosiewicz   +4 more
doaj   +1 more source

Discounted semi‐Markov decision processes: linear programming and policy iteration [PDF]

open access: yesStatistica Neerlandica, 1975
AbstractFor semi‐Markov decision processes with discounted rewards we derive the well known results regarding the structure of optimal strategies (nonrandomized, stationary Markov strategies) and the standard algorithms (linear programming, policy iteration). Our analysis is completely based on a primal linear programming formulation of the problem.
Wessels, J., van Nunen, J.A.E.E.
openaire   +3 more sources

Adaptive Honeypot Engagement through Reinforcement Learning of Semi-Markov Decision Processes [PDF]

open access: yesDecision and Game Theory for Security, 2019
A honeynet is a promising active cyber defense mechanism. It reveals the fundamental Indicators of Compromise (IoCs) by luring attackers to conduct adversarial behaviors in a controlled and monitored environment.
Linan Huang, Quanyan Zhu
semanticscholar   +1 more source

Application of the Pareto front for risk control in the transport system [PDF]

open access: yesMATEC Web of Conferences, 2019
The article describes the developed model of controlling the process of means of transport operation, in which the choice of control strategy is carried out using non-deterministic methods.
Sołtysiak Agnieszka, Migawa Klaudiusz
doaj   +1 more source

A Fast-Pivoting Algorithm for Whittle’s Restless Bandit Index

open access: yesMathematics, 2020
The Whittle index for restless bandits (two-action semi-Markov decision processes) provides an intuitively appealing optimal policy for controlling a single generic project that can be active (engaged) or passive (rested) at each decision epoch, and ...
José Niño-Mora
doaj   +1 more source

Reactive Reinforcement Learning in Asynchronous Environments

open access: yesFrontiers in Robotics and AI, 2018
The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or Semi-Markov Decision
Jaden B. Travnik   +6 more
doaj   +1 more source

Home - About - Disclaimer - Privacy