A Faster-Than Relation for Semi-Markov Decision Processes [PDF]
When modeling concurrent or cyber-physical systems, non-functional requirements such as time are important to consider. In order to improve the timing aspects of a model, it is necessary to have some notion of what it means for a process to be faster ...
Mathias Ruggaard Pedersen +2 more
doaj +10 more sources
Learning to maximize reward rate: a model based on semi-Markov decision processes [PDF]
When animals have to make a number of decisions during a limited time interval, they face a fundamental problem: how much time they should spend on each decision in order to achieve the maximum possible total outcome.
Arash Khodadadi +2 more
doaj +2 more sources
Continuous-Observation Partially Observable Semi-Markov Decision Processes for Machine Maintenance [PDF]
Partially observable semi-Markov decision processes (POSMDPs) provide a rich framework for planning under both state transition uncertainty and observation uncertainty. In this paper, we widen the literature on POSMDP by studying discrete-state, discrete-action yet continuous-observation POSMDPs.
Zhang, Mimi, Revie, Matthew
openaire +6 more sources
Decentralized Control of Partially Observable Markov Decision Processes using Belief Space Macro-actions [PDF]
The focus of this paper is on solving multi-robot planning problems in continuous spaces with partial observability. Decentralized partially observable Markov decision processes (Dec-POMDPs) are general models for multi-robot coordination problems, but ...
Agha-mohammadi, Ali-akbar +3 more
core +7 more sources
The article shows how the real operation process of an arbitrary device installed in a marine power plant can be controlled using a four-state semi-Markov process as the model of that operation, which describes the transition process of ...
Girtler Jerzy, Rudnicki Jacek
doaj +1 more source
Some work and some play: microscopic and macroscopic approaches to labor and leisure. [PDF]
Given the option, humans and other animals elect to distribute their time between work and leisure, rather than choosing all of one and none of the other.
Ritwik K Niyogi +2 more
doaj +1 more source
Oil tanker accidents have caused major environmental disasters with multi-generational impacts. One of the greatest hazards is damage to the propulsion system that causes the ship to turn sideways to a wave and lose stability, which in storm ...
Zbigniew Łosiewicz +4 more
doaj +1 more source
Application of the Pareto front for risk control in the transport system [PDF]
The article describes the developed model of controlling the process of means of transport operation, in which the choice of control strategy is carried out using non-deterministic methods.
Sołtysiak Agnieszka, Migawa Klaudiusz
doaj +1 more source
Fast Two-Stage Computation of an Index Policy for Multi-Armed Bandits with Setup Delays
We consider the multi-armed bandit problem with penalties for switching that include setup delays and costs, extending the former results of the author for the special case with no switching delays.
José Niño-Mora
doaj +1 more source
Hierarchical dialogue optimization using semi-Markov decision processes [PDF]
This paper addresses the problem of dialogue optimization on large search spaces. To this end, we propose to learn dialogue strategies using multiple semi-Markov decision processes and hierarchical reinforcement learning. This approach factorizes state variables and actions in order to learn a hierarchy of policies. Our experiments
Cuayáhuitl, Heriberto +3 more
openaire +2 more sources

