Results 11 to 20 of about 68,852 (252)
Takeshi TATEYAMA, Seiichi KAWATA
openaire +3 more sources
Decentralized Control of Partially Observable Markov Decision Processes using Belief Space Macro-actions [PDF]
The focus of this paper is on solving multi-robot planning problems in continuous spaces with partial observability. Decentralized partially observable Markov decision processes (Dec-POMDPs) are general models for multi-robot coordination problems, but ...
Agha-mohammadi, Ali-akbar +3 more
core +7 more sources
Some work and some play: microscopic and macroscopic approaches to labor and leisure. [PDF]
Given the option, humans and other animals elect to distribute their time between work and leisure, rather than choosing all of one and none of the other.
Ritwik K Niyogi +2 more
doaj +1 more source
Oil tanker disasters have been a cause of major environmental disasters, with multi-generational impacts. One of the greatest hazards is damage to the propulsion system that causes the ship to turn sideways to a wave and lose stability, which in storm ...
Zbigniew Łosiewicz +4 more
doaj +1 more source
Discounted semi‐Markov decision processes: linear programming and policy iteration [PDF]
AbstractFor semi‐Markov decision processes with discounted rewards we derive the well known results regarding the structure of optimal strategies (nonrandomized, stationary Markov strategies) and the standard algorithms (linear programming, policy iteration). Our analysis is completely based on a primal linear programming formulation of the problem.
Wessels, J., van Nunen, J.A.E.E.
openaire +3 more sources
Application of the Pareto front for risk control in the transport system [PDF]
The article describes the developed model of controlling the process of means of transport operation, in which the choice of control strategy is carried out using non-deterministic methods.
Sołtysiak Agnieszka, Migawa Klaudiusz
doaj +1 more source
A Fast-Pivoting Algorithm for Whittle’s Restless Bandit Index
The Whittle index for restless bandits (two-action semi-Markov decision processes) provides an intuitively appealing optimal policy for controlling a single generic project that can be active (engaged) or passive (rested) at each decision epoch, and ...
José Niño-Mora
doaj +1 more source
Reactive Reinforcement Learning in Asynchronous Environments
The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or Semi-Markov Decision
Jaden B. Travnik +6 more
doaj +1 more source
In the article a common semi-Markov mathematical model is considered that allows one to investigate the productivity and reliability of various technological processes of mechanical assembly production.
Rapatskiy Yuri +5 more
doaj +1 more source
Optimal Intervention in Semi-Markov-Based Asynchronous Probabilistic Boolean Networks
Synchronous probabilistic Boolean networks (PBNs) and generalized asynchronous PBNs have received significant attention over the past decade as a tool for modeling complex genetic regulatory networks.
Qiuli Liu +3 more
doaj +1 more source

