Results 11 to 20 of about 68,852 (252)

Decentralized Control of Partially Observable Markov Decision Processes using Belief Space Macro-actions [PDF]

open access: yes, 2015
The focus of this paper is on solving multi-robot planning problems in continuous spaces with partial observability. Decentralized partially observable Markov decision processes (Dec-POMDPs) are general models for multi-robot coordination problems, but ...
Agha-mohammadi, Ali-akbar   +3 more
core   +7 more sources

Some work and some play: microscopic and macroscopic approaches to labor and leisure. [PDF]

open access: yesPLoS Computational Biology, 2014
Given the option, humans and other animals elect to distribute their time between work and leisure, rather than choosing all of one and none of the other.
Ritwik K Niyogi   +2 more
doaj   +1 more source

Application of Generator-Electric Motor System for Emergency Propulsion of a Vessel in the Event of Loss of the Full Serviceability of the Diesel Main Engine

open access: yesEnergies, 2022
Oil tanker disasters have been a cause of major environmental disasters, with multi-generational impacts. One of the greatest hazards is damage to the propulsion system that causes the ship to turn sideways to a wave and lose stability, which in storm ...
Zbigniew Łosiewicz   +4 more
doaj   +1 more source

Discounted semi‐Markov decision processes: linear programming and policy iteration [PDF]

open access: yesStatistica Neerlandica, 1975
AbstractFor semi‐Markov decision processes with discounted rewards we derive the well known results regarding the structure of optimal strategies (nonrandomized, stationary Markov strategies) and the standard algorithms (linear programming, policy iteration). Our analysis is completely based on a primal linear programming formulation of the problem.
Wessels, J., van Nunen, J.A.E.E.
openaire   +3 more sources

Application of the Pareto front for risk control in the transport system [PDF]

open access: yesMATEC Web of Conferences, 2019
The article describes the developed model of controlling the process of means of transport operation, in which the choice of control strategy is carried out using non-deterministic methods.
Sołtysiak Agnieszka, Migawa Klaudiusz
doaj   +1 more source

A Fast-Pivoting Algorithm for Whittle’s Restless Bandit Index

open access: yesMathematics, 2020
The Whittle index for restless bandits (two-action semi-Markov decision processes) provides an intuitively appealing optimal policy for controlling a single generic project that can be active (engaged) or passive (rested) at each decision epoch, and ...
José Niño-Mora
doaj   +1 more source

Reactive Reinforcement Learning in Asynchronous Environments

open access: yesFrontiers in Robotics and AI, 2018
The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or Semi-Markov Decision
Jaden B. Travnik   +6 more
doaj   +1 more source

Research of reliability and efficiency of technological processes of mechanical assembly production on the basis of the common semi-Markov model

open access: yesMATEC Web of Conferences, 2018
In the article a common semi-Markov mathematical model is considered that allows one to investigate the productivity and reliability of various technological processes of mechanical assembly production.
Rapatskiy Yuri   +5 more
doaj   +1 more source

Optimal Intervention in Semi-Markov-Based Asynchronous Probabilistic Boolean Networks

open access: yesComplexity, 2018
Synchronous probabilistic Boolean networks (PBNs) and generalized asynchronous PBNs have received significant attention over the past decade as a tool for modeling complex genetic regulatory networks.
Qiuli Liu   +3 more
doaj   +1 more source

Home - About - Disclaimer - Privacy