Markov and semi-Markov decision processes


A Semi-Markov Decision Processes Modeling Algorithm for Continuous State Space Environments Using k-Certainty Exploration Method and Fuzzy-ART

TRANSACTIONS OF THE JAPAN SOCIETY OF MECHANICAL ENGINEERS Series C, 2005
Takeshi TATEYAMA, Seiichi KAWATA

Decentralized Control of Partially Observable Markov Decision Processes using Belief Space Macro-actions

2015
The focus of this paper is on solving multi-robot planning problems in continuous spaces with partial observability. Decentralized partially observable Markov decision processes (Dec-POMDPs) are general models for multi-robot coordination problems, but ...
Agha-mohammadi, Ali-akbar +3 more

Some work and some play: microscopic and macroscopic approaches to labor and leisure

PLoS Computational Biology, 2014
Given the option, humans and other animals elect to distribute their time between work and leisure, rather than choosing all of one and none of the other.
Ritwik K Niyogi, Peter Shizgal, Peter Dayan +2 more

Application of Generator-Electric Motor System for Emergency Propulsion of a Vessel in the Event of Loss of the Full Serviceability of the Diesel Main Engine

Energies, 2022
Oil tanker disasters have been a cause of major environmental disasters, with multi-generational impacts. One of the greatest hazards is damage to the propulsion system that causes the ship to turn sideways to a wave and lose stability, which in storm ...
Zbigniew Łosiewicz +4 more

Discounted semi‐Markov decision processes: linear programming and policy iteration

Statistica Neerlandica, 1975
For semi‐Markov decision processes with discounted rewards we derive the well-known results regarding the structure of optimal strategies (nonrandomized, stationary Markov strategies) and the standard algorithms (linear programming, policy iteration). Our analysis is completely based on a primal linear programming formulation of the problem.
Wessels, J., van Nunen, J.A.E.E.

Application of the Pareto front for risk control in the transport system

MATEC Web of Conferences, 2019
The article describes the developed model of controlling the process of means of transport operation, in which the choice of control strategy is carried out using non-deterministic methods.
Sołtysiak Agnieszka, Migawa Klaudiusz

A Fast-Pivoting Algorithm for Whittle’s Restless Bandit Index

Mathematics, 2020
The Whittle index for restless bandits (two-action semi-Markov decision processes) provides an intuitively appealing optimal policy for controlling a single generic project that can be active (engaged) or passive (rested) at each decision epoch, and ...
José Niño-Mora

Reactive Reinforcement Learning in Asynchronous Environments

Frontiers in Robotics and AI, 2018
The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or Semi-Markov Decision
Jaden B. Travnik +6 more

Research of reliability and efficiency of technological processes of mechanical assembly production on the basis of the common semi-Markov model

MATEC Web of Conferences, 2018
In the article a common semi-Markov mathematical model is considered that allows one to investigate the productivity and reliability of various technological processes of mechanical assembly production.
Rapatskiy Yuri +5 more

Optimal Intervention in Semi-Markov-Based Asynchronous Probabilistic Boolean Networks

Complexity, 2018
Synchronous probabilistic Boolean networks (PBNs) and generalized asynchronous PBNs have received significant attention over the past decade as a tool for modeling complex genetic regulatory networks.
Qiuli Liu +3 more

mathematics
computer science
markov decision process

markov process
statistics
markov chain

mathematical optimization
machine learning
artificial intelligence