The Convergence of a Cooperation Markov Decision Process System [PDF]
In a general Markov decision progress system, only one agent’s learning evolution is considered. However, considering the learning evolution of a single agent in many problems has some limitations, more and more applications involve multi-agent.
Xiaoling Mo, Daoyun Xu, Zufeng Fu
doaj +2 more sources
Improving RED algorithm congestion control by using the Markov decision process [PDF]
Congestion control plays an essential role on the internet to manage overload, which affects data transmission performance. The random early detection (RED) algorithm belongs to active queue management (AQM), which is used to manage internet traffic. The
Amar A. Mahawish, Hassan J. Hassan
doaj +2 more sources
Data-Driven Markov Decision Process Approximations for Personalized Hypertension Treatment Planning [PDF]
Background: Markov decision process (MDP) models are powerful tools. They enable the derivation of optimal treatment policies but may incur long computational times and generate decision rules that are challenging to interpret by physicians.
Greggory J. Schell PhD +4 more
doaj +2 more sources
Intelligent Sensing in Dynamic Environments Using Markov Decision Process [PDF]
In a network of low-powered wireless sensors, it is essential to capture as many environmental events as possible while still preserving the battery life of the sensor node. This paper focuses on a real-time learning algorithm to extend the lifetime of a
Asad M. Madni +3 more
doaj +2 more sources
Multi-Vehicle Tracking via Real-Time Detection Probes and a Markov Decision Process Policy [PDF]
Online multi-object tracking (MOT) has broad applications in time-critical video analysis scenarios such as advanced driver-assistance systems (ADASs) and autonomous driving.
Yi Zou +3 more
doaj +2 more sources
Surveillance imaging for patients with head and neck cancer treated with definitive radiotherapy: A partially observed Markov decision process model. [PDF]
Ng SP +14 more
europepmc +3 more sources
Application of Reinforcement Learning to Solve Rubrik’s Cube with Markov Decision Process
The Rubik's Cube is a complex puzzle with an enormous number of possible configurations, making it a challenging problem for both humans and computational methods to solve.
Defni +6 more
doaj +3 more sources
Unusual Japanese Decision-making: Markov Process of Decision-making [PDF]
There is not enough research on decision-making and management process of organizations that do not make a profit. Economists should include healthy companies, as well as sick ones, the same way doctors do.
Nobumichi WATAHIKI +4 more
doaj +1 more source
Dialogue Strateqy Integrating Markov Decision Process and Information Entropy [PDF]
Dialogue strategy is an important component in the human-machine dialogue system,and its performance directly affects the performance of the dialogue system.In a cold start scenario without any data,it is complex and time-consuming to collect dialogue ...
ZHU Yingbo, ZHAO Yangyang, WANG Pei, YIN Kai, WANG Zhenyu
doaj +1 more source
Quantile Markov Decision Processes
Title: Sequential Decision Making Using Quantiles The goal of a traditional Markov decision process (MDP) is to maximize the expectation of cumulative reward over a finite or infinite horizon. In many applications, however, a decision maker may be interested in optimizing a specific quantile of the cumulative reward. For example, a physician may want
Xiaocheng Li +2 more
openaire +6 more sources

