Semi-Markov decision process based congestion control algorithm for video transmission
Due to the problem that the congestion control algorithm of transmission control protocol (TCP) cannot meet the requirements of quality of experience for Internet video transmission, a semi-Markov decision process based conges-tion control algorithm for ...
Bo TIAN, Yi-min YANG, Shu-ting CAI
doaj +2 more sources
Markov-Decision-Process-Assisted Consumer Scheduling in a Networked Smart Grid
Many recently built residential houses and factories are equipped with facilities for converting energy from green sources, such as solar energy, into electricity.
Zhi Liu+5 more
doaj +1 more source
A New Markov Decision Process Based Behavioral Prediction System for Airborne Crews
In order to ensure the normal and stable flights in the aircraft, a variety of sensors and corresponding instrumentation systems have been applied on the aircraft to monitor/control the current flight status, and the resulted data ensure the flight ...
Yaozhong Zhang+4 more
doaj +1 more source
On the Expressivity of Multidimensional Markov Reward [PDF]
We consider the expressivity of Markov rewards in sequential decision making under uncertainty. We view reward functions in Markov Decision Processes (MDPs) as a means to characterize desired behaviors of agents. Assuming desired behaviors are specified as a set of acceptable policies, we investigate if there exists a scalar or multidimensional Markov ...
arxiv
A distance for probability spaces, and long-term values in Markov Decision Processes and Repeated Games [PDF]
Given a finite set $K$, we denote by $X=\Delta(K)$ the set of probabilities on $K$ and by $Z=\Delta_f(X)$ the set of Borel probabilities on $X$ with finite support.
Renault, Jérôme, Venel, Xavier
core
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making [PDF]
The Markov assumption (MA) is fundamental to the empirical validity of reinforcement learning. In this paper, we propose a novel Forward-Backward Learning procedure to test MA in sequential decision making. The proposed test does not assume any parametric form on the joint distribution of the observed data and plays an important role for identifying ...
arxiv
Design for reusability and product reuse under radical innovation
Many industries, including consumer electronics and telecommunications equipment, are characterized with short product life-cycles, constant technological innovations, rapid product introductions, and fast obsolescence.
Vedat Verter+2 more
doaj
Using Markov Decision Process Model for Sustainability Assessment in Industry 4.0
The manufacturing industry is facing increasing challenges to improve sustainability performance by using Industry 4.0 technologies like big data analytics, the internet of things, and digital twins, considering their potentials.
Majid Sodachi+2 more
doaj +1 more source
Adaptive Channel Recommendation For Opportunistic Spectrum Access [PDF]
We propose a dynamic spectrum access scheme where secondary users recommend "good" channels to each other and access accordingly. We formulate the problem as an average reward based Markov decision process. We show the existence of the optimal stationary
Chen, Xu, Huang, Jianwei, Li, Husheng
core
Approximating Euclidean by Imprecise Markov Decision Processes [PDF]
Euclidean Markov decision processes are a powerful tool for modeling control problems under uncertainty over continuous domains. Finite state imprecise, Markov decision processes can be used to approximate the behavior of these infinite models. In this paper we address two questions: first, we investigate what kind of approximation guarantees are ...
arxiv