Results 1 to 10 of about 120,210 (259)
Optimal Time-Abstract Schedulers for CTMDPs and Markov Games [PDF]
We study time-bounded reachability in continuous-time Markov decision processes for time-abstract scheduler classes. Such reachability problems play a paramount role in dependability analysis and the modelling of manufacturing and queueing systems ...
Markus Rabe, Sven Schewe
doaj +4 more sources
Q-LVS: A Q-Learning-based Algorithm for Video Streaming in Peer-to-Peer Networks Considering a Token-Based Incentive Mechanism [PDF]
Peer-to-peer video streaming has reached great attention during recent years. Video streaming in peer-to-peer networks is a good way to stream video on the Internet due to the high scalability, high video quality, and low bandwidth requirements.
Z. Imanimehr
doaj +1 more source
This paper is concerned with the asymptotic optimality of quantized stationary policies for continuous-time Markov decision processes (CTMDPs) in Polish spaces with state-dependent discount factors, where the transition rates and reward rates are allowed
Xiao Wu, Yanqiu Tang
doaj +1 more source
This paper is aimed at developing a stochastic volatility model that is useful to explain the dynamics of the returns of gold, silver, and platinum during the period 1994–2019.
Martha Carpinteyro +2 more
doaj +1 more source
Effects of Sampling and Prediction Horizon in Reinforcement Learning
Plain reinforcement learning (RL) may be prone to loss of convergence, constraint violation, unexpected performance, etc. Commonly, RL agents undergo extensive learning stages to achieve proper functionality.
Pavel Osinenko, Dmitrii Dobriborsci
doaj +1 more source
A Dynamic Programming Algorithm for Finding an Optimal Sequence of Informative Measurements
An informative measurement is the most efficient way to gain information about an unknown state. We present a first-principles derivation of a general-purpose dynamic programming algorithm that returns an optimal sequence of informative measurements by ...
Peter N. Loxley, Ka-Wai Cheung
doaj +1 more source
Quasi-Deterministic Processes with Monotonic Trajectories and Unsupervised Machine Learning
This paper aims to consider approximation-estimation tests for decision-making by machine-learning methods, and integral-estimation tests are defined, which is a generalization for the continuous case.
Andrey V. Orekhov
doaj +1 more source
Artificial intelligence methods and technologies are increasingly included in human's everyday life. Managing actors in the context of their activities, from the planning stage to the decision-making stage, are faced with the need to operate with big ...
Gocha Ugulava
doaj +1 more source
Recent high-dimensional single-cell technologies such as mass cytometry are enabling time series experiments to monitor the temporal evolution of cell state distributions and to identify dynamically important cell states, such as fate decision states in ...
Jake P Taylor-King +3 more
doaj +1 more source
Oil tanker disasters have been a cause of major environmental disasters, with multi-generational impacts. One of the greatest hazards is damage to the propulsion system that causes the ship to turn sideways to a wave and lose stability, which in storm ...
Zbigniew Łosiewicz +4 more
doaj +1 more source

