Results 131 to 140 of about 937,585 (275)
Finite Model Approximations for Partially Observed Markov Decision Processes with Discounted Cost [PDF]
We consider finite model approximations of discrete-time partially observed Markov decision processes (POMDPs) under the discounted cost criterion. After converting the original partially observed stochastic control problem to a fully observed one on the belief space, the finite models are obtained through the uniform quantization of the state and ...
arxiv
On the adaptive control of a class of partially observed Markov decision processes
AbstractThis paper is concerned with the adaptive control problem, over the infinite horizon, for partially observable Markov decision processes whose transition functions are parameterized by an unknown vector. We treat finite models and impose relatively mild assumptions on the transition function.
Hsu, Shun-Pin, Arapostathis, Ari
openaire +3 more sources
The development of artificial intelligence (AI) game agents that use deep reinforcement learning (DRL) algorithms to process visual information for decision-making has emerged as a key research focus in both academia and industry.
Zihao Cui+4 more
doaj +1 more source
Opportunistic Spectrum Access in Self-Similar Primary Traffic
We take a stochastic optimization approach to opportunity tracking and access in self-similar primary traffic. Based on a multiple time-scale hierarchical Markovian model, we formulate opportunity tracking and access in self-similar primary traffic as a ...
Xiangyang Xiao, Qing Zhao, Keqin Liu
doaj +1 more source
Pulse‐Level Quantum Robust Control with Diffusion‐Based Reinforcement Learning
This paper proposes a diffusion‐based reinforcement learning method for pulse‐level quantum robust control (PQC‐DBRL) to enhance the robustness of pulse‐level quantum gate control. When evaluated across different Hamiltonian variations, PQC‐DBRL shows a smaller fidelity variance compared to GRAPE, indicating higher robustness against system parameter ...
Yuanjing Zhang+3 more
wiley +1 more source
This paper uses a numerical landscape evolution model to reconstruct the topographic history of Madagascar since the Late Cretaceous. The model is optimised by balancing the volumes of onshore erosion and offshore sedimentation; the former is predicted with erosion laws and based on uplift history inferred from elevated planation surfaces.
Ruohong Jiao+4 more
wiley +1 more source
Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes [PDF]
Rajesh P. N. Rao
openalex +1 more source
Abstract Academic libraries, institutions, and publishers are interested in predicting future publishing output to help evaluate publishing agreements. Current predictive models are overly simplistic and provide inaccurate predictions. This paper presents Bayesian and frequentist statistical models to predict future article counts and costs.
Philip M. Dixon, Eric Schares
wiley +1 more source
To solve the problem of multi-target hunting by an unmanned surface vehicle (USV) fleet, a hunting algorithm based on multi-agent reinforcement learning is proposed.
Jiawei Xia+5 more
doaj
Partially Observable Markov Decision Process for Recommender Systems
We report the "Recurrent Deterioration" (RD) phenomenon observed in online recommender systems. The RD phenomenon is reflected by the trend of performance degradation when the recommendation model is always trained based on users' feedbacks of the previous recommendations.
Lu, Zhongqi, Yang, Qiang
openaire +2 more sources