Results 111 to 120 of about 13,931 (229)
Posterior Sampling-based Online Learning for Episodic POMDPs [PDF]
Dengwang Tang +4 more
openalex +1 more source
Future memories are not needed for large classes of POMDPs [PDF]
Victor Cohen, Axel Parmentier
openalex +1 more source
Stochastic Shortest Path with Energy Constraints in POMDPs
We consider partially observable Markov decision processes (POMDPs) with a set of target states and positive integer costs associated with every transition.
Brázdil, Tomáš +4 more
core
Reinforcement Learning for Robust Header Compression (ROHC) Under Model Uncertainty
Robust header compression (ROHC), critically positioned between network and MAC layers, plays an important role in modern wireless communication networks for improving data efficiency.
Shusen Jing, Songyang Zhang, Zhi Ding
doaj +1 more source
Multiagent Rollout and Policy Iteration for POMDP with Application to Multi-Robot Repair Problems [PDF]
Sushmita Bhattacharya +4 more
openalex +1 more source
Safe POMDP Online Planning via Shielding
Partially observable Markov decision processes (POMDPs) have been widely used in many robotic applications for sequential decision-making under uncertainty. POMDP online planning algorithms such as Partially Observable Monte-Carlo Planning (POMCP) can solve very large POMDPs with the goal of maximizing the expected return.
Sheng, S, Parker, D, Feng, L
openaire +2 more sources
Privacy Verification in POMDPs via Barrier Certificates [PDF]
Mohamadreza Ahmadi +3 more
openalex +1 more source
Adaptive cache policy optimization through deep reinforcement learning in dynamic cellular networks
We explore the use of caching both at the network edge and within User Equipment (UE) to alleviate traffic load of wireless networks. We develop a joint cache placement and delivery policy that maximizes the Quality of Service (QoS) while simultaneously ...
Ashvin Srinivasan +3 more
doaj +1 more source

