Posterior Sampling-based Online Learning for Episodic POMDPs

open access: green, 2023
Dengwang Tang   +4 more
openalex   +1 more source

Stochastic Shortest Path with Energy Constraints in POMDPs

open access: yes, 2016
We consider partially observable Markov decision processes (POMDPs) with a set of target states and positive integer costs associated with every transition.
Brázdil, Tomáš   +4 more
core  

Reinforcement Learning for Robust Header Compression (ROHC) Under Model Uncertainty

open access: yes, IEEE Transactions on Machine Learning in Communications and Networking
Robust header compression (ROHC), critically positioned between network and MAC layers, plays an important role in modern wireless communication networks for improving data efficiency.
Shusen Jing, Songyang Zhang, Zhi Ding
doaj   +1 more source

Multiagent Rollout and Policy Iteration for POMDP with Application to Multi-Robot Repair Problems

open access: green, 2020
Sushmita Bhattacharya   +4 more
openalex   +1 more source

Safe POMDP Online Planning via Shielding

open access: yes, 2024 IEEE International Conference on Robotics and Automation (ICRA)
Partially observable Markov decision processes (POMDPs) have been widely used in many robotic applications for sequential decision-making under uncertainty. POMDP online planning algorithms such as Partially Observable Monte-Carlo Planning (POMCP) can solve very large POMDPs with the goal of maximizing the expected return.
Sheng, S, Parker, D, Feng, L
openaire   +2 more sources

Privacy Verification in POMDPs via Barrier Certificates

open access: green, 2018
Mohamadreza Ahmadi   +3 more
openalex   +1 more source

Vectorized Online POMDP Planning

open access: yes
9 pages, 3 figures.
Hoerger, Marcus   +2 more
openaire   +2 more sources

Adaptive cache policy optimization through deep reinforcement learning in dynamic cellular networks

open access: yes, Intelligent and Converged Networks
We explore the use of caching both at the network edge and within User Equipment (UE) to alleviate traffic load of wireless networks. We develop a joint cache placement and delivery policy that maximizes the Quality of Service (QoS) while simultaneously ...
Ashvin Srinivasan   +3 more
doaj   +1 more source