Pomdp - Open Access .click

Results 111 to 120 of about 13,931 (229)

Policies used in "Deep reinforcement learning for the olfactory search POMDP: a quantitative benchmark"

, 2023
Aurore Loisy, R. A. Heinonen
openalex +1 more source

Posterior Sampling-based Online Learning for Episodic POMDPs [PDF]

, 2023
Dengwang Tang +4 more
openalex +1 more source

Future memories are not needed for large classes of POMDPs [PDF]

, 2022
Victor Cohen, Axel Parmentier
openalex +1 more source

Stochastic Shortest Path with Energy Constraints in POMDPs

, 2016
We consider partially observable Markov decision processes (POMDPs) with a set of target states and positive integer costs associated with every transition.
Brázdil, Tomáš +4 more
core

Reinforcement Learning for Robust Header Compression (ROHC) Under Model Uncertainty

IEEE Transactions on Machine Learning in Communications and Networking
Robust header compression (ROHC), critically positioned between network and MAC layers, plays an important role in modern wireless communication networks for improving data efficiency.
Shusen Jing, Songyang Zhang, Zhi Ding
doaj +1 more source

Multiagent Rollout and Policy Iteration for POMDP with Application to Multi-Robot Repair Problems [PDF]

, 2020
Sushmita Bhattacharya +4 more
openalex +1 more source

Safe POMDP Online Planning via Shielding

2024 IEEE International Conference on Robotics and Automation (ICRA)
Partially observable Markov decision processes (POMDPs) have been widely used in many robotic applications for sequential decision-making under uncertainty. POMDP online planning algorithms such as Partially Observable Monte-Carlo Planning (POMCP) can solve very large POMDPs with the goal of maximizing the expected return.
Sheng, S, Parker, D, Feng, L
openaire +2 more sources

Privacy Verification in POMDPs via Barrier Certificates [PDF]

, 2018
Mohamadreza Ahmadi, Bo Wu, Hai Lin, Ufuk Topcu +3 more
openalex +1 more source

Vectorized Online POMDP Planning

9 pages, 3 figures.
Hoerger, Marcus, Sudrajat, Muhammad, Kurniawati, Hanna +2 more
openaire +2 more sources

Adaptive cache policy optimization through deep reinforcement learning in dynamic cellular networks

Intelligent and Converged Networks
We explore the use of caching both at the network edge and within User Equipment (UE) to alleviate traffic load of wireless networks. We develop a joint cache placement and delivery policy that maximizes the Quality of Service (QoS) while simultaneously ...
Ashvin Srinivasan +3 more
doaj +1 more source

computer science
artificial intelligence
partially observable markov decision process

machine learning
mathematics
markov chain

mathematical optimization
markov model
markov decision process