Results 51 to 60 of about 13,931 (229)
Multi‐Agent Reinforcement Learning for Cyber Defence Transferability and Scalability
A method for using mutli‐agent reinforcement learning that allows for zero shot transfer across network setups. Diagrams show the local observation construction, training and agent mapping process. The results for novel 15 and 30 node networks show effective transfer and improved scaling performance. ABSTRACT Reinforcement learning (RL) has shown to be
Andrew Thomas +2 more
wiley +1 more source
To improve the natural human-avoidance skills of service robots, a human motion predictive navigation method is proposed, namely PN-POMDP. A human-robot motion co-occurrence estimation algorithm is proposed which incorporates long-term and short-term ...
Kun Qian +4 more
doaj +1 more source
UAV4PE: An Open-Source Framework to Plan UAV Autonomous Missions for Planetary Exploration
Autonomous Unmanned Aerial Vehicles (UAV) for planetary exploration missions require increased onboard mission-planning and decision-making capabilities to access full operational potential in remote environments (e.g., Antarctica, Mars or Titan ...
Julian Galvez-Serna +5 more
doaj +1 more source
Online Planning Algorithms for POMDPs
Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, solving a POMDP is often intractable except for small problems due to their complexity. Here, we focus on online approaches that alleviate the computational complexity by computing good local ...
Ross, Stéphane +3 more
openaire +4 more sources
A Task and Motion Planning Framework for Partially Observable Household Manipulation Scenes
This paper presents a novel TAMP framework tackling partial observability, cluttered scenes, and vague goals via symbolic reasoning and occlusion‐aware planning. It outperforms prior learning, policy, and LLM‐based methods in six real and simulated household tasks.
Yuhong Ma +3 more
wiley +1 more source
Experimental results : Reinforcement Learning of POMDPs using Spectral Methods [PDF]
We propose a new reinforcement learning algorithm for partially observable Markov decision processes (POMDP) based on spectral decomposition methods.
Anandkumar, Animashree +2 more
core +1 more source
Energy Efficient Execution of POMDP Policies [PDF]
Recent advances in planning techniques for partially observable Markov decision processes (POMDPs) have focused on online search techniques and offline point-based value iteration. While these techniques allow practitioners to obtain policies for fairly large problems, they assume that a nonnegligible amount of computation can be done between each ...
Grześ, Marek +3 more
openaire +2 more sources
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions
Summary In recent years, reinforcement learning (RL) has acquired a prominent position in health‐related sequential decision‐making problems, gaining traction as a valuable tool for delivering adaptive interventions (AIs). However, in part due to a poor synergy between the methodological and the applied communities, its real‐life application is still ...
Nina Deliu +2 more
wiley +1 more source
Sensor Synthesis for POMDPs with Reachability Objectives
Partially observable Markov decision processes (POMDPs) are widely used in probabilistic planning problems in which an agent interacts with an environment using noisy and imprecise sensors.
Chatterjee, Krishnendu +2 more
core +2 more sources
Monte Carlo Bayesian Reinforcement Learning [PDF]
Bayesian reinforcement learning (BRL) encodes prior knowledge of the world in a model and represents uncertainty in model parameters by maintaining a probability distribution over them.
Hsu, David +3 more
core +1 more source

