Partially observable markov decision processes

Results 11 to 20 of about 30,788 (321)

The optimal probability of the risk for finite horizon partially observable Markov decision processes

AIMS Mathematics, 2023
This paper investigates the optimality of the risk probability for finite horizon partially observable discrete-time Markov decision processes (POMDPs).
Xian Wen , Haifeng Huo, Jinhua Cui
doaj +2 more sources

History-dependent Evaluations in Partially Observable Markov Decision Process [PDF]

SIAM Journal on Control and Optimization, 2021
zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Xavier Mathieu Raymond Venel, Bruno Ziliotto +1 more
openaire +5 more sources

Partially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition Probabilities [PDF]

, 2016
This paper describes sufficient conditions for the existence of optimal policies for partially observable Markov decision processes (POMDPs) with Borel state, observation, and action sets, when the goal is to minimize the expected total costs over finite
Feinberg, E. A. +4 more
core +3 more sources

Intelligent anti-jamming decision algorithm for wireless communication under limited channel state information conditions [PDF]

Scientific Reports
Deep reinforcement learning has been widely applied to solve the anti-jamming problems in wireless communications, achieving good results. However, most research assumes that the communication system can obtain complete Channel State Information (CSI ...
Feng Zhang +3 more
doaj +2 more sources

Increasing the Construct Validity of Computational Phenotypes of Mental Illness Through Active Inference and Brain Imaging [PDF]

Brain Sciences
After more than 30 years since its inception, the utility of brain imaging for understanding and diagnosing mental illnesses is in doubt, receiving well-grounded criticisms from clinical practitioners.
Roberto Limongi +3 more
doaj +2 more sources

Multi-UAV Mapping and Target Finding in Large, Complex, Partially Observable Environments

Remote Sensing, 2023
Coordinating multiple unmanned aerial vehicles (UAVs) for the purposes of target finding or surveying points of interest in large, complex, and partially observable environments remains an area of exploration.
Violet Walker, Fernando Vanegas, Felipe Gonzalez +2 more
doaj +1 more source

Factored Beliefs for Machine Agents in Decentralized Partially Observable Markov Decision Processes

Proceedings of the International Florida Artificial Intelligence Research Society Conference, 2022
A shared mental model (SMM) is a foundational structure in high performing, task-oriented teams and aid humans in determining their teammate's goals and intentions.
Joshua Lapso, Gilbert Peterson
doaj +1 more source

Guided Soft Actor Critic: A Guided Deep Reinforcement Learning Approach for Partially Observable Markov Decision Processes

IEEE Access, 2021
Most real-world problems are essentially partially observable, and the environmental model is unknown. Therefore, there is a significant need for reinforcement learning approaches to solve them, where the agent perceives the state of the environment ...
Mehmet Haklidir, Hakan Temeltas
doaj +1 more source

Bayesian inference with incomplete knowledge explains perceptual confidence and its deviations from accuracy

Nature Communications, 2021
A Bayesian framework based on partially observable Markov decision processes (POMDPs) not only predicts subjects’ confidence in a perceptual decision making task but also explains well-known discrepancies between confidence and choice accuracy as arising
Koosha Khalvati, Roozbeh Kiani, Rajesh P. N. Rao +2 more
doaj +1 more source

Frequency Agile Anti-Interference Technology Based on Reinforcement Learning Using Long Short-Term Memory and Multi-Layer Historical Information Observation

Remote Sensing, 2023
In modern electronic warfare, radar intelligence has become increasingly crucial when dealing with complex interference environments. This paper combines radar agile frequency technology with reinforcement learning to achieve adaptive frequency hopping ...
Weihao Shi +5 more
doaj +1 more source

pomdp
markov and semi-markov decision processes
dynamic programming

fos: computer and information sciences
markov decision process
reinforcement learning

artificial intelligence cs.ai
computer science - artificial intelligence
deep reinforcement learning