Faster algorithm and sharper analysis for constrained Markov decision process. [PDF]
The problem of constrained Markov decision process (CMDP) is investigated, where an agent aims to maximize the expected accumulated reward subject to constraints on its utilities/costs.
Li T +5 more
europepmc +3 more sources
Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process [PDF]
This article is concerned with constructing a confidence interval for a target policy’s value offline, based on pre-collected observational data in infinite-horizon settings.
C. Shi +5 more
semanticscholar +1 more source
Risk-aware UAV-UGV Rendezvous with Chance-Constrained Markov Decision Process [PDF]
We study a chance-constrained variant of the cooperative aerial-ground vehicle routing problem, in which an Unmanned Aerial Vehicle (UAV) with limited battery capacity and an Unmanned Ground Vehicle (UGV) that can also act as a mobile recharging station ...
Guan-Yu Shi +6 more
semanticscholar +1 more source
Semi‐selfish mining based on hidden Markov decision process
Selfish mining attacks sabotage blockchain systems by exploiting vulnerabilities in the consensus mechanism. The attackers' main goal is to obtain higher revenues than honest parties.
Tao Li +5 more
semanticscholar +1 more source
This two-part series of papers provides a survey of recent advances in Deep Reinforcement Learning (DRL) for solving partially observable Markov decision process (POMDP) problems.
Xuanchen Xiang, Simon Foo, Huanyu Zang
doaj +1 more source
The first part of a two-part series of papers provides a survey of recent advances in Deep Reinforcement Learning (DRL) applications for solving partially observable Markov decision process (POMDP) problems.
Xuanchen Xiang, Simon Foo
doaj +1 more source
Electric vehicles (EVs) have rapidly developed in recent years and their penetration has also significantly increased, which, however, brings new challenges to power systems. Due to their stochastic behaviors, the improper charging strategies for EVs may ...
Tao Ding +5 more
semanticscholar +1 more source
Markov Decision Process-Based Resilience Enhancement for Distribution Systems: An Approximate Dynamic Programming Approach [PDF]
Because failures in distribution systems caused by extreme weather events directly result in consumers’ outages, this paper proposes a state-based decision-making model with the objective of mitigating loss of load to improve the distribution system ...
Chong Wang +5 more
semanticscholar +1 more source
Health Status-Based Predictive Maintenance Decision-Making via LSTM and Markov Decision Process
Maintenance decision-making is essential to achieve safe and reliable operation with high performance for equipment. To avoid unexpected shutdown and increase machine life as well as system efficiency, it is fundamental to design an effective maintenance ...
Pan Zheng +4 more
doaj +1 more source
Cognitive searching optimization is a subconscious mental phenomenon in decision making. Aroused by exploiting accessible human action, alleviating inefficient decision and shrinking searching space remain challenges for optimizing the solution space ...
Bingxuan Ren, Tangwen Yin, Shan Fu
doaj +1 more source

