Results 241 to 250 of about 93,202 (286)
Active guidance in ultrasound bladder scanning using reinforcement learning. [PDF]
Hsu HL +9 more
europepmc +1 more source
Comparative Evaluation of Bandit-Style Heuristic Policies for Moving Target Detection in a Linear Grid Environment. [PDF]
Kang H, Ahn M, Seo Y.
europepmc +1 more source
Deep Reinforcement Learning for Intervention of Partially Observable Regulatory Networks. [PDF]
Hosseini SH, Imani M.
europepmc +1 more source
Network carrier allocation optimization based on immune algorithm under massive concurrent access. [PDF]
Qi L.
europepmc +1 more source
Some of the next articles are maybe not open access.
Related searches:
Related searches:
Classification-Based Approximate Policy Iteration
IEEE Transactions on Automatic Control, 2015zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Farahmand, Amir-Massoud +3 more
openaire +3 more sources
Approximate value iteration with randomized policies
Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187), 2002The curse of dimensionality in dynamic programming prevents, in most problems of practical interest, the exact computation of the value function. We study the fixed points of approximate value iteration, a simple algorithm that combats the curse of dimensionality by generating approximate iterates of the classical value iteration algorithm in the span ...
D.P. de Farias, B. Van Roy
openaire +1 more source
IEEE Transactions on Neural Networks and Learning Systems, 2017
Policy iteration approximate dynamic programming (DP) is an important algorithm for solving optimal decision and control problems. In this paper, we focus on the problem associated with policy approximation in policy iteration approximate DP for discrete-time nonlinear systems using infinite-horizon undiscounted value functions.
Wentao Guo +3 more
openaire +2 more sources
Policy iteration approximate dynamic programming (DP) is an important algorithm for solving optimal decision and control problems. In this paper, we focus on the problem associated with policy approximation in policy iteration approximate DP for discrete-time nonlinear systems using infinite-horizon undiscounted value functions.
Wentao Guo +3 more
openaire +2 more sources
Semiglobal nonlinear stabilization via approximate policy iteration
Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148), 2001We consider the problem of semiglobal nonlinear stabilization. Based on a given up table dynamic system and a region of acceptable operation within which the state is desired to be confined, we define an appropriate alternative dynamic system. We define an optimal control problem for the alternative (redefined) system which is amenable to solution via ...
C.I. Boussios +2 more
openaire +1 more source

