Approximate policy iteration - Open Access .click

Active guidance in ultrasound bladder scanning using reinforcement learning. [PDF]

Sci Rep
Hsu HL +9 more
europepmc +1 more source

Comparative Evaluation of Bandit-Style Heuristic Policies for Moving Target Detection in a Linear Grid Environment. [PDF]

Sensors (Basel)
Kang H, Ahn M, Seo Y.
europepmc +1 more source

Deep Reinforcement Learning for Intervention of Partially Observable Regulatory Networks. [PDF]

Proc Am Control Conf
Hosseini SH, Imani M.
europepmc +1 more source

Network carrier allocation optimization based on immune algorithm under massive concurrent access. [PDF]

Sci Rep
Qi L.
europepmc +1 more source

Some of the following articles may not be open access.

Related searches:

computer science
mathematics
mathematical optimization

reinforcement learning
artificial intelligence
statistics

machine learning
markov decision process
applied mathematics

Classification-Based Approximate Policy Iteration

IEEE Transactions on Automatic Control, 2015
Farahmand, Amir-Massoud +3 more
openaire +3 more sources

Approximate value iteration with randomized policies

Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187), 2002
The curse of dimensionality in dynamic programming prevents, in most problems of practical interest, the exact computation of the value function. We study the fixed points of approximate value iteration, a simple algorithm that combats the curse of dimensionality by generating approximate iterates of the classical value iteration algorithm in the span ...
D.P. de Farias, B. Van Roy
openaire +1 more source

Policy Approximation in Policy Iteration Approximate Dynamic Programming for Discrete-Time Nonlinear Systems

IEEE Transactions on Neural Networks and Learning Systems, 2017
Policy iteration approximate dynamic programming (DP) is an important algorithm for solving optimal decision and control problems. In this paper, we focus on the problem associated with policy approximation in policy iteration approximate DP for discrete-time nonlinear systems using infinite-horizon undiscounted value functions.
Wentao Guo, Jennie Si, Feng Liu, Shengwei Mei +3 more
openaire +2 more sources

Semiglobal nonlinear stabilization via approximate policy iteration

Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148), 2001
We consider the problem of semiglobal nonlinear stabilization. Based on a given up table dynamic system and a region of acceptable operation within which the state is desired to be confined, we define an appropriate alternative dynamic system. We define an optimal control problem for the alternative (redefined) system which is amenable to solution via ...
C.I. Boussios, M.A. Dahleh, J.N. Tsitsiklis +2 more
openaire +1 more source
