Active guidance in ultrasound bladder scanning using reinforcement learning.

Sci Rep (open access)
Hsu HL   +9 more
europepmc   +1 more source

Classification-Based Approximate Policy Iteration

IEEE Transactions on Automatic Control, 2015
Farahmand, Amir-Massoud   +3 more
openaire   +3 more sources

Approximate value iteration with randomized policies

Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187), 2002
The curse of dimensionality in dynamic programming prevents, in most problems of practical interest, the exact computation of the value function. We study the fixed points of approximate value iteration, a simple algorithm that combats the curse of dimensionality by generating approximate iterates of the classical value iteration algorithm in the span ...
D.P. de Farias, B. Van Roy
openaire   +1 more source

Policy Approximation in Policy Iteration Approximate Dynamic Programming for Discrete-Time Nonlinear Systems

IEEE Transactions on Neural Networks and Learning Systems, 2017
Policy iteration approximate dynamic programming (DP) is an important algorithm for solving optimal decision and control problems. In this paper, we focus on the problem associated with policy approximation in policy iteration approximate DP for discrete-time nonlinear systems using infinite-horizon undiscounted value functions.
Wentao Guo   +3 more
openaire   +2 more sources

Semiglobal nonlinear stabilization via approximate policy iteration

Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148), 2001
We consider the problem of semiglobal nonlinear stabilization. Based on a given stabilizable dynamic system and a region of acceptable operation within which the state is desired to be confined, we define an appropriate alternative dynamic system. We define an optimal control problem for the alternative (redefined) system which is amenable to solution via ...
C.I. Boussios   +2 more
openaire   +1 more source