Results 141 to 147 of about 199,848 (147)
Some of the next articles are maybe not open access.

Interleaved Q-Learning

2023
Jinna Li, Frank L. Lewis, Jialu Fan
openaire   +1 more source

Q-Learning

2011
openaire   +1 more source

Double Q-learning

2010
In some stochastic environments the well-known reinforcement learning algorithm Q-learning performs very poorly. This poor performance is caused by large overestimations of action values. These overestimations result from a positive bias that is introduced because Q-learning uses the maximum action value as an approximation for the maximum expected ...
openaire   +1 more source

Neural Q-Learning

2000
ten Hagen, S.H.G., Kröse, B.J.A.
openaire   +1 more source

Home - About - Disclaimer - Privacy