Q-learning - Open Access .click

Results 141 to 147 of about 199,848 (147)

Some of the next articles are maybe not open access.

Interleaved Q-Learning

2023
Jinna Li, Frank L. Lewis, Jialu Fan
openaire +1 more source

Q-Learning

2011
openaire +1 more source

Deep Q-Learning

2023
openaire +1 more source

Q-Learning

2012
openaire +1 more source

Double Q-learning

2010
In some stochastic environments the well-known reinforcement learning algorithm Q-learning performs very poorly. This poor performance is caused by large overestimations of action values. These overestimations result from a positive bias that is introduced because Q-learning uses the maximum action value as an approximation for the maximum expected ...
openaire +1 more source

Greedy Q-Learning

2012
openaire +1 more source

Neural Q-Learning

2000
ten Hagen, S.H.G., Kröse, B.J.A.
openaire +1 more source

reinforcement learning
fos: computer and information sciences
machine learning cs.lg

computer science - machine learning
artificial intelligence cs.ai
computer science - artificial intelligence

deep reinforcement learning
machine learning stat.ml
statistics - machine learning