Results 81 to 90 of about 199,848 (147)

Q-learning with temporal memory to navigate turbulence. [PDF]

open access: yesElife
Rando M   +4 more
europepmc   +1 more source

Scaling Up Q-Learning via Exploiting State-Action Equivalence. [PDF]

open access: yesEntropy (Basel), 2023
Lyu Y, Côme A, Zhang Y, Talebi MS.
europepmc   +1 more source

Home - About - Disclaimer - Privacy