Results 221 to 230 of about 640,433 (264)
Some of the next articles are maybe not open access.

Reinforced mixture learning

Neural Networks, 2023
In this article, we formulate the standard mixture learning problem as a Markov Decision Process (MDP). We theoretically show that the objective value of the MDP is equivalent to the log-likelihood of the observed data with a slightly different parameter space constrained by the policy.
Yuan Le, Fan Zhou, Yang Bai
openaire   +2 more sources

FUZZY REINFORCEMENT LEARNING

International Journal of Modern Physics C, 2002
Fuzzy logic represents an extension of classical logic, giving modes of approximate reasoning in an environment of uncertainty and imprecision. Fuzzy inference systems incorporates human knowledge into their knowledge base on the conclusions of the fuzzy rules, which are affected by subjective decisions.
Andrecut, M., Ali, M. K.
openaire   +2 more sources

Reinforcement Learning

In this chapter, reinforcement learning (RL), a subfield of machine learning that has gained prominence because it enables agents to interact with their surroundings and learn from their mistakes, is covered in great detail. The chapter looks at the core elements of RL, including agents, actions, states, and rewards, in addition to examining a number ...
Arti Saxena, Falak Bhardwaj
  +8 more sources

Robust Reinforcement Learning

Neural Computation, 2005
This letter proposes a new reinforcement learning (RL) paradigm that explicitly takes into account input disturbance as well as modeling errors. The use of environmental models in RL is quite popular for both off-line learning using simulations and for online action planning.
Jun, Morimoto, Kenji, Doya
openaire   +2 more sources

Reinforcement learning

Proceedings of the 2016 Conference on User Modeling Adaptation and Personalization, 2016
Reinforcement Learning (RL) is one of the best machine learning approaches for decision making in interactive environments. RL focuses on inducing effective decision making policies with the goal of maximizing the agent's cumulative reward. In this study, we investigated the impact of both immediate and delayed reward functions on RL-induced policies ...
Shitian Shen, Min Chi
openaire   +2 more sources

Meta-learning in Reinforcement Learning

Neural Networks, 2003
Meta-parameters in reinforcement learning should be tuned to the environmental dynamics and the animal performance. Here, we propose a biologically plausible meta-reinforcement learning algorithm for tuning these meta-parameters in a dynamic, adaptive manner.
Nicolas, Schweighofer, Kenji, Doya
openaire   +2 more sources

Home - About - Disclaimer - Privacy