Results 301 to 310 of about 5,619,500 (318)
Some of the next articles are maybe not open access.

Kimi k1.5: Scaling Reinforcement Learning with LLMs

arXiv.org
Language model pretraining with next token prediction has proved effective for scaling compute but is limited to the amount of available training data.
Kimi Team   +93 more
semanticscholar   +1 more source

Reinforcement Learning

IFAC Proceedings Volumes, 1997
Reinforcement learning refers to ways of improving performance through trial-and-error experience. Despite recent progress in developing artificial learning systems, including new learning methods for artificial neural networks, most of these systems learn under the tutelage of a knowledgeable “teacher” able to tell them how to respond to a set of ...
openaire   +2 more sources

Reinforcement Learning

2008
Just as there are many different types of supervised and unsupervised learning, so there are many different types of reinforcement learning. Reinforcement learning is appropriate for an AI or agent which is actively exploring its environment and also actively exploring what actions are best to take in different situations.
Darryl Charles   +3 more
openaire   +1 more source

Reinforcement learning control

Current Opinion in Neurobiology, 1994
Reinforcement learning refers to improving performance through trial-and-error. Despite recent progress in developing artificial learning systems, including new learning methods for artificial neural networks, most of these systems learn under the tutelage of a knowledgeable 'teacher' able to tell them how to respond to a set of training stimuli ...
openaire   +3 more sources

Reinforcement and Learning in Schizophrenics

Psychological Reports, 1971
Three groups of 7 chronic schizophrenics were tested on a word-association task under conditions of verbal reinforcement, non-verbal reinforcement, or no reinforcement. Both verbal and non-verbal reinforcement were significantly more effective than no reinforcement in conditioning patients to the task but equally effective.
openaire   +3 more sources

Grandmaster level in StarCraft II using multi-agent reinforcement learning

Nature, 2019
O. Vinyals   +41 more
semanticscholar   +1 more source

Dense reinforcement learning for safety validation of autonomous vehicles

Nature, 2023
Shuo Feng   +6 more
semanticscholar   +1 more source

Human-level control through deep reinforcement learning

Nature, 2015
Volodymyr Mnih   +18 more
semanticscholar   +1 more source

Reinforcement Learning: An Introduction

IEEE Trans. Neural Networks, 1998
R. S. Sutton, A. Barto
semanticscholar   +1 more source

Distributed Reinforcement Learning

Robotics and Autonomous Systems, 1995
In multi-agent systems two forms of learning can be distinguished: centralized learning, that is, learning done by a single agent independent of the other agents; and distributed learning, that is, learning that becomes possible only because several agents are present. Whereas centralized learning has been intensively studied in the field of artificial
openaire   +2 more sources

Home - About - Disclaimer - Privacy