Results 301 to 310 of about 1,717,828 (345)
Some of the next articles are maybe not open access.

Food reinforcement

Appetite, 2006
The reinforcing value of food, measured by how hard someone is willing to work to obtain food, is influenced by food palatability, food deprivation and food variety, and may be a more powerful determinant of food intake than hedonics or liking. The reinforcing value of food is mediated in part by dopaminergic activity. Genotypes that influence dopamine
Leonard H, Epstein, John J, Leddy
openaire   +2 more sources

DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning

Nature
General reasoning represents a long-standing and formidable challenge in artificial intelligence (AI). Recent breakthroughs, exemplified by large language models (LLMs)1,2 and chain-of-thought (CoT) prompting3, have achieved considerable success on ...
DeepSeek-AI   +197 more
semanticscholar   +1 more source

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

arXiv.org
Inference scaling empowers LLMs with unprecedented reasoning ability, with reinforcement learning as the core technique to elicit complex reasoning. However, key technical details of state-of-the-art reasoning LLMs are concealed (such as in OpenAI o1 ...
Qiying Yu   +34 more
semanticscholar   +1 more source

Kimi k1.5: Scaling Reinforcement Learning with LLMs

arXiv.org
Language model pretraining with next token prediction has proved effective for scaling compute but is limited to the amount of available training data.
Kimi Team   +93 more
semanticscholar   +1 more source

Intracranial Reinforcement Compared with Sugar-Water Reinforcement

Science, 1965
Three ways in which electrical, intracranial reinforcement is reputed to differ from conventional reinforcement were tested in an experiment which equated the form of the responses being reinforced and the response-reinforcement relation. Four groups of rats performed instrumental or consummatory responses reinforced by intracranial reinforcement or ...
W E, GIBSON   +3 more
openaire   +2 more sources

Gymnasium: A Standard Interface for Reinforcement Learning Environments

arXiv.org
Reinforcement Learning (RL) is a continuously growing field that has the potential to revolutionize many areas of artificial intelligence. However, despite its promise, RL research is often hindered by the lack of standardization in environment and ...
Mark Towers   +15 more
semanticscholar   +1 more source

Reinforcer probability, reinforcer magnitude, and the reinforcement context for remembering.

Journal of Experimental Psychology: Animal Behavior Processes, 2009
Traditional theories of delayed matching-to-sample performance do not predict that accuracy will improve when absolute levels of reinforcement are increased. This prediction emerges only when reinforcement context is considered (J. A. Nevin, M. Davison, A. L. Odum, & T. A. Shahan, 2007). To provide quantitative data, the authors factorially manipulated
Glenn S, Brown, K Geoffrey, White
openaire   +2 more sources

Reinforcement Comparison

1991
Sutton [in his PhD thesis] introduced a reinforcement comparison term into the equations governing certain stochastic learning automata, arguing that it should speed up learning, particularly for unbalanced reinforcement tasks. Williams's subsequent extensions [REINFORCE] to the class of algorithms demonstrated that they were all performing approximate
openaire   +2 more sources

Grandmaster level in StarCraft II using multi-agent reinforcement learning

Nature, 2019
O. Vinyals   +41 more
semanticscholar   +1 more source

Human-level control through deep reinforcement learning

Nature, 2015
Volodymyr Mnih   +18 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy