Decision-making without a brain: how an amoeboid organism solves the two-armed bandit. [PDF]
Reid CR +5 more
europepmc +1 more source
New Approach to Equitable Intervention Planning to Improve Engagement and Outcomes in a Digital Health Program: Simulation Study. [PDF]
Killian JA +5 more
europepmc +1 more source
Smoking and the bandit: a preliminary study of smoker and nonsmoker differences in exploratory behavior measured with a multiarmed bandit task. [PDF]
Addicott MA +4 more
europepmc +1 more source
Why copy others? Insights from the social learning strategies tournament. [PDF]
Rendell L +9 more
europepmc +1 more source
Risk, unexpected uncertainty, and estimation uncertainty: Bayesian learning in unstable settings. [PDF]
Payzan-LeNestour E, Bossaerts P.
europepmc +1 more source
Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes
We study the Whittle index learning algorithm for restless multi-armed bandits. We consider index learning algorithm with Q-learning. We first present Q-learning algorithm with exploration policies -- epsilon-greedy, softmax, epsilon-softmax with ...
Meshram, Rahul +2 more
core
Exploration Disrupts Choice-Predictive Signals and Alters Dynamics in Prefrontal Cortex. [PDF]
Ebitz RB, Albarran E, Moore T.
europepmc +1 more source
Reinforcement learning for healthcare operations management: methodological framework, recent developments, and future research directions. [PDF]
Wu Q, Han J, Yan Y, Kuo YH, Shen ZM.
europepmc +1 more source
The neural representation of unexpected uncertainty during value-based decision making. [PDF]
Payzan-LeNestour E +3 more
europepmc +1 more source
How copying affects the amount, evenness and persistence of cultural knowledge: insights from the social learning strategies tournament. [PDF]
Rendell L +5 more
europepmc +1 more source

