Results 91 to 100 of about 257 (142)

Two-Armed Restless Bandits with Imperfect Information: Stochastic Control and Indexability

open access: yes, 2018
We present a two-armed bandit model of decision making under uncertainty where the expected return to investing in the “risky arm” increases when choosing that arm and decreases when choosing the “safe” arm.
Roland Fryer, Philipp Harms
core   +1 more source

Experimental evolutionary simulations of learning, memory and life history. [PDF]

open access: yesPhilos Trans R Soc Lond B Biol Sci, 2020
Morgan TJH, Suchow JW, Griffiths TL.
europepmc   +1 more source

Low-Complexity Algorithm for Restless Bandits with Imperfect Observations

open access: yes
We consider a class of restless bandit problems that finds a broad application area in reinforcement learning and stochastic optimization. We consider $N$ independent discrete-time Markov processes, each of which had two possible states: 1 and 0 (`good ...
Zhang, Chengzhong   +2 more
core  

A foundation model to predict and capture human cognition. [PDF]

open access: yesNature
Binz M   +39 more
europepmc   +1 more source

Anhedonic Traits Do Not Impair Performance in a 3-Arm Bandit Task. [PDF]

open access: yesComput Psychiatr
Ramaswamy A   +4 more
europepmc   +1 more source

Optimal Best Arm Identification with Fixed Confidence in Restless Bandits

open access: yes
We study best arm identification in a restless multi-armed bandit setting with finitely many arms. The discrete-time data generated by each arm forms a homogeneous Markov chain taking values in a common, finite state space.
Karthik, P. N.   +3 more
core  

Home - About - Disclaimer - Privacy