Results 181 to 190 of about 33,931 (272)
Pigeon and human performance in a multi-armed bandit task in response to changes in variable interval schedules [PDF]
Deborah E. Racey +4 more
openalex +1 more source
contextual: Evaluating Contextual Multi-Armed Bandit Problems in R [PDF]
Robin van Emden, Maurits Kaptein
openalex +1 more source
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond [PDF]
Xutong Liu +9 more
openalex +1 more source
Multi-Armed Bandits on Partially Revealed Unit Interval Graphs [PDF]
Xiao Xu +3 more
openalex +1 more source
Comparative Evaluation of Bandit-Style Heuristic Policies for Moving Target Detection in a Linear Grid Environment. [PDF]
Kang H, Ahn M, Seo Y.
europepmc +1 more source
Multi-Armed Bandits with Abstention
We introduce a novel extension of the canonical multi-armed bandit problem that incorporates an additional strategic element: abstention. In this enhanced framework, the agent is not only tasked with selecting an arm at each time step, but also has the option to abstain from accepting the stochastic instantaneous reward before observing it. When opting
Yang, Junwen +2 more
openaire +1 more source
Replicable Bandits for Digital Health Interventions. [PDF]
Zhang KW +3 more
europepmc +1 more source
Model-based exploration is measurable across tasks but not linked to personality and psychiatric assessments. [PDF]
Witte K, Thalmann M, Schulz E.
europepmc +1 more source

