Multi-armed bandit - Open Access .click

Results 181 to 190 of about 33,931 (272)

Pigeon and human performance in a multi-armed bandit task in response to changes in variable interval schedules [PDF]

, 2011
Deborah E. Racey +4 more
openalex +1 more source

contextual: Evaluating Contextual Multi-Armed Bandit Problems in R [PDF]

, 2018
Robin van Emden, Maurits Kaptein
openalex +1 more source

Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond [PDF]

Xutong Liu +9 more
openalex +1 more source

Multi-Armed Bandits on Partially Revealed Unit Interval Graphs [PDF]

, 2018
Xiao Xu +3 more
openalex +1 more source

Comparative Evaluation of Bandit-Style Heuristic Policies for Moving Target Detection in a Linear Grid Environment. [PDF]

Sensors (Basel)
Kang H, Ahn M, Seo Y.
europepmc +1 more source

Comparative Evaluation of Asymptotically Optimal and Standard Upper Confidence Bound Algorithms in Multi-Armed Bandit Scenarios

Qiuyuan Lyu
openalex +2 more sources

Multi-Armed Bandits with Abstention

We introduce a novel extension of the canonical multi-armed bandit problem that incorporates an additional strategic element: abstention. In this enhanced framework, the agent is not only tasked with selecting an arm at each time step, but also has the option to abstain from accepting the stochastic instantaneous reward before observing it. When opting
Yang, Junwen, Jin, Tianyuan, Tan, Vincent Y. F. +2 more
openaire +1 more source

Replicable Bandits for Digital Health Interventions. [PDF]

Stat Sci
Zhang KW, Closser N, Trella AL, Murphy SA. +3 more
europepmc +1 more source

Model-based exploration is measurable across tasks but not linked to personality and psychiatric assessments. [PDF]

Sci Rep
Witte K, Thalmann M, Schulz E.
europepmc +1 more source

Combinatorial Multi-Armed Bandits with Fairness Constraints: An Online Convex Optimization Perspective

Xiaosong Chen +4 more
openalex +2 more sources

computer science
mathematics
artificial intelligence

mathematical optimization
machine learning
regret

engineering
mathematical analysis
information technology