Combinatorial Multi-Armed Bandit with General Reward Functions [PDF]
Wei Chen +5 more
openalex +1 more source
A quality assuring, cost optimal multi-armed bandit mechanism for expertsourcing
Shweta Jain +4 more
openalex +2 more sources
Adapting to Delays and Data in Adversarial Multi-Armed Bandits [PDF]
András György, Pooria Joulani
openalex
Collaborative Learning with Limited Interaction: Tight Bounds for Distributed Exploration in Multi-Armed Bandits [PDF]
Tao Chao, Qin Zhang, Yuan Zhou
openalex +1 more source
Towards Soft Fairness in Restless Multi-Armed Bandits [PDF]
De-Xun Li, Pradeep Varakantham
openalex +1 more source
Exploration Through Reward Biasing: Reward-Biased Maximum Likelihood Estimation for Stochastic Multi-Armed Bandits [PDF]
Xi Liu +3 more
openalex +1 more source
Multi-Armed Bandit Problem with Temporally-Partitioned Rewards: When Partial Feedback Counts [PDF]
Giulia Romano +4 more
openalex +1 more source
Learning Variable Ordering Heuristics with Multi-Armed Bandits and Restarts
Hugues Wattez +4 more
openalex +1 more source
Harnessing nonlinear optoelectronic oscillator for speeding up reinforcement learning
Reinforcement learning is an indispensable branch of artificial intelligence (AI), referring to the technology and methods of maximizing the rewards from an uncertain environment.
Ziwei Xu +7 more
doaj +1 more source
Risk-Averse Biased Human Policies in Assistive Multi-Armed Bandit\n Settings [PDF]
Michael Köller +2 more
openalex +1 more source

