Arm order recognition in multi-armed bandit problem with laser chaos time series. [PDF]
Narisawa N +3 more
europepmc +1 more source
Evaluation of performance: multi-armed bandit vs. contextual bandit [PDF]
Master of ScienceDepartment of Computer ScienceWilliam H. HsuThis work compares two methods, the multi-armed bandit (MAB) and contextual multi-armed bandit (CMAB), for action recommendation in a sequential decision making domain.
Chatterjee, Ranojoy
core
Adaptive Sequence-Based Stimulus Selection in an ERP-Based Brain-Computer Interface by Thompson Sampling in a Multi-Armed Bandit Problem. [PDF]
Ma T, Huggins JE, Kang J.
europepmc +1 more source
Enhanced Dynamic Spectrum Access in UAV Wireless Networks for Post-Disaster Area Surveillance System: A Multi-Player Multi-Armed Bandit Approach. [PDF]
Amrallah A +3 more
europepmc +1 more source
Data from: Risk-aware multi-armed bandit problem with application to portfolio selection
Sequential portfolio selection has attracted increasing interests in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem ...
Huo, Xiaoguang, Fu, Feng
core +1 more source
Wi-Fi Assisted Contextual Multi-Armed Bandit for Neighbor Discovery and Selection in Millimeter Wave Device to Device Communications. [PDF]
Hashima S +3 more
europepmc +1 more source
RESTLESS BANDIT MARGINAL PRODUCTIVITY INDICES II: MULTIPROJECT CASE AND SCHEDULING A MULTICLASS MAKE-TO-ORDER/-STOCK M/G/1 QUEUE [PDF]
This paper develops a framework based on convex optimization and economic ideas to formulate and solve approximately a rich class of dynamic and stochastic resource allocation problems, fitting in a generic discrete-state multi-project restless bandit ...
José Niño-Mora
core
Human behavior in contextual multi-armed bandit problems [PDF]
In real-life decision environments people learn from their di-rect experience with alternative courses of action. Yet they can accelerate their learning by using functional knowledge about the features characterizing the alternatives. We designed a novel
Stojic, Hrvoje +5 more
core
Multi-Armed Bandit Networks: Exploring Online Learning with Networks [PDF]
Classical Multi-Armed Bandit solutions often assumes independent arms as a simplification of the problem. This has shown great results in many different fields of practice, but could in some cases, presumably leave untapped potential.
Hansen, Viktor
core
Regret Lower Bounds in Multi-agent Multi-armed Bandit
Multi-armed Bandit motivates methods with provable upper bounds on regret and also the counterpart lower bounds have been extensively studied in this context.
Klabjan, Diego, Xu, Mengfan
core

