Multi-armed bandit - Open Access .click

Arm order recognition in multi-armed bandit problem with laser chaos time series. [PDF]

Sci Rep, 2021
Narisawa N, Chauvet N, Hasegawa M, Naruse M. +3 more
europepmc +1 more source

Evaluation of performance: multi-armed bandit vs. contextual bandit [PDF]

Master of Science, Department of Computer Science, William H. Hsu. This work compares two methods, the multi-armed bandit (MAB) and contextual multi-armed bandit (CMAB), for action recommendation in a sequential decision-making domain.
Chatterjee, Ranojoy
core

Adaptive Sequence-Based Stimulus Selection in an ERP-Based Brain-Computer Interface by Thompson Sampling in a Multi-Armed Bandit Problem. [PDF]

Proceedings (IEEE Int Conf Bioinformatics Biomed), 2021
Ma T, Huggins JE, Kang J.
europepmc +1 more source

Enhanced Dynamic Spectrum Access in UAV Wireless Networks for Post-Disaster Area Surveillance System: A Multi-Player Multi-Armed Bandit Approach. [PDF]

Sensors (Basel), 2021
Amrallah A, Mohamed EM, Tran GK, Sakaguchi K. +3 more
europepmc +1 more source

Data from: Risk-aware multi-armed bandit problem with application to portfolio selection

2017
Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem ...
Huo, Xiaoguang, Fu, Feng
core +1 more source

Wi-Fi Assisted Contextual Multi-Armed Bandit for Neighbor Discovery and Selection in Millimeter Wave Device to Device Communications. [PDF]

Sensors (Basel), 2021
Hashima S +3 more
europepmc +1 more source

RESTLESS BANDIT MARGINAL PRODUCTIVITY INDICES II: MULTIPROJECT CASE AND SCHEDULING A MULTICLASS MAKE-TO-ORDER/-STOCK M/G/1 QUEUE [PDF]

This paper develops a framework based on convex optimization and economic ideas to formulate and solve approximately a rich class of dynamic and stochastic resource allocation problems, fitting in a generic discrete-state multi-project restless bandit ...
José Niño-Mora
core

Human behavior in contextual multi-armed bandit problems [PDF]

2015
In real-life decision environments people learn from their direct experience with alternative courses of action. Yet they can accelerate their learning by using functional knowledge about the features characterizing the alternatives. We designed a novel
Stojic, Hrvoje +5 more
core

Multi-Armed Bandit Networks: Exploring Online Learning with Networks [PDF]

2018
Classical Multi-Armed Bandit solutions often assume independent arms as a simplification of the problem. This has shown great results in many different fields of practice, but could, in some cases, presumably leave untapped potential.
Hansen, Viktor
core

Regret Lower Bounds in Multi-agent Multi-armed Bandit

2023
Multi-armed Bandit motivates methods with provable upper bounds on regret, and the counterpart lower bounds have also been extensively studied in this context.
Klabjan, Diego, Xu, Mengfan
core

information technology
reinforcement learning
medicine

online learning