Results 81 to 90 of about 5,586 (174)
Two-stage index computation for bandits with switching penalties II : switching delays [PDF]
This paper addresses the multi-armed bandit problem with switching penalties including both costs and delays, extending results of the companion paper [J. Niño-Mora.
Jose Nino-Mora
core
Optimizing Station-Based Shared E-Bike Allocation Using Multi-Armed Bandit Algorithms [PDF]
In the era of rapid urban transformation, the sharing economy has revolutionized transportation solutions, with shared bicycles emerging as a prominent example.
Zhao Wenkui
doaj +1 more source
The game theory was created on the basis of social as well as gambling games, such as chess, poker, baccarat, hex, or one-armed bandit. The aforementioned games lay solid foundations for analogous mathematical models (e.g., hex), artificial intelligence ...
Drabik Ewa
doaj +1 more source
Bandit Algorithms for Advertising Optimization: A Comparative Study [PDF]
In recent years, the rapid development of digital advertising has challenged advertisers to make optimal choices among multiple options quickly. This is crucial for increasing user engagement and return on investment.
Tian Ziyue
doaj +1 more source
Bridging Adversarial and Nonstationary Multi-armed Bandit
In the multi-armed bandit framework, there are two formulations that are commonly employed to handle time-varying reward distributions: adversarial bandit and nonstationary bandit.
Yang, Shuoguang +2 more
core
Sustainable Cooperative Coevolution with a Multi-Armed Bandit
International audienceThis paper proposes a self-adaptation mechanism to manage the resources allocated to the different species comprising a cooperative coevolutionary algorithm.
Gagné, Christian +4 more
core +2 more sources
Spectrum Allocation and User Scheduling Based on Combinatorial Multi-Armed Bandit for 5G Massive MIMO. [PDF]
Dou J, Liu X, Qie S, Li J, Wang C.
europepmc +1 more source
Annealing linear scalarized based multi-objective multi-armed bandit algorithm
A stochastic multi-objective multi-armed bandit problem is a particular type of multi-objective (MO) optimization problems where the goal is to find and play fairly the optimal arms. To solve the multi-objective optimization problem, we propose annealing
Yahyaa, S.Q. +5 more
core +1 more source
Keeping Your Options Open [PDF]
In standard models of experimentation, the costs of project development consist of (i) the direct cost of running trials as well as (ii) the implicit opportunity cost of leaving alternative projects idle. Another natural type of experimentation cost, the
Jean Guillaume Forand
core
Satisficing in Multi-Armed Bandit Problems
Satisficing is a relaxation of maximizing and allows for less risky decision making in the face of uncertainty. We propose two sets of satisficing objectives for the multi-armed bandit problem, where the objective is to achieve reward-based decision ...
Reverdy, P +2 more
core +1 more source

