Multi-armed bandit - Open Access .click

Results 81 to 90 of about 5,586 (174)

Two-stage index computation for bandits with switching penalties II : switching delays [PDF]

This paper addresses the multi-armed bandit problem with switching penalties including both costs and delays, extending results of the companion paper [J. Niño-Mora.
Jose Nino-Mora
core

Optimizing Station-Based Shared E-Bike Allocation Using Multi-Armed Bandit Algorithms [PDF]

ITM Web of Conferences
In the era of rapid urban transformation, the sharing economy has revolutionized transportation solutions, with shared bicycles emerging as a prominent example.
Zhao Wenkui
doaj +1 more source

Several Remarks on the Role of Certain Positional and Social Games in the Creation of the Selected Statistical and Economic Applications

Foundations of Management, 2016
The game theory was created on the basis of social as well as gambling games, such as chess, poker, baccarat, hex, or one-armed bandit. The aforementioned games lay solid foundations for analogous mathematical models (e.g., hex), artificial intelligence ...
Drabik Ewa
doaj +1 more source

Bandit Algorithms for Advertising Optimization: A Comparative Study [PDF]

ITM Web of Conferences
In recent years, the rapid development of digital advertising has challenged advertisers to make optimal choices among multiple options quickly. This is crucial for increasing user engagement and return on investment.
Tian Ziyue
doaj +1 more source

Bridging Adversarial and Nonstationary Multi-armed Bandit

, 2023
In the multi-armed bandit framework, there are two formulations that are commonly employed to handle time-varying reward distributions: adversarial bandit and nonstationary bandit.
Yang, Shuoguang, Chen, Ningyuan, Zhang, Hailun +2 more
core

Sustainable Cooperative Coevolution with a Multi-Armed Bandit

, 2013
International audienceThis paper proposes a self-adaptation mechanism to manage the resources allocated to the different species comprising a cooperative coevolutionary algorithm.
Gagné, Christian +4 more
core +2 more sources

Spectrum Allocation and User Scheduling Based on Combinatorial Multi-Armed Bandit for 5G Massive MIMO. [PDF]

Sensors (Basel), 2023
Dou J, Liu X, Qie S, Li J, Wang C.
europepmc +1 more source

Annealing linear scalarized based multi-objective multi-armed bandit algorithm

, 2015
A stochastic multi-objective multi-armed bandit problem is a particular type of multi-objective (MO) optimization problems where the goal is to find and play fairly the optimal arms. To solve the multi-objective optimization problem, we propose annealing
Yahyaa, S.Q. +5 more
core +1 more source

Keeping Your Options Open [PDF]

In standard models of experimentation, the costs of project development consist of (i) the direct cost of running trials as well as (ii) the implicit opportunity cost of leaving alternative projects idle. Another natural type of experimentation cost, the
Jean Guillaume Forand
core

Satisficing in Multi-Armed Bandit Problems

, 2017
Satisficing is a relaxation of maximizing and allows for less risky decision making in the face of uncertainty. We propose two sets of satisficing objectives for the multi-armed bandit problem, where the objective is to achieve reward-based decision ...
Reverdy, P, Srivastava, V, Leonard, Naomi E +2 more
core +1 more source

information technology
reinforcement learning
medicine

online learning