Multi-armed bandit - Open Access .click

Results 91 to 100 of about 5,586 (174)

Exploration versus Exploitation Using Kriging Surrogate Modelling in Electromagnetic Design [PDF]

, 2011
This paper discusses the use of kriging surrogate modelling in multiobjective design optimisation in electromagnetics. The importance of achieving appropriate balance between exploration and exploitation is emphasised when searching for the global ...
Rotaru, M., Sykulski, J.K., Xiao, Song, Rotaru, M +3 more
core +1 more source

DCOPS and bandits: Exploration and exploitation in decentralised coordination

, 2012
Real life coordination problems are characterised by stochasticity and a lack of a priori knowledge about the interactions between agents. However, decentralised constraint optimisation problems (DCOPs), a widely adopted framework for modelling ...
Jennings, Nick +5 more
core

Restless bandit marginal productivity indices II: multiproject case and scheduling a multiclass make-to-order/-stock M/G/1 queue [PDF]

, 2004
This paper develops a framework based on convex optimization and economic ideas to formulate and solve approximately a rich class of dynamic and stochastic resource allocation problems, fitting in a generic discrete-state multi-project restless bandit ...
Niño-Mora, José, Niño Mora, José
core

Regret analysis of stochastic and nonstochastic multi-armed bandit problems

, 2012
Multi-armed bandit problems are the most basic examples of sequential decision problems with an exploration-exploitation trade-off. This is the balance between staying with the option that gave highest payoffs in the past and exploring new options that ...
N. Cesa-Bianchi, S. Bubeck
core +1 more source

A Comparative Study of UCB and Thompson Sampling with Structured Rewards: Parameter Sensitivity and Robustness [PDF]

ITM Web of Conferences
The behavior of multi-armed bandit (MAB) algorithms is closely tied to how their hyperparameters are set, but their stability in structured reward environments has not been examined in depth.
Chen Yutong
doaj +1 more source

Adversarial Autoencoder and Multi-Armed Bandit for Dynamic Difficulty Adjustment in Immersive Virtual Reality for Rehabilitation: Application to Hand Movement. [PDF]

Sensors (Basel), 2022
Kamikokuryo K, Haga T, Venture G, Hernandez V. +3 more
europepmc +1 more source

Il problema del Multi-Armed Bandit applicato all'advertising online

, 2016
La seguente tesi ha il compito di introdurre il lettore nel vasto campo dell'advertising online, andando a conoscere le varie tecniche utilizzate. Si passa da un'analisi generale del problema, e si passa ad analizzare e a descrivere alcuni casi specifici,
MONACO, GIACOMO
core

Predicting Ecological Momentary Assessments in an App for Tinnitus by Learning From Each User's Stream With a Contextual Multi-Armed Bandit. [PDF]

Front Neurosci, 2022
Shahania S +7 more
europepmc +1 more source

Optimal Job Design and Career Dynamics in the Presence of Uncertainty [PDF]

The paper studies a learning model in which information about a worker's ability can be acquired symmetrically by the worker and a firm in any period by observing the worker's performance on a given task.
Elena Pastorino
core

Humans adaptively resolve the explore-exploit dilemma under cognitive constraints: Evidence from a multi-armed bandit task. [PDF]

Cognition, 2022
Brown VM +3 more
europepmc +1 more source

information technology
reinforcement learning
medicine

online learning