Exploration versus Exploitation Using Kriging Surrogate Modelling in Electromagnetic Design
This paper discusses the use of kriging surrogate modelling in multiobjective design optimisation in electromagnetics. The importance of achieving appropriate balance between exploration and exploitation is emphasised when searching for the global ...
Rotaru, M. +3 more
core +1 more source
DCOPs and bandits: Exploration and exploitation in decentralised coordination
Real life coordination problems are characterised by stochasticity and a lack of a priori knowledge about the interactions between agents. However, decentralised constraint optimisation problems (DCOPs), a widely adopted framework for modelling ...
Jennings, Nick +5 more
core
Restless bandit marginal productivity indices II: multiproject case and scheduling a multiclass make-to-order/-stock M/G/1 queue
This paper develops a framework based on convex optimization and economic ideas to formulate and solve approximately a rich class of dynamic and stochastic resource allocation problems, fitting in a generic discrete-state multi-project restless bandit ...
Niño-Mora, José
core
Regret analysis of stochastic and nonstochastic multi-armed bandit problems
Multi-armed bandit problems are the most basic examples of sequential decision problems with an exploration-exploitation trade-off. This is the balance between staying with the option that gave highest payoffs in the past and exploring new options that ...
N. Cesa-Bianchi, S. Bubeck
core +1 more source
A Comparative Study of UCB and Thompson Sampling with Structured Rewards: Parameter Sensitivity and Robustness
The behavior of multi-armed bandit (MAB) algorithms is closely tied to how their hyperparameters are set, but their stability in structured reward environments has not been examined in depth.
Chen Yutong
doaj +1 more source
Adversarial Autoencoder and Multi-Armed Bandit for Dynamic Difficulty Adjustment in Immersive Virtual Reality for Rehabilitation: Application to Hand Movement.
Kamikokuryo K +3 more
europepmc +1 more source
The Multi-Armed Bandit Problem Applied to Online Advertising
This thesis aims to introduce the reader to the broad field of online advertising and to the various techniques used in it. It starts from a general analysis of the problem and then analyses and describes some specific cases,
MONACO, GIACOMO
core
Predicting Ecological Momentary Assessments in an App for Tinnitus by Learning From Each User's Stream With a Contextual Multi-Armed Bandit.
Shahania S +7 more
europepmc +1 more source
Optimal Job Design and Career Dynamics in the Presence of Uncertainty
The paper studies a learning model in which information about a worker's ability can be acquired symmetrically by the worker and a firm in any period by observing the worker's performance on a given task.
Elena Pastorino
core
Humans adaptively resolve the explore-exploit dilemma under cognitive constraints: Evidence from a multi-armed bandit task.
Brown VM +3 more
europepmc +1 more source

