Results 91 to 100 of about 5,586 (174)

Exploration versus Exploitation Using Kriging Surrogate Modelling in Electromagnetic Design [PDF]

open access: yes, 2011
This paper discusses the use of kriging surrogate modelling in multiobjective design optimisation in electromagnetics. The importance of achieving appropriate balance between exploration and exploitation is emphasised when searching for the global ...
Rotaru, M.   +3 more
core   +1 more source

DCOPS and bandits: Exploration and exploitation in decentralised coordination

open access: yes, 2012
Real life coordination problems are characterised by stochasticity and a lack of a priori knowledge about the interactions between agents. However, decentralised constraint optimisation problems (DCOPs), a widely adopted framework for modelling ...
Jennings, Nick   +5 more
core  

Restless bandit marginal productivity indices II: multiproject case and scheduling a multiclass make-to-order/-stock M/G/1 queue [PDF]

open access: yes, 2004
This paper develops a framework based on convex optimization and economic ideas to formulate and solve approximately a rich class of dynamic and stochastic resource allocation problems, fitting in a generic discrete-state multi-project restless bandit ...
Niño-Mora, José, Niño Mora, José
core  

Regret analysis of stochastic and nonstochastic multi-armed bandit problems

open access: yes, 2012
Multi-armed bandit problems are the most basic examples of sequential decision problems with an exploration-exploitation trade-off. This is the balance between staying with the option that gave highest payoffs in the past and exploring new options that ...
N. Cesa-Bianchi, S. Bubeck
core   +1 more source

A Comparative Study of UCB and Thompson Sampling with Structured Rewards: Parameter Sensitivity and Robustness [PDF]

open access: yesITM Web of Conferences
The behavior of multi-armed bandit (MAB) algorithms is closely tied to how their hyperparameters are set, but their stability in structured reward environments has not been examined in depth.
Chen Yutong
doaj   +1 more source

Il problema del Multi-Armed Bandit applicato all'advertising online

open access: yes, 2016
La seguente tesi ha il compito di introdurre il lettore nel vasto campo dell'advertising online, andando a conoscere le varie tecniche utilizzate. Si passa da un'analisi generale del problema, e si passa ad analizzare e a descrivere alcuni casi specifici,
MONACO, GIACOMO
core  

Predicting Ecological Momentary Assessments in an App for Tinnitus by Learning From Each User's Stream With a Contextual Multi-Armed Bandit. [PDF]

open access: yesFront Neurosci, 2022
Shahania S   +7 more
europepmc   +1 more source

Optimal Job Design and Career Dynamics in the Presence of Uncertainty [PDF]

open access: yes
The paper studies a learning model in which information about a worker's ability can be acquired symmetrically by the worker and a firm in any period by observing the worker's performance on a given task.
Elena Pastorino
core  

Home - About - Disclaimer - Privacy