Results 51 to 60 of about 313 (110)
Starting with the Thomspon sampling algorithm, recent years have seen a resurgence of interest in Bayesian algorithms for the Multi-armed Bandit (MAB) problem.
Farias, Vivek F., Gutin, Eli
core
Optimal Learning with Non-Gaussian Rewards [PDF]
In this disseration, the author studies sequential Bayesian learning problems modeled under non-Gaussian distributions. We focus on a class of problems called the multi-armed bandit problem, and studies its optimal learning strategy, the Gittins index ...
Ding, Zi
core +1 more source
The Gittins index is optimal for dynamic allocation with conditionally independent filtrations
40 pages, no ...
openaire +2 more sources
Characterization and computation of restless bandit marginal productivity indices [PDF]
The Whittle index [P. Whittle (1988). Restless bandits: Activity allocation in a changing world. J. Appl. Probab. 25A, 287-298] yields a practical scheduling rule for the versatile yet intractable multi-armed restless bandit problem, involving the ...
Jose Nino-Mora
core
Index policies for the admission control and routing of impatient customers to heterogeneous service stations [PDF]
We propose a general Markovian model for the optimal control of admissions and subsequent routing of customers for service provided by a collection of heterogeneous stations. Queue-length information is available to inform all decisions.
Ouenniche, J +5 more
core +1 more source
A (2/3)n3Fast-Pivoting Algorithm for the Gittins Index and Optimal Stopping of a Markov Chain [PDF]
This paper presents a new fast-pivoting algorithm that computes the n Gittins index values of an n-state bandit—in the discounted and undiscounted cases—by performing (2/3)n3+ O(n2) arithmetic operations, thus attaining better complexity than previous algorithms and matching that of solving a corresponding linear-equation system by Gaussian elimination.
openaire +2 more sources
Gittins, Norman Harvey, Singapore
This record was harvested from a previous catalogue system and will be withdrawn in 2025. Information in this record may be superseded or incomplete.
Australian Red Cross Society, National Office
core
Marginal productivity index policies for dynamic priority allocation in restless bandit models [PDF]
Esta tesis estudia tres complejos problemas dinámicos y estocásticos de asignación de recursos: (i) Enrutamiento y control de admisión con información retrasada, (ii) Promoción dinámica de productos y el Problema de la mochila para artículos perecederos,
Jacko, Peter
core
Computing a Classic Index for Finite-Horizon Bandits [PDF]
This paper considers the efficient exact computation of the counterpart of the Gittins index for a finite-horizon discrete-state bandit, which measures for each initial state the average productivity, given by the maximum ratio of expected total ...
José Niño-Mora
core +1 more source
Gittins, Mr H, [No Service Number]
This record was harvested from a previous catalogue system and will be withdrawn in 2025. Information in this record may be superseded or incomplete.
Australian Red Cross Society, National Office
core

