Restless multi-armed bandits - Open Access .click

Results 31 to 40 of about 257 (142)

A sensing policy based on confidence bounds and a restless multi-armed bandit model [PDF]

2012 Conference Record of the Forty Sixth Asilomar Conference on Signals, Systems and Computers (ASILOMAR), 2012
In proceedings of the 46th Asilomar conference ...
Jan Oksanen, Visa Koivunen, H. Vincent Poor +2 more
openaire +3 more sources

The non-Bayesian restless multi-armed bandit: A case of near-logarithmic regret [PDF]

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011
In the classic Bayesian restless multi-armed bandit (RMAB) problem, there are $N$ arms, with rewards on all arms evolving at each time as Markov chains with known parameters. A player seeks to activate $K \geq 1$ arms at each time in order to maximize the expected total reward obtained over multiple plays. RMAB is a challenging problem that is known to
Wenhan Dai +3 more
openaire +2 more sources

On the Whittle Index for Restless Multiarmed Hidden Markov Bandits [PDF]

IEEE Transactions on Automatic Control, 2018
We consider a restless multi-armed bandit in which each arm can be in one of two states. When an arm is sampled, the state of the arm is not available to the sampler. Instead, a binary signal with a known randomness that depends on the state of the arm is available. No signal is available if the arm is not sampled.
Rahul Meshram, D. Manjunath, Aditya Gopalan +2 more
openaire +5 more sources

Dynamic resource allocation in a multi-product make-to-stock production system [PDF]

, 2011
We consider optimal policies for a production facility in which several (K) products are made to stock in order to satisfy exogenous demand for each. The single machine version of this problem in which the facility manufactures at most one product at a ...
K. D. Glazebrook +3 more
core +1 more source

Equitable Restless Multi-Armed Bandits: A General Framework Inspired By Digital Health

CoRR, 2023
Restless multi-armed bandits (RMABs) are a popular framework for algorithmic decision making in sequential settings with limited resources. RMABs are increasingly being used for sensitive decisions such as in public health, treatment scheduling, anti-poaching, and -- the motivation for this work -- digital health.
Jackson A. Killian +5 more
openaire +2 more sources

Fairness of Exposure in Online Restless Multi-armed Bandits

International Joint Conference on Autonomous Agents and Multiagent Systems
Restless multi-armed bandits (RMABs) generalize the multi-armed bandits where each arm exhibits Markovian behavior and transitions according to their transition dynamics. Solutions to RMAB exist for both offline and online cases. However, they do not consider the distribution of pulls among the arms.
Archit Sood, Shweta Jain 0002, Sujit Gujar +2 more
openaire +3 more sources

An Asymptotically Optimal Heuristic for General Non-Stationary Finite-Horizon Restless Multi-Armed Multi-Action Bandits [PDF]

, 2017
We propose an asymptotically optimal heuristic, which we termed the Randomized Assignment Control (RAC) for restless multi-armed bandit problems with discrete-time and fi nite states.
Zayas-Caban, Gabriel +5 more
core +1 more source

Design of Centralized Scheduling Policies for Minimizing Average Age of Incorrect Information in Multi-Hop Wireless Networks With Interference

IEEE Access
This paper designs and evaluates novel centralized sampling and scheduling policies for minimizing the average age of incorrect information (AoII) for a multi-hop network with interference constraints.
Nibin Raj, Vineeth Bala Sukumaran
doaj +1 more source

An index heuristic for transshipment decisions in multi-location inventory systems based on a pairwise decomposition [PDF]

, 2009
In multi-location inventory systems, transshipments are often used to improve customer service and reduce cost. Determining optimal transshipment policies for such systems involves a complex optimisation problem that is only tractable for systems with ...
Archibald, T.; id_orcid +5 more
core +1 more source

Optimistic Whittle Index Policy: Online Learning for Restless Bandits

, 2023
Restless multi-armed bandits (RMABs) extend multi-armed bandits to allow for stateful arms, where the state of each arm evolves restlessly with different transitions depending on whether that arm is pulled.
Taneja, Aparna, Tambe, Milind, Xu, Lily, Wang, Kai +3 more
core +1 more source

fos: computer and information sciences
machine learning cs.lg
computer science - machine learning

artificial intelligence cs.ai
computer science - artificial intelligence
statistics - machine learning

machine learning stat.ml
3. good health
systems and control eess.sy