Results 21 to 30 of about 3,394 (246)
Tabular and Deep Learning of Whittle Index
International audience- Whittle index policy is an asymptotically optimal heuristic for solving Restless Multi-Armed Bandit Problems (RMBAP). - We propose two algorithms, QWI and QWINN, for the computation of such indices.
Ayesta, Urtzi +4 more
core +4 more sources
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
Restless multi-armed bandits (RMABs) extend multi-armed bandits to allow for stateful arms, where the state of each arm evolves restlessly with different transitions depending on whether that arm is pulled.
Taneja, Aparna +3 more
core +2 more sources
Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes
We study the Whittle index learning algorithm for restless multi-armed bandits. We consider index learning algorithm with Q-learning. We first present Q-learning algorithm with exploration policies -- epsilon-greedy, softmax, epsilon-softmax with ...
Meshram, Rahul +2 more
core +2 more sources
Whittle indexability in egalitarian processor sharing systems [PDF]
27 pages, 6 ...
Vivek S. Borkar, Sarath Pattathil
openaire +2 more sources
On the Global Optimality of Whittle’s Index Policy for Minimizing the Age of Information [PDF]
This paper examines the average age minimization problem where only a fraction of the network users can transmit simultaneously over unreliable channels. Finding the optimal scheduling scheme, in this case, is known to be challenging. Accordingly, the Whittle's index policy was proposed in the literature as a low-complexity heuristic to the problem ...
Saad Kriouile +2 more
openaire +2 more sources
Whittle Index Policy for Crawling Ephemeral Content [PDF]
We consider a task of scheduling a crawler to retrieve content from several siteswith ephemeral content. A user typically loses interest in ephemeral content,like news or posts at social network groups, after several days or hours.Thus, development of ...
Avrachenkov, Konstantin +1 more
core +3 more sources
NeurWIN: Neural Whittle Index Network for Restless Bandits via Deep RL [PDF]
Whittle index policy is a powerful tool to obtain asymptotically optimal solutions for the notoriously intractable problem of restless bandits. However, finding the Whittle indices remains a difficult problem for many practical restless bandits with ...
Nakhleh, Khaled Jamal Khader
core
The problem of rested and restless multi-armed bandits with constrained availability (RMAB-CA) of arms is considered. The states of arms evolve in Markovian manner and the exact states are hidden from the decision maker. First, some structural results on
Varun Mehta +4 more
doaj +1 more source
In dense millimeter wave (mmWave) networks, user association, i.e., the task of selecting the access point (AP) that each arriving user should join, significantly impacts the network performance.
Ravindra S. Tomar +2 more
doaj +1 more source
User Association in the Presence of Jamming in Wireless Networks Using the Whittle Index
In wireless networks, algorithms for user association, i.e., the task of choosing the base station (BS) that every arriving user should join, significantly impact the network performance. A wireless network with multiple BSs, operating on non-overlapping
Pramod N. Chine +3 more
doaj +1 more source

