Results 21 to 30 of about 3,394 (246)

Tabular and Deep Learning of Whittle Index

open access: yes, 2022
International audience- Whittle index policy is an asymptotically optimal heuristic for solving Restless Multi-Armed Bandit Problems (RMBAP). - We propose two algorithms, QWI and QWINN, for the computation of such indices.
Ayesta, Urtzi   +4 more
core   +4 more sources

Optimistic Whittle Index Policy: Online Learning for Restless Bandits

open access: yesProceedings of the AAAI Conference on Artificial Intelligence, 2023
Restless multi-armed bandits (RMABs) extend multi-armed bandits to allow for stateful arms, where the state of each arm evolves restlessly with different transitions depending on whether that arm is pulled.
Taneja, Aparna   +3 more
core   +2 more sources

Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes

open access: yes2024 IEEE 8th International Conference on Information and Communication Technology (CICT)
We study the Whittle index learning algorithm for restless multi-armed bandits. We consider index learning algorithm with Q-learning. We first present Q-learning algorithm with exploration policies -- epsilon-greedy, softmax, epsilon-softmax with ...
Meshram, Rahul   +2 more
core   +2 more sources

Whittle indexability in egalitarian processor sharing systems [PDF]

open access: yesAnnals of Operations Research, 2017
27 pages, 6 ...
Vivek S. Borkar, Sarath Pattathil
openaire   +2 more sources

On the Global Optimality of Whittle’s Index Policy for Minimizing the Age of Information [PDF]

open access: yesIEEE Transactions on Information Theory, 2022
This paper examines the average age minimization problem where only a fraction of the network users can transmit simultaneously over unreliable channels. Finding the optimal scheduling scheme, in this case, is known to be challenging. Accordingly, the Whittle's index policy was proposed in the literature as a low-complexity heuristic to the problem ...
Saad Kriouile   +2 more
openaire   +2 more sources

Whittle Index Policy for Crawling Ephemeral Content [PDF]

open access: yes, 2015
We consider a task of scheduling a crawler to retrieve content from several siteswith ephemeral content. A user typically loses interest in ephemeral content,like news or posts at social network groups, after several days or hours.Thus, development of ...
Avrachenkov, Konstantin   +1 more
core   +3 more sources

NeurWIN: Neural Whittle Index Network for Restless Bandits via Deep RL [PDF]

open access: yes, 2021
Whittle index policy is a powerful tool to obtain asymptotically optimal solutions for the notoriously intractable problem of restless bandits. However, finding the Whittle indices remains a difficult problem for many practical restless bandits with ...
Nakhleh, Khaled Jamal Khader
core  

Rested and Restless Bandits With Constrained Arms and Hidden States: Applications in Social Networks and 5G Networks

open access: yesIEEE Access, 2018
The problem of rested and restless multi-armed bandits with constrained availability (RMAB-CA) of arms is considered. The states of arms evolve in Markovian manner and the exact states are hidden from the decision maker. First, some structural results on
Varun Mehta   +4 more
doaj   +1 more source

User Association in Dense Millimeter Wave Networks With Multi-Channel Access Points Using the Whittle Index

open access: yesIEEE Open Journal of the Communications Society
In dense millimeter wave (mmWave) networks, user association, i.e., the task of selecting the access point (AP) that each arriving user should join, significantly impacts the network performance.
Ravindra S. Tomar   +2 more
doaj   +1 more source

User Association in the Presence of Jamming in Wireless Networks Using the Whittle Index

open access: yesIEEE Open Journal of Vehicular Technology
In wireless networks, algorithms for user association, i.e., the task of choosing the base station (BS) that every arriving user should join, significantly impact the network performance. A wireless network with multiple BSs, operating on non-overlapping
Pramod N. Chine   +3 more
doaj   +1 more source

Home - About - Disclaimer - Privacy