Restless multi-armed bandits

Rested and Restless Bandits With Constrained Arms and Hidden States: Applications in Social Networks and 5G Networks

IEEE Access, 2018
The problem of rested and restless multi-armed bandits with constrained availability (RMAB-CA) of arms is considered. The states of arms evolve in a Markovian manner and the exact states are hidden from the decision maker. First, some structural results on ...
Varun Mehta +4 more

Restless Multi-Armed Bandits under Exogenous Global Markov Process

2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
Accepted for presentation at IEEE ICASSP 2022.
Tomer Gafni, Michal Yemini, Kobi Cohen +2 more

Constrained Restless Bandits for Dynamic Scheduling in Cyber-Physical Systems

IEEE Access
This paper develops a sequential decision-making framework called constrained restless multi-armed bandits (CRMABs) to model problems of resource allocation under uncertainty and dynamic availability constraints.
Kesav Ram Kaza +3 more

IRL for Restless Multi-armed Bandits with Applications in Maternal and Child Health

Lecture Notes in Computer Science
Public health practitioners often have the goal of monitoring patients and maximizing patients' time spent in "favorable" or healthy states while being constrained to using limited resources. Restless multi-armed bandits (RMAB) are an effective model to solve this problem as they are helpful to allocate limited resources among many agents under ...
Pradeep Varakantham, Aparna Taneja, Prashant Doshi +2 more

Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making

2024 IEEE 63rd Conference on Decision and Control (CDC)
This paper introduces a novel multi-armed bandits framework, termed Contextual Restless Bandits (CRB), for complex online decision-making. This CRB framework incorporates the core features of contextual bandits and restless bandits, so that it can model both the internal state transitions of each arm and the influence of external global environmental ...
I-Hong Hou

Networked Restless Multi-Armed Bandits for Mobile Interventions

International Joint Conference on Autonomous Agents and Multiagent Systems, 2022
Motivated by a broad class of mobile intervention problems, we propose and study restless multi-armed bandits (RMABs) with network effects. In our model, arms are partially recharging and connected through a graph, so that pulling one arm also improves the state of neighboring arms, significantly extending the previously studied setting of fully ...
Han-Ching Ou +6 more

A Whittle Index Approach to Minimizing Age of Multi-Packet Information in IoT Network

IEEE Access, 2021
Age of information (AoI) captures the freshness of information and has been used broadly as an important performance metric in big data analytics in the Internet of Things (IoT).
Mianlong Chen, Kui Wu, Linqi Song

Avoiding Starvation of Arms in Restless Multi-Armed Bandits

International Joint Conference on Autonomous Agents and Multiagent Systems, 2023
Restless multi-armed bandits (RMAB) is a popular framework for optimizing performance with limited resources under uncertainty. It is an extremely useful model for monitoring beneficiaries (arms) and executing timely interventions using health workers (limited resources) to ensure optimal benefit in public health settings.
Dexun Li, Pradeep Varakantham

Towards Soft Fairness in Restless Multi-Armed Bandits

CoRR, 2022
Restless multi-armed bandits (RMAB) is a framework for allocating limited resources under uncertainty. It is an extremely useful model for monitoring beneficiaries and executing timely interventions to ensure maximum benefit in public health settings (e.g., ensuring patients take medicines in tuberculosis settings, ensuring pregnant mothers listen to ...
Dexun Li, Pradeep Varakantham

Distributed learning algorithm with synchronized epochs for dynamic spectrum access in unknown environment using multi-user restless multi-armed bandit

Journal of King Saud University: Computer and Information Sciences, 2022
Dynamic spectrum access using cognitive radio has many application areas like smart-grid, Internet of Things, and various other device-to-device communication paradigms.
Himanshu Agrawal, Krishna Asawa
