Results 1 to 10 of about 103 (96)
The problem of rested and restless multi-armed bandits with constrained availability (RMAB-CA) of arms is considered. The states of arms evolve in Markovian manner and the exact states are hidden from the decision maker. First, some structural results on
Varun Mehta +4 more
doaj +3 more sources
Restless Multi-Armed Bandits under Exogenous Global Markov Process
Accepted for presentation at IEEE ICASSP 2022.
Tomer Gafni +2 more
exaly +3 more sources
Constrained Restless Bandits for Dynamic Scheduling in Cyber-Physical Systems
This paper develops a sequential decision-making framework called constrained restless multi-armed bandits (CRMABs) to model problems of resource allocation under uncertainty and dynamic availability constraints.
Kesav Ram Kaza +3 more
doaj +3 more sources
IRL for Restless Multi-armed Bandits with Applications in Maternal and Child Health
Public health practitioners often have the goal of monitoring patients and maximizing patients' time spent in "favorable" or healthy states while being constrained to using limited resources. Restless multi-armed bandits (RMAB) are an effective model to solve this problem as they are helpful to allocate limited resources among many agents under ...
Pradeep Varakantham +2 more
exaly +4 more sources
Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making
This paper introduces a novel multi-armed bandits framework, termed Contextual Restless Bandits (CRB), for complex online decision-making. This CRB framework incorporates the core features of contextual bandits and restless bandits, so that it can model both the internal state transitions of each arm and the influence of external global environmental ...
I-Hong Hou
exaly +3 more sources
Networked Restless Multi-Armed Bandits for Mobile Interventions [PDF]
Motivated by a broad class of mobile intervention problems, we propose and study restless multi-armed bandits (RMABs) with network effects. In our model, arms are partially recharging and connected through a graph, so that pulling one arm also improves the state of neighboring arms, significantly extending the previously studied setting of fully ...
Han-Ching Ou +6 more
openaire +2 more sources
A Whittle Index Approach to Minimizing Age of Multi-Packet Information in IoT Network
Age of information (AoI) captures the freshness of information and has been used broadly as an important performance metric in big data analytics in the Internet of Things (IoT).
Mianlong Chen, Kui Wu, Linqi Song
doaj +1 more source
Avoiding Starvation of Arms in Restless Multi-Armed Bandits
Restless multi-armed bandits (RMAB) is a popular framework for optimizing performance with limited resources under uncertainty. It is an extremely useful model for monitoring beneficiaries (arms) and executing timely interventions using health workers (limited resources) to ensure optimal benefit in public health settings.
LI, Dexun, VARAKANTHAM, Pradeep
openaire +2 more sources
Towards Soft Fairness in Restless Multi-Armed Bandits
Restless multi-armed bandits (RMAB) is a framework for allocating limited resources under uncertainty. It is an extremely useful model for monitoring beneficiaries and executing timely interventions to ensure maximum benefit in public health settings (e.g., ensuring patients take medicines in tuberculosis settings, ensuring pregnant mothers listen to ...
Dexun Li, Pradeep Varakantham
openaire +2 more sources
Dynamic spectrum access using cognitive radio has many application areas like smart-grid, Internet of Things, and various other device-to-device communication paradigms.
Himanshu Agrawal, Krishna Asawa
doaj +1 more source

