Results 131 to 140 of about 257 (142)
Some of the next articles are maybe not open access.

Whittle index based Q-learning for restless bandits with average reward

Automatica, 2022
Konstantin Avrachenkov, Vivek S Borkar
exaly  

Approximation algorithms for restless bandit problems

Journal of the ACM, 2010
Sudipto Guha, Kamesh Munagala
exaly  

Introduction to Multi-Armed Bandits

Foundations and Trends in Machine Learning, 2019
Aleksandrs Slivkins
exaly  

Addressing Coupling in Restless Multi-Armed Bandits by Finetuning Whittle Index

2025 IEEE 21st International Conference on Automation Science and Engineering (CASE)
Yao Luan 0001, Ni Mu, Qing-Shan Jia
openaire   +1 more source

Continuous Multi-Armed Bandits and Multiparameter Processes

Annals of Probability, 1987
Avi Mandelbaum
exaly  

Multi-armed bandits with episode context

Annals of Mathematics and Artificial Intelligence, 2011
exaly  

Planning and Learning in Risk-Aware Restless Multi-Arm Bandit Problem.

CoRR
Nima Akbarzadeh   +2 more
openaire   +1 more source

Home - About - Disclaimer - Privacy