Results 61 to 70 of about 3,144 (205)

A multi-armed bandit approach for exploring partially observed networks

open access: yesApplied Network Science, 2019
Background real-world networks such as social and communication networks are too large to be observed entirely. Such networks are often partially observed such that network size, network topology, and nodes of the original network are unknown.
Kaushalya Madhawa, Tsuyoshi Murata
doaj   +1 more source

Neighbor Cell List Optimization in Handover Management Using Cascading Bandits Algorithm

open access: yesIEEE Access, 2020
Frequent handover is a key challenge in 5G Ultra-Dense Networks (UDN). In this paper, we show the significance of configuring Neighbor Cell List (NCL) in handover procedure.
Chao Wang   +5 more
doaj   +1 more source

Modified Index Policies for Multi-Armed Bandits with Network-like Markovian Dependencies

open access: yesNetwork
Sequential decision-making in dynamic and interconnected environments is a cornerstone of numerous applications, ranging from communication networks and finance to distributed blockchain systems and IoT frameworks. The multi-armed bandit (MAB) problem is
Abdalaziz Sawwan, Jie Wu
doaj   +1 more source

On Penalization in Stochastic Multi-Armed Bandits

open access: yesIEEE Transactions on Information Theory
We study an important variant of the stochastic multi-armed bandit (MAB) problem, which takes penalization into consideration. Instead of directly maximizing cumulative expected reward, we need to balance between the total reward and fairness level.
Guanhua Fang   +2 more
openaire   +2 more sources

Multi-armed Bandits with Missing Outcome

open access: yesCoRR
38 pages, 5 figures, multi-armed bandits, missing ...
Ilia Mahrooghi   +3 more
openaire   +3 more sources

Scaling Multi-Armed Bandit Algorithms

open access: yesProceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019
The Multi-Armed Bandit (MAB) is a fundamental model capturing the dilemma between exploration and exploitation in sequential decision making. At every time step, the decision maker selects a set of arms and observes a reward from each of the chosen arms. In this paper, we present a variant of the problem, which we call the Scaling MAB (S-MAB): The goal
Fouché, E., Komiyama, J., Böhm, K.
openaire   +2 more sources

NeIL: Intelligent Replica Selection for Distributed Applications

open access: yesIEEE Transactions on Machine Learning in Communications and Networking
Distributed applications such as cloud gaming, streaming, etc., are increasingly using edge-to-cloud infrastructure for high availability and performance.
Faraz Ahmed   +3 more
doaj   +1 more source

Imprecise Multi-Armed Bandits

open access: yesCoRR
We introduce a novel multi-armed bandit framework, where each arm is associated with a fixed unknown credal set over the space of outcomes (which can be richer than just the reward). The arm-to-credal-set correspondence comes from a known class of hypotheses. We then define a notion of regret corresponding to the lower prevision defined by these credal
openaire   +2 more sources

Causally Abstracted Multi-armed Bandits

open access: yesCoRR
8 pages, 3 figures (main article); 20 pages, 10 figures (appendix); 40th Conference on Uncertainty in Artificial Intelligence (UAI)
Fabio Massimo Zennaro   +6 more
openaire   +5 more sources

Home - About - Disclaimer - Privacy