LEO satellite constellations: a two-stage beamforming scheme based on CMAB
For the beamforming design problem of frequency-division duplex (FDD) massive multiple-input multiple-output (MIMO) low earth orbit (LEO) satellite constellations, a two-stage beamforming (TSB) scheme based on combinatorial multi-armed bandit (CMAB) was
SONG Xinting +4 more
doaj
Characterization and computation of restless bandit marginal productivity indices
The Whittle index [P. Whittle (1988). Restless bandits: Activity allocation in a changing world. J. Appl. Probab. 25A, 287-298] yields a practical scheduling rule for the versatile yet intractable multi-armed restless bandit problem, involving the ...
Jose Nino-Mora
core
BanditMF: Multi-Armed Bandit Based Matrix Factorization Recommender System
Multi-armed bandits (MAB) provide a principled online learning approach to attain the balance between exploration and exploitation. Due to the superior performance and low feedback learning without the learning to act in multiple situations, Multi-armed ...
Xu, Shenghao
core
Harnessing nonlinear optoelectronic oscillator for speeding up reinforcement learning
Reinforcement learning is an indispensable branch of artificial intelligence (AI), referring to the technology and methods of maximizing the rewards from an uncertain environment.
Ziwei Xu +7 more
doaj +1 more source
STUDIES ON OPTIMAL STOPPING PROBLEMS FOR MULTI-ARMED BANDIT PROCESSES
Contents: 1. Preface; 2. The optimal stopping problem for multi-armed bandit processes; 3. The optimal stopping problem for multi-armed diffusion bandit processes; 4. The multi-armed bandit ...
ヨシダ, ユウジ +2 more
core
Solving multi-armed bandit problems using a chaotic microresonator comb
The Multi-Armed Bandit (MAB) problem, foundational to reinforcement learning-based decision-making, addresses the challenge of maximizing rewards amid multiple uncertain choices.
Jonathan Cuevas +4 more
doaj +1 more source
Multi-Armed Bandit Models for Efficient Long-Term Information Collection in Wireless Sensor Networks
We are entering a new age in the evolution of computer systems, in which pervasive computing technologies seamlessly interact with human users. These technologies serve people in their everyday lives at home and work by functioning invisibly in the ...
Tran-Thanh, Long
core
Gateway Selection in Millimeter Wave UAV Wireless Networks Using Multi-Player Multi-Armed Bandit
Mohamed EM +4 more
europepmc +1 more source
Dynamic channel selection in wireless communications via a multi-armed bandit algorithm using laser chaos time series
Takeuchi S +5 more
europepmc +1 more source
Video footage from the WBAP-TV station in Fort Worth, Texas to accompany a news story about Bonnie Moore, the DFW area's "Blonde Bandit", pleading guilty to charges of armed robbery and forgery in ...
WBAP-TV (Television station : Fort Worth, Tex.)
core

