Multi-armed bandit - Open Access .click

LEO satellite constellations: a two-stage beamforming scheme based on CMAB

Chinese Journal on Internet of Things (物联网学报)
For the beamforming design problem of frequency-division duplex (FDD) massive multiple-input multiple-output (MIMO) low earth orbit (LEO) satellite constellations, a two-stage beamforming (TSB) scheme based on combinatorial multi-armed bandit (CMAB) was
SONG Xinting +4 more
doaj

Characterization and computation of restless bandit marginal productivity indices [PDF]

The Whittle index [P. Whittle (1988). Restless bandits: Activity allocation in a changing world. J. Appl. Probab. 25A, 287-298] yields a practical scheduling rule for the versatile yet intractable multi-armed restless bandit problem, involving the ...
Jose Nino-Mora
core

BanditMF: Multi-Armed Bandit Based Matrix Factorization Recommender System

2022
Multi-armed bandits (MAB) provide a principled online learning approach to attain the balance between exploration and exploitation. Due to the superior performance and low feedback learning without the learning to act in multiple situations, Multi-armed ...
Xu, Shenghao
core

Harnessing nonlinear optoelectronic oscillator for speeding up reinforcement learning

PhotoniX
Reinforcement learning is an indispensable branch of artificial intelligence (AI), referring to the technology and methods of maximizing the rewards from an uncertain environment.
Ziwei Xu +7 more
doaj +1 more source

STUDIES ON OPTIMAL STOPPING PROBLEMS FOR MULTI-ARMED BANDIT PROCESSES

1. Preface; 2. The optimal stopping problem for multi-armed bandit processes; 3. The optimal stopping problem for multi-armed diffusion bandit processes; 4. The multi-armed bandit ...
Yoshida, Yuji (吉田 祐治) +2 more
core

Solving multi-armed bandit problems using a chaotic microresonator comb

APL Photonics
The Multi-Armed Bandit (MAB) problem, foundational to reinforcement learning-based decision-making, addresses the challenge of maximizing rewards amid multiple uncertain choices.
Jonathan Cuevas +4 more
doaj +1 more source

Multi-Armed Bandit Models for Efficient Long-Term Information Collection in Wireless Sensor Networks

We are entering a new age in the evolution of computer systems, in which pervasive computing technologies seamlessly interact with human users. These technologies serve people in their everyday lives at home and work by functioning invisibly in the ...
Tran-Thanh, Long
core

Gateway Selection in Millimeter Wave UAV Wireless Networks Using Multi-Player Multi-Armed Bandit. [PDF]

Sensors (Basel), 2020
Mohamed EM +4 more
europepmc +1 more source

Dynamic channel selection in wireless communications via a multi-armed bandit algorithm using laser chaos time series. [PDF]

Sci Rep, 2020
Takeuchi S +5 more
europepmc +1 more source

[News Clip: Blonde bandit]

1958
Video footage from the WBAP-TV station in Fort Worth, Texas to accompany a news story about Bonnie Moore, the DFW area's "Blonde Bandit", pleading guilty to charges of armed robbery and forgery in ...
WBAP-TV (Television station : Fort Worth, Tex.)
core
