Results 1 to 10 of about 5,586 (174)

Risk-aware multi-armed bandit problem with application to portfolio selection. [PDF]

open access: yesR Soc Open Sci, 2017
Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem ...
Huo X, Fu F.
europepmc   +4 more sources

Decision making for large-scale multi-armed bandit problems using bias control of chaotic temporal waveforms in semiconductor lasers. [PDF]

open access: yesSci Rep, 2022
Decision making using photonic technologies has been intensively researched for solving the multi-armed bandit problem, which is fundamental to reinforcement learning.
Morijiri K   +4 more
europepmc   +2 more sources

Re-Learning EXP3 Multi-Armed Bandit Algorithm for Enhancing the Massive IoT-LoRaWAN Network Performance. [PDF]

open access: yesSensors (Basel), 2022
Long-Range Wide Area Network (LoRaWAN) is an open-source protocol for the standard Internet of Things (IoT) Low Power Wide Area Network (LPWAN). This work’s focal point is the LoRa Multi-Armed Bandit decentralized decision-making solution.
Almarzoqi SA, Yahya A, Matar Z, Gomaa I.
europepmc   +2 more sources

Some performance considerations when using multi-armed bandit algorithms in the presence of missing data. [PDF]

open access: yesPLoS One, 2022
When comparing the performance of multi-armed bandit algorithms, the potential impact of missing data is often overlooked. In practice, it also affects their implementation where the simplest approach to overcome this is to continue to sample according ...
Chen X, Lee KM, Villar SS, Robertson DS.
europepmc   +2 more sources

Multi-Armed Bandit-Based User Network Node Selection. [PDF]

open access: yesSensors (Basel)
In the scenario of an integrated space–air–ground emergency communication network, users encounter the challenge of rapidly identifying the optimal network node amidst the uncertainty and stochastic fluctuations of network states. This study introduces a
Gao Q, Xie Z.
europepmc   +2 more sources

Reinforcement learning framework for computerized adaptive testing using multi armed bandit approach. [PDF]

open access: yesSci Rep
This research introduces a new framework based on the combination of deep learning and reinforcement learning techniques for computer-based adaptive testing that overcomes the limitations of traditional methods for test preparation.
Tang B, Li S, Zhao C.
europepmc   +2 more sources

Design of Multi-Armed Bandit-Based Routing for in-Network Caching

open access: yesIEEE Access, 2023
This paper proposes a multi-armed bandit-based routing method for in-network cache-enabled networks. In-network caching enables intermediate routers to store contents in their cache, which is adopted in new networking paradigms such as Information ...
Gen Tabei   +3 more
doaj   +1 more source

Multi-objective contextual bandits in recommendation systems for smart tourism. [PDF]

open access: yesSci Rep
In the context of smart tourism, recommender systems play a pivotal role in enhancing the personalization and quality of travel experiences. Tourists often face challenges in decision-making due to information overload.
Qassimi S, Rakrak S.
europepmc   +2 more sources

Multi-Armed-Bandit Based Channel Selection Algorithm for Massive Heterogeneous Internet of Things Networks

open access: yesApplied Sciences, 2022
In recent times, the number of Internet of Things devices has increased considerably. Numerous Internet of Things devices generate enormous traffic, thereby causing network congestion and packet loss.
So Hasegawa   +6 more
doaj   +1 more source

Home - About - Disclaimer - Privacy