Results 11 to 20 of about 3,144 (205)
Multi-class boosting for the analysis of multiple incomplete views on microbiome data. [PDF]
Background Microbiome dysbiosis has recently been associated with different diseases and disorders. In this context, machine learning (ML) approaches can be useful either to identify new patterns or learn predictive models.
Simeon A +5 more
europepmc +2 more sources
Thompson Sampling for Non-Stationary Bandit Problems. [PDF]
Non-stationary multi-armed bandit (MAB) problems have recently attracted extensive attention. We focus on the abruptly changing scenario where reward distributions remain constant for a certain period and change at unknown time steps.
Qi H, Guo F, Zhu L.
europepmc +2 more sources
Mating with Multi-Armed Bandits: Reinforcement Learning Models of Human Mate Search. [PDF]
Conroy-Beam D.
europepmc +2 more sources
Ballooning Multi-Armed Bandits [PDF]
We introduce ballooning multi-armed bandits (BL-MAB), a novel extension to the classical stochastic MAB model. In the BL-MAB model, the set of available arms grows (or balloons) over time. The regret in a BL-MAB setting is computed with respect to the best available arm at each time.
Ganesh Ghalme +4 more
openaire +3 more sources
Multi-Armed Bandits With Correlated Arms [PDF]
We consider a multi-armed bandit framework where the rewards obtained by pulling different arms are correlated. We develop a unified approach to leverage these reward correlations and present fundamental generalizations of classic bandit algorithms to the correlated setting.
Samarth Gupta +3 more
openaire +2 more sources
Federated multi-armed bandits (FMAB) is a new bandit paradigm that parallels the federated learning (FL) framework in supervised learning. It is inspired by practical applications in cognitive radio and recommender systems, and enjoys features that are analogous to FL.
Chengshuai Shi, Cong Shen 0001
openaire +2 more sources
Modelling Cournot Games as Multi-agent Multi-armed Bandits
We investigate the use of a multi-agent multi-armed bandit (MA-MAB) setting for modeling repeated Cournot oligopoly games, where the firms acting as agents choose from the set of arms representing production quantity (a discrete value).
Kshitija Taywade +3 more
doaj +1 more source
The multi-armed bandit, with constraints [PDF]
The colorfully-named and much-studied multi-armed bandit is the following Markov decision problem: At epochs 1, 2, ... , a decision maker observes the current state of each of several Markov chains with rewards (bandits) and plays one of them. The Markov chains that are not played remain in their current states.
Eric V. Denardo +2 more
openaire +2 more sources
The Assistive Multi-Armed Bandit [PDF]
Learning preferences implicit in the choices humans make is a well studied problem in both economics and computer science. However, most work makes the assumption that humans are acting (noisily) optimally with respect to their preferences. Such approaches can fail when people are themselves learning about what they want. In this work, we introduce the
Lawrence Chan +3 more
openaire +2 more sources
Multi-Armed Bandits for Spectrum Allocation in Multi-Agent Channel Bonding WLANs
While dynamic channel bonding (DCB) is proven to boost the capacity of wireless local area networks (WLANs) by adapting the bandwidth on a per-frame basis, its performance is tied to the primary and secondary channel selection.
Sergio Barrachina-Munoz +2 more
doaj +1 more source

