Combinatorial Multi-Armed Bandit with General Reward Functions [PDF]

open access: green, 2016
Wei Chen   +5 more
openalex   +1 more source

A quality assuring, cost optimal multi-armed bandit mechanism for expertsourcing

open access: green, 2017
Shweta Jain   +4 more
openalex   +2 more sources

Multi-Armed Bandit Problem with Temporally-Partitioned Rewards: When Partial Feedback Counts [PDF]

open access: bronze, 2022
Giulia Romano   +4 more
openalex   +1 more source

Learning Variable Ordering Heuristics with Multi-Armed Bandits and Restarts

open access: hybrid, 2020
Hugues Wattez   +4 more
openalex   +1 more source

Harnessing nonlinear optoelectronic oscillator for speeding up reinforcement learning

open access: yes, PhotoniX
Reinforcement learning is an indispensable branch of artificial intelligence (AI), referring to the technology and methods of maximizing the rewards from an uncertain environment.
Ziwei Xu   +7 more
doaj   +1 more source
