Regret - Open Access .click

Results 291 to 300 of about 577,383 (337)

Correction: Shape programming of liquid crystal elastomers by two-stage wavelength-selective photopolymerization.

Mater Horiz
Bruining T, Tomé DR, Liu D.
europepmc +1 more source

Some of the next articles are maybe not open access.

Related searches:

computer science
machine learning
mathematics

psychology
statistics
economics

mathematical optimization
artificial intelligence
biology

Optimistic posterior sampling for reinforcement learning: worst-case regret bounds

Neural Information Processing Systems, 2022
We present an algorithm based on posterior sampling (aka Thompson sampling) that achieves near-optimal worst-case regret bounds when the underlying Markov decision process (MDP) is communicating with a finite, although unknown, diameter.
Shipra Agrawal, Randy Jia
semanticscholar +1 more source

computer science
machine learning
mathematics

psychology
statistics
economics

mathematical optimization
artificial intelligence
biology

previous 28 29 30 31 32 next