No-Regret Learning in Extensive-Form Games with Imperfect Recall [PDF]
Counterfactual Regret Minimization (CFR) is an efficient no-regret learning algorithm for decision problems modeled as extensive games. CFR's regret bounds depend on the requirement of perfect recall: players always remember information that was revealed ...
Marc Lanctot +5 more
core
Electroactive Liquid Crystal Elastomers as Soft Actuators
Electroactive liquid crystal elastomers (eLCEs) can be actuated via electromechanical, electrochemical, or electrothermal effects. a) Electromechanical effects include Maxwell stress, electrostriction, and the electroclinic effect. b) Electrochemical effects arise from electrode redox reactions.
Yakui Deng, Min‐Hui Li
wiley +1 more source
The study presents an antibiotic‐free strategy using medical fabrics coated with supramolecular assemblies of polyarginine and hyaluronic acid. These coatings showed strong antimicrobial and anti‐biofilm activity in vitro and in vivo, achieving major bacterial load reductions, including against MRSA.
Adjara Diarrassouba +18 more
wiley +1 more source
Collaborative Learning of Stochastic Bandits over a Social Network
We consider a collaborative online learning paradigm, wherein a group of agents connected through a social network are engaged in playing a stochastic multi-armed bandit game.
Gopalan, Aditya +2 more
core +1 more source
Regret in Dynamic Decision Problems [PDF]
The paper proposes a framework to extend regret theory to dynamic contexts. The key idea is to conceive of a dynamic decision problem with regret as an intra-personal game in which the agent forms conjectures about the behaviour of the various ...
Krähmer, Daniel; Stone, Rebecca
core +2 more sources
A single object with dual properties – degradable and non‐degradable – is fabricated in a single print simply by switching the printing colors. The advanced multi‐material printing is enabled by the combination of a fully wavelength‐orthogonal photoresin and a monochromatic tunable laser printer, paving the way for precise multi‐material ...
Xingyu Wu +5 more
wiley +1 more source
The Impatient May Use Limited Optimism to Minimize Regret
Discounted-sum games provide a formal model for the study of reinforcement learning, where the agent is enticed to get rewards early since later rewards are discounted.
B Aminof +15 more
core +1 more source
Detecting proteins secreted by a single cell while retaining its viability remains challenging. A particles‐in‐particle (PiPs) system made by co‐encapsulating barcoded microparticles (BMPs) with a single cell inside an alginate hydrogel particle is introduced.
Félix Lussier +10 more
wiley +1 more source
Action orientation, consistency and feelings of regret [PDF]
Previous research has demonstrated that consistency between people's behavior and their dispositions has predictive validity for judgments of regret. Research has also shown that differences in the personality variable of action orientation can influence ...
Todd McElroy, Keith Dowd
doaj
Pure Exploration for Multi-Armed Bandit Problems [PDF]
We consider the framework of stochastic multi-armed bandit problems and study the possibilities and limitations of forecasters that perform an on-line exploration of the arms.
Bubeck, Sébastien +2 more
core +4 more sources

