Results 91 to 100 of about 198,528 (317)

Model-Based Reinforcement Learning with Continuous States and Actions [PDF]

open access: yes, 2008
22.10.13 KB. Ok to add the published version to spiral. ESANNFinding an optimal policy in a reinforcement learning (RL) framework with continuous state and action spaces is challenging. Approximate solutions are often inevitable.
Rasmussen, Carl E   +7 more
core  

Zein‐Based Adhesives: Sustainable Extraction and Application in Bioadhesive Technologies

open access: yesAdvanced Engineering Materials, EarlyView.
Zein is extracted from corn gluten meal using a simple and scalable process with high yield (~90%). The resulting protein is applied in bioadhesives modified with Ca2+ and Fe3+ ions, exhibiting substrate‐dependent adhesion. The findings demonstrate competitive bonding performance and highlight the role of ionic interactions in tuning adhesion ...
Paula Bertolino Sanvezzo   +3 more
wiley   +1 more source

Deep Forest Reinforcement Learning for Preventive Strategy Considering Automatic Generation Control in Large-Scale Interconnected Power Systems

open access: yesApplied Sciences, 2018
To reduce occurrences of emergency situations in large-scale interconnected power systems with large continuous disturbances, a preventive strategy for the automatic generation control (AGC) of power systems is proposed.
Linfei Yin   +3 more
doaj   +1 more source

Probability Matching and Reinforcement Learning* [PDF]

open access: yes
Probability matching occurs when an action is chosen with a frequency equivalent to the probability of that action being the best choice. This sub-optimal behavior has been reported repeatedly by psychologist and experimental economist.
Javier Rivas
core  

Additive Gaussian Process Regression for Predictive Design of High‐Performance, Printable Silicones

open access: yesAdvanced Engineering Materials, EarlyView.
A chemistry‐aware design framework for tuning printable polydimethylsiloxane (PDMS) for vat photopolymerization (VPP) is developed using additive Gaussian process (GP) modeling. Polymer network mechanics informs variable groupings, feasible formulation constraints, and interaction variables.
Roxana Carbonell   +3 more
wiley   +1 more source

Decentralized Bayesian reinforcement learning for online agent collaboration [PDF]

open access: yes, 2012
Solving complex but structured problems in a decentralized manner via multiagent collaboration has received much attention in recent years. This is natural, as on one hand, multiagent systems usually possess a structure that determines the allowable ...
Farinelli, A.   +15 more
core  

Thermodynamic Pathways of Nonequilibrium Solidification in Wire‐Arc Additive Manufacturing Fe‐Based Multicomponent Alloy Structures

open access: yesAdvanced Engineering Materials, EarlyView.
Geometry‐driven thermal behavior in wire‐arc additive manufacturing (WAAM) influences microstructural evolution during nonequilibrium solidification of a chemically complex Fe–Cr–Nb–W–Mo–C nanocomposite system. By comparing different deposits configurations, distinct entropy–cooling rate correlations, segregation, and carbide evolution are revealed ...
Blanca Palacios   +5 more
wiley   +1 more source

A Simplified Laminar Flow Model for the Pultrusion of Glass Fiber/Polyethylene Terephthalate Commingled Yarns

open access: yesAdvanced Engineering Materials, EarlyView.
A simplified thermoplastic pultrusion model is developed to predict thermal fields in glass fiber/polyethylene terephthalate (GF/PET) composites with reduced computational cost. By combining effective material homogenization, validation against literature data, and Gaussian‐process‐based optimization, the study reveals how heating limits, pulling speed,
Elder Soares   +3 more
wiley   +1 more source

Multiagent Deep Reinforcement Learning Algorithms in StarCraft II: A Review

open access: yesIEEE Access
StarCraft II, as a real-time strategy game, features multiagent collaboration, complex decision-making processes, partially observable environments, and long-term credit assignment; thus, it is an ideal platform for exploring, validating, and optimizing ...
Yanyan Li, Yijun Wang, Yiwei Zhou
doaj   +1 more source

Using reinforcement learning to coordinate better

open access: yes, 2005
This paper examines the potential and the impact of introducing learning capabilities into autonomous agents that make decisions at run-time about which mechanism to exploit in order to coordinate their activities. Specifically, our motivating hypothesis
Jennings, N. R.   +5 more
core   +1 more source

Home - About - Disclaimer - Privacy