Q-learning - Open Access .click

Results 91 to 100 of about 212,484 (267)

A Simplified Laminar Flow Model for the Pultrusion of Glass Fiber/Polyethylene Terephthalate Commingled Yarns

Advanced Engineering Materials, EarlyView.
A simplified thermoplastic pultrusion model is developed to predict thermal fields in glass fiber/polyethylene terephthalate (GF/PET) composites with reduced computational cost. By combining effective material homogenization, validation against literature data, and Gaussian‐process‐based optimization, the study reveals how heating limits, pulling speed,
Elder Soares +3 more
wiley +1 more source

Reinforcement Learning-Based Autonomous Soccer Agents: A Study in Multi-Agent Coordination and Strategy Development

Buana Information Technology and Computer Sciences
Reinforcement learning (RL) approaches, particularly Q-learning, have emerged as strong tools for autonomous agent training, allowing agents to acquire optimum decision-making rules through interaction with their surroundings.
Biplov Paneru +3 more
doaj +1 more source

Multimodal Data‐Driven Microstructure Characterization

Advanced Engineering Materials, EarlyView.
A self‐consistent autonomous workflow for EBSP‐based microstructure segmentation by integrating PCA, GMM clustering, and cNMF with information‐theoretic parameter selection, requiring no user input. An optimal ROI size related to characteristic grain size is identified.
Qi Zhang +4 more
wiley +1 more source

Symbolic Regression and Multi‐Objective Optimization of the Flory–Huggins Interaction Parameter for Hydrogels

Advanced Engineering Materials, EarlyView.
We develop a data‐driven method to derive the mathematical expressions of the Flory–Huggins interaction parameter χ for the swelling behavior of temperature–responsive hydrogels. Starting from initial assumptions of χ, our workflow combines Bayesian optimization, Flory–Rehner theory, and symbolic regression to generate candidate χ expressions.
Yawen Wang +2 more
wiley +1 more source

Current Status and Challenges in Data Collection for Aerospace Coatings Deposited by Plasma Spraying

Advanced Engineering Materials, EarlyView.
An innovative approach has been integrated into the GRENAT project to optimize plasma spraying and coating performance. Raw materials are accelerated and melted in the plasma generated by torches, creating coatings. Monitoring sensors collect process data which are combined with ex situ characterization data.
Lila Randriamananjara +8 more
wiley +1 more source

Q-learning with Nearest Neighbors

CoRR, 2018
Accepted to NIPS ...
Shah, Devavrat, Xie, Qiaomin
openaire +3 more sources

Workflow for Design of Experiments‐Based Modeling of Species Transport and Growth Kinetics in GaN Hydride Vapor Phase Epitaxy

Advanced Engineering Materials, EarlyView.
A novel workflow for investigating hydride vapor phase epitaxy for GaN bulk crystal growth is proposed. It combines Design of experiments (DoE) with physical simulations of mass transport and crystal growth kinetics, serving as an intermediate step between DoE and experiments.
J. Tomkovič +7 more
wiley +1 more source

Achieving High ON State Current through Ferroelectric Polarization‐Dependent Interfacial Resistance Switching in Undoped Orthorhombic HfO2 Films

Advanced Functional Materials, EarlyView.
Ferroelectric tunnel junction devices based on epitaxial undoped ferroelectric HfO2 films demonstrate stable switching endurance of over 106 switching cycles, low write voltages of ±3 V, 16 measured resistance states, and neuromorphic capability.
Markus Hellenbrand +13 more
wiley +1 more source

Adaptive Q-Learning Grey Wolf Optimizer for UAV Path Planning

Drones
Path planning is crucial for safely and efficiently navigating unmanned aerial vehicles (UAVs) toward operational goals. Often, this is a complex, multi-constraint, and non-linear optimization problem, and metaheuristic algorithms are frequently used to ...
Golam Moktader Nayeem, Mingyu Fan, Golam Moktader Daiyan +2 more
doaj +1 more source

Multiagent Soft Q-Learning

CoRR, 2018
Policy gradient methods are often applied to reinforcement learning in continuous multiagent games. These methods perform local search in the joint-action space, and as we show, they are susceptable to a game-theoretic pathology known as relative overgeneralization.
Ermo Wei, Drew Wicke, David Freelan, Sean Luke +3 more
openaire +3 more sources

reinforcement learning
deep reinforcement learning
artificial intelligence

machine learning
path planning