Results 91 to 100 of about 241,763 (280)

A Q‐Learning Algorithm to Solve the Two‐Player Zero‐Sum Game Problem for Nonlinear Systems

open access: yesInternational Journal of Adaptive Control and Signal Processing, Volume 39, Issue 3, Page 566-581, March 2025.
A Q‐learning algorithm to solve the two‐player zero‐sum game problem for nonlinear systems. ABSTRACT This paper deals with the two‐player zero‐sum game problem, which is a bounded L2$$ {L}_2 $$‐gain robust control problem. Finding an analytical solution to the complex Hamilton‐Jacobi‐Issacs (HJI) equation is a challenging task.
Afreen Islam   +2 more
wiley   +1 more source

Scheduling Bi-Objective Lot-Streaming Hybrid Flow Shops with Consistent Sublots via an Enhanced Artificial Bee Colony Algorithm

open access: yesComplex System Modeling and Simulation
This work addresses bi-objective hybrid flow shop scheduling problems considering consistent sublots (Bi-HFSP_CS). The objectives are to minimize the makespan and total energy consumption.
Benxue Lu   +3 more
doaj   +1 more source

Speedy Q-learning

open access: yes, 2011
We introduce a new convergent variant of Q-learning, called speedy Q-learning, to address the problem of slow convergence in the standard form of the Q-learning algorithm. We prove a PAC bound on the performance of SQL, which shows that for an MDP with n state-action pairs and the discount factor γ only T = O(log(n)/(ε^2 (1 - γ)^4)) steps are required ...
Azar, Mohammad Gheshlaghi   +3 more
openaire   +2 more sources

What Do Large Language Models Know About Materials?

open access: yesAdvanced Engineering Materials, EarlyView.
If large language models (LLMs) are to be used inside the material discovery and engineering process, they must be benchmarked for the accurateness of intrinsic material knowledge. The current work introduces 1) a reasoning process through the processing–structure–property–performance chain and 2) a tool for benchmarking knowledge of LLMs concerning ...
Adrian Ehrenhofer   +2 more
wiley   +1 more source

A Workflow to Accelerate Microstructure‐Sensitive Fatigue Life Predictions

open access: yesAdvanced Engineering Materials, EarlyView.
This study introduces a workflow to accelerate predictions of microstructure‐sensitive fatigue life. Results from frameworks with varying levels of simplification are benchmarked against published reference results. The analysis reveals a trade‐off between accuracy and model complexity, offering researchers a practical guide for selecting the optimal ...
Luca Loiodice   +2 more
wiley   +1 more source

A Reinforcement Learning Approach for Smart Farming [PDF]

open access: yesDatabase Systems Journal, 2019
At a basic level, the aim of machine learning is to develop solutions for real-life engineering problems and to enhance the performance of different computers tasks in order to obtain an algorithm that is highly independent of human intervention.
Gabriela ENE
doaj  

Q-learning with Nearest Neighbors

open access: yes, 2018
We consider model-free reinforcement learning for infinite-horizon discounted Markov Decision Processes (MDPs) with a continuous state space and unknown transition kernel, when only a single sample path under an arbitrary policy of the system is ...
Shah, Devavrat, Xie, Qiaomin
core  

Zein‐Based Adhesives: Sustainable Extraction and Application in Bioadhesive Technologies

open access: yesAdvanced Engineering Materials, EarlyView.
Zein is extracted from corn gluten meal using a simple and scalable process with high yield (~90%). The resulting protein is applied in bioadhesives modified with Ca2+ and Fe3+ ions, exhibiting substrate‐dependent adhesion. The findings demonstrate competitive bonding performance and highlight the role of ionic interactions in tuning adhesion ...
Paula Bertolino Sanvezzo   +3 more
wiley   +1 more source

Reinforcement Learning-Based Autonomous Soccer Agents: A Study in Multi-Agent Coordination and Strategy Development

open access: yesBuana Information Technology and Computer Sciences
Reinforcement learning (RL) approaches, particularly Q-learning, have emerged as strong tools for autonomous agent training, allowing agents to acquire optimum decision-making rules through interaction with their surroundings.
Biplov Paneru   +3 more
doaj   +1 more source

Additive Gaussian Process Regression for Predictive Design of High‐Performance, Printable Silicones

open access: yesAdvanced Engineering Materials, EarlyView.
A chemistry‐aware design framework for tuning printable polydimethylsiloxane (PDMS) for vat photopolymerization (VPP) is developed using additive Gaussian process (GP) modeling. Polymer network mechanics informs variable groupings, feasible formulation constraints, and interaction variables.
Roxana Carbonell   +3 more
wiley   +1 more source

Home - About - Disclaimer - Privacy