Reinforcement learning - Open Access .click

Results 81 to 90 of about 198,528 (317)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents

Tongxin xuebao, 2023
To solve the problem that intelligent devices equipped with deep reinforcement learning agents lack effective security data sharing mechanisms in the intelligent Internet of things, a general federated reinforcement learning (GenFedRL) framework was ...
Biao JIN +4 more
doaj +2 more sources

Munchausen Reinforcement Learning

CoRR, 2020
NeurIPS 2020.
Vieillard, Nino, Pietquin, Olivier, Geist, Matthieu +2 more
openaire +4 more sources

Artificial Intelligence in Systemic Sclerosis: Clinical Applications, Challenges, and Future Directions

Arthritis Care &Research, EarlyView.
Systemic sclerosis (SSc) is a rare autoimmune disease defined by immune dysregulation, vasculopathy, and progressive fibrosis of the skin and internal organs. Despite advances in care, major complications such as interstitial lung disease (ILD) and myocardial involvement remain the leading causes of morbidity and mortality.
Cristiana Sieiro Santos +2 more
wiley +1 more source

Comparative Analysis of Energy Management Strategies for HEV: Dynamic Programming and Reinforcement Learning

IEEE Access, 2020
Energy management strategy is an important factor in determining the fuel economy of hybrid electric vehicles; thus, much research on how to distribute the required power to engines and motors of hybrid vehicles is required.
Heeyun Lee +3 more
doaj +1 more source

Quantum Reinforcement Learning [PDF]

, 2005
A novel quantum reinforcement learning is proposed through combining quantum theory and reinforcement learning. Inspired by state superposition principle, a framework of state value update algorithm is introduced. The state/action value is represented with quantum state and the probability of action eigenvalue is denoted by probability amplitude, which
Daoyi Dong, Chunlin Chen 0001, Zonghai Chen +2 more
openaire +1 more source

A Q‐Learning Algorithm to Solve the Two‐Player Zero‐Sum Game Problem for Nonlinear Systems

International Journal of Adaptive Control and Signal Processing, Volume 39, Issue 3, Page 566-581, March 2025.
A Q‐learning algorithm to solve the two‐player zero‐sum game problem for nonlinear systems. ABSTRACT This paper deals with the two‐player zero‐sum game problem, which is a bounded L2$$ {L}_2 $$‐gain robust control problem. Finding an analytical solution to the complex Hamilton‐Jacobi‐Issacs (HJI) equation is a challenging task.
Afreen Islam, Anthony Siming Chen, Guido Herrmann +2 more
wiley +1 more source

Optimizing Reinforcement Learning Using a Generative Action-Translator Transformer

Algorithms
In recent years, with the rapid advancements in Natural Language Processing (NLP) technologies, large models have become widespread. Traditional reinforcement learning algorithms have also started experimenting with language models to optimize training ...
Jiaming Li, Ning Xie, Tingting Zhao
doaj +1 more source

Reinforcement learning

Astronomy and Computing
To appear, Astronomy & ...
openaire +2 more sources

Characterization of Defect Distribution in an Additively Manufactured AlSi10Mg as a Function of Processing Parameters and Correlations with Extreme Value Statistics

Advanced Engineering Materials, EarlyView.
Predicting extreme defects in additive manufacturing remains a key challenge limiting its structural reliability. This study proposes a statistical framework that integrates Extreme Value Theory with advanced process indicators to explore defect–process relationships and improve the estimation of critical defect sizes. The approach provides a basis for
Muhammad Muteeb Butt +8 more
wiley +1 more source

What Do Large Language Models Know About Materials?

Advanced Engineering Materials, EarlyView.
If large language models (LLMs) are to be used inside the material discovery and engineering process, they must be benchmarked for the accurateness of intrinsic material knowledge. The current work introduces 1) a reasoning process through the processing–structure–property–performance chain and 2) a tool for benchmarking knowledge of LLMs concerning ...
Adrian Ehrenhofer +2 more
wiley +1 more source

fos: computer and information sciences
machine learning cs.lg
computer science - machine learning

artificial intelligence
artificial intelligence cs.ai
machine learning

deep reinforcement learning
statistics - machine learning
computer science - artificial intelligence