Results 51 to 60 of about 257,916 (275)

Coordinated Multi-Agent Imitation Learning [PDF]

open access: yes, 2017
We study the problem of imitation learning from demonstrations of multiple coordinating agents. One key challenge in this setting is that learning a good model of coordination can be difficult, since coordination is often implicit in the demonstrations ...
Carr, Peter   +3 more
core   +2 more sources

An Adaptive Human Pilot Model With Reaction Time Delay for Enhanced Adaptive Control in Piloted Systems

open access: yesInternational Journal of Adaptive Control and Signal Processing, EarlyView.
This work introduces an adaptive human pilot model that captures pilot time‐delay effects in adaptive control systems. The model enables the prediction of pilot–controller interactions, facilitating safer integration and improved design of adaptive controllers for piloted applications.
Abdullah Habboush, Yildiray Yildiz
wiley   +1 more source

Development of a causal model of self-regulated learning by students at Loei Rajabhat University

open access: yesFrontiers in Education
IntroductionSelf-regulated learning is an active process in which learners employ self-directed behaviors, thoughts, and actions to attain learning objectives.
Anuphum Kumyoung   +4 more
doaj   +1 more source

Predictability of imitative learning trajectories [PDF]

open access: yesJournal of Statistical Mechanics: Theory and Experiment, 2019
The fitness landscape metaphor plays a central role on the modeling of optimizing principles in many research fields, ranging from evolutionary biology, where it was first introduced, to management research. Here we consider the ensemble of trajectories of the imitative learning search, in which agents exchange information on their fitness and imitate ...
Campos, Paulo R. A., Fontanari, José F.
openaire   +3 more sources

A Robust Adaptive One‐Sample‐Ahead Preview Super‐Twisting Sliding Mode Controller

open access: yesInternational Journal of Adaptive Control and Signal Processing, EarlyView.
Block Diagram of the Robust Adaptive One‐Sample‐Ahead Preview Super‐Twisting Sliding Mode Controller. ABSTRACT This article introduces a discrete‐time robust adaptive one‐sample‐ahead preview super‐twisting sliding mode controller. A stability analysis of the controller by Lyapunov criteria is developed to demonstrate its robustness in handling both ...
Guilherme Vieira Hollweg   +5 more
wiley   +1 more source

Self-Imitation Learning

open access: yes, 2018
This paper proposes Self-Imitation Learning (SIL), a simple off-policy actor-critic algorithm that learns to reproduce the agent's past good decisions. This algorithm is designed to verify our hypothesis that exploiting past good experiences can indirectly drive deep exploration.
Oh, Junhyuk   +3 more
openaire   +2 more sources

What Do Large Language Models Know About Materials?

open access: yesAdvanced Engineering Materials, EarlyView.
If large language models (LLMs) are to be used inside the material discovery and engineering process, they must be benchmarked for the accurateness of intrinsic material knowledge. The current work introduces 1) a reasoning process through the processing–structure–property–performance chain and 2) a tool for benchmarking knowledge of LLMs concerning ...
Adrian Ehrenhofer   +2 more
wiley   +1 more source

Imitation-Reinforcement Learning Penetration Strategy for Hypersonic Vehicle in Gliding Phase

open access: yesAerospace
To enhance the penetration capability of hypersonic vehicles in the gliding phase, an intelligent maneuvering penetration strategy combining imitation learning and reinforcement learning is proposed.
Lei Xu   +3 more
doaj   +1 more source

Proper Imitation, a Prerequisite for Creativity Imitative Learning in Architectural Education (Design Process) [PDF]

open access: yesصفه, 2017
According to general opinion, “imitation” is considered as the opposite of “creativity”. In architectural design, which is based on creativity, the same perspective is, likewise, prevalent to some degree.
Vahid Sadram
doaj  

Hybrid Reinforcement Learning with Expert State Sequences

open access: yes, 2019
Existing imitation learning approaches often require that the complete demonstration data, including sequences of actions and states, are available. In this paper, we consider a more realistic and difficult scenario where a reinforcement learning agent ...
Campbell, Murray   +4 more
core   +1 more source

Home - About - Disclaimer - Privacy