Results 101 to 110 of about 387,064 (296)

Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition

open access: yes, 1998
This paper presents the MAXQ approach to hierarchical reinforcement learning based on decomposing the target Markov decision process (MDP) into a hierarchy of smaller MDPs and decomposing the value function of the target MDP into an additive combination ...
Dietterich, Thomas G.
core   +6 more sources

TeXDYNA: Hierarchical Reinforcement Learning in Factored MDPs [PDF]

open access: yes, 2010
Reinforcement learning is one of the main adaptive mechanisms that is both well documented in animal behaviour and giving rise to computational studies in animats and robots. In this paper, we present TeXDYNA, an algorithm designed to solve large reinforcement learning problems with unknown structure by integrating hierarchical abstraction techniques ...
Kozlova, Olga   +2 more
openaire   +2 more sources

Strategies for Enhancing Thermal Conductivity of PDMS in Electronic Applications

open access: yesAdvanced Materials Technologies, EarlyView.
This review explores effective strategies for enhancing heat dissipation in Polydimethylsiloxane (PDMS)‐based composites, focusing on particle optimization, 3D network design, and multifunctional integration. It offers key insights into cutting‐edge methods and simulations that are advancing thermal management in modern electronic devices.
Xiang Yan, Marisol Martin‐Gonzalez
wiley   +1 more source

Hierarchical Imitation and Reinforcement Learning

open access: yes, 2018
Proceedings of the 35th International Conference on Machine Learning (ICML 2018)
Le, Hoang M.   +5 more
openaire   +3 more sources

3D Printing of Stretchable, Compressible and Conductive Porous Polyurethane for Soft Robotics

open access: yesAdvanced Materials Technologies, EarlyView.
A 3D‐printable porous dopamine‐polyurethane acrylate elastomer results in conductive, stretchable, and compressible structures that can be metallized in situ through catechol‐mediated silver reduction. The resulting material function as both compliant soft robot with a and strain sensors without complex assemblies, enabling fully 3D‐printed soft ...
Ouriel Bliah   +3 more
wiley   +1 more source

Online hierarchical reinforcement learning based on interrupting Option

open access: yesTongxin xuebao, 2016
Aiming at dealing with volume of big data,an on-line updating algorithm,named by Macro-Q with in-place updating (MQIU),which was based on Macro-Q algorithm and takes advantage of in-place updating approach,was proposed.The MQIU algorithm updates both the
Fei ZHU   +4 more
doaj   +2 more sources

Goal-oriented hierarchical reinforcement learning-based overlay multicast routing method with SDN integration

open access: yesTongxin xuebao
Addressing the limitations of traditional network architectures where coverage multicast lacks awareness of underlying physical network states and struggles to adapt to dynamic network changes, and recognising that existing mainstream reinforcement ...
Ye Miao   +6 more
doaj  

Flexible Sensor‐Based Human–Machine Interfaces with AI Integration for Medical Robotics

open access: yesAdvanced Robotics Research, EarlyView.
This review explores how flexible sensing technology and artificial intelligence (AI) significantly enhance human–machine interfaces in medical robotics. It highlights key sensing mechanisms, AI‐driven advancements, and applications in prosthetics, exoskeletons, and surgical robotics.
Yuxiao Wang   +5 more
wiley   +1 more source

Learning Highly Dynamic Skills Transition for Quadruped Jumping Through Constrained Space

open access: yesAdvanced Robotics Research, EarlyView.
A quadruped robot masters dynamic jumps through constrained spaces with animal‐inspired moves and intelligent vision control. This hierarchical learning approach combines imitation of biological agility with real‐time trajectory planning. Although legged animals are capable of performing explosive motions while traversing confined spaces, replicating ...
Zeren Luo   +6 more
wiley   +1 more source

Hierarchical deep reinforcement learning for self-adaptive economic dispatch

open access: yesHeliyon
It is challenging to accurately model the overall uncertainty of the power system when it is connected to large-scale intermittent generation sources such as wind and photovoltaic generation due to the inherent volatility, uncertainty, and indivisibility
Mengshi Li   +3 more
doaj   +1 more source

Home - About - Disclaimer - Privacy