Results 181 to 190 of about 3,129 (263)

TNCOA: Efficient Exploration via Observation‐Action Constraint on Trajectory‐Based Intrinsic Reward

open access: yesCAAI Transactions on Intelligence Technology, EarlyView.
ABSTRACT Efficient exploration is critical in handling sparse rewards and partial observability in deep reinforcement learning. However, most existing intrinsic reward methods based on novelty rely on single‐step observations or Euclidean distances.
Jingxiang Ma, Hongbin Ma, Youzhi Zhang
wiley   +1 more source

On infinite-dimensional stochastic differential games

open access: yesOn infinite-dimensional stochastic differential games
Dedicated to Professor S.
openaire   +1 more source

From single cells to communities: Mathematical perspectives on bacterial quorum sensing. [PDF]

open access: yesComput Struct Biotechnol J
Sadr S, Zargar B, Aucoin MG, Ingalls B.
europepmc   +1 more source

Temporal Dependency‐Aware Trajectory‐Level Behavioural Metric for Exploration in Reinforcement Learning

open access: yesCAAI Transactions on Intelligence Technology, EarlyView.
ABSTRACT Intrinsic motivation serves as the predominant paradigm of exploration in reinforcement learning. In pursuit of an informative and robust state representation, the behavioural metric groups behaviourally equivalent states together, which share the same single‐step reward and transition distribution.
Anjie Zhu   +3 more
wiley   +1 more source

Differential shooting training in youth basketball players: an analysis of performance effects. [PDF]

open access: yesFront Psychol
Burkaitė G   +4 more
europepmc   +1 more source

AT‐AER: Adversarial Training With Adaptive Example Reuse

open access: yesCAAI Transactions on Intelligence Technology, EarlyView.
ABSTRACT Adversarial training (AT) is widely regarded as a crucial defense method for deep neural networks against adversarial attacks. Most of the existing AT methods suffer from the problems of insufficient coverage of perturbation space and robust overfitting.
Meng Hu   +5 more
wiley   +1 more source

Multi‐Agent Reinforcement Learning Driven Dynamic Resource Optimisation in Healthcare Transportation Networks

open access: yesCAAI Transactions on Intelligence Technology, EarlyView.
ABSTRACT This paper presents HealthNet, a novel framework for the dynamic optimisation of healthcare transportation networks using multi‐agent reinforcement learning. HealthNet leverages a spatiotemporal dependency module to capture complex spatiotemporal relationships in healthcare demand and resource allocation patterns, combined with centralised ...
Jianhui Lv   +3 more
wiley   +1 more source

Time to Breakdown for Type I Motor Winding Insulation Under High‐Frequency High‐Slew‐Rate Square Wave Voltages

open access: yesHigh Voltage, EarlyView.
ABSTRACT The emergence of (ultra)wide bandgap ((U)WBG)‐based devices has significantly enhanced performance in high‐frequency switching applications, enabling breakthroughs in electric transportation, aerospace, and renewable energy systems. However, it also presents considerable challenges to insulation systems, such as electric motor winding ...
Easir Arafat   +3 more
wiley   +1 more source

Home - About - Disclaimer - Privacy