
Annual survival in a dynamic species: pronghorn survival patterns across their northern range

open access: yes; Wildlife Biology, EarlyView.
Quantifying variation in demographic patterns, such as survival and recruitment, is critical for understanding population dynamics and informing evidence‐based and adaptive wildlife management. In this study, we leverage an extensive dataset from over 1000 GPS collared pronghorn Antilocapra americana to provide the first large‐scale evaluation of ...
Molly C. McDevitt   +5 more
wiley   +1 more source

Heterogeneity, reinforcement learning, and chaos in population games.

open access: yes; Proc Natl Acad Sci U S A
Bielawski J   +4 more
europepmc   +1 more source

TNCOA: Efficient Exploration via Observation‐Action Constraint on Trajectory‐Based Intrinsic Reward

open access: yes; CAAI Transactions on Intelligence Technology, EarlyView.
Efficient exploration is critical in handling sparse rewards and partial observability in deep reinforcement learning. However, most existing intrinsic reward methods based on novelty rely on single‐step observations or Euclidean distances.
Jingxiang Ma, Hongbin Ma, Youzhi Zhang
wiley   +1 more source

Temporal Dependency‐Aware Trajectory‐Level Behavioural Metric for Exploration in Reinforcement Learning

open access: yes; CAAI Transactions on Intelligence Technology, EarlyView.
Intrinsic motivation serves as the predominant paradigm of exploration in reinforcement learning. In pursuit of an informative and robust state representation, the behavioural metric groups behaviourally equivalent states together, which share the same single‐step reward and transition distribution.
Anjie Zhu   +3 more
wiley   +1 more source

Multi‐Agent Reinforcement Learning Driven Dynamic Resource Optimisation in Healthcare Transportation Networks

open access: yes; CAAI Transactions on Intelligence Technology, EarlyView.
This paper presents HealthNet, a novel framework for the dynamic optimisation of healthcare transportation networks using multi‐agent reinforcement learning. HealthNet leverages a spatiotemporal dependency module to capture complex spatiotemporal relationships in healthcare demand and resource allocation patterns, combined with centralised ...
Jianhui Lv   +3 more
wiley   +1 more source

On infinite-dimensional stochastic differential games

open access: yes; On infinite-dimensional stochastic differential games
Dedicated to Professor S.
openaire   +1 more source
