Results 61 to 70 of about 3,173,574 (293)
AGT: Efficient Offline Reinforcement Learning With Advantage‐Guided Transformer
ABSTRACT Offline reinforcement learning (RL) is a paradigm that seeks to train policies directly based on fixed datasets derived from previous interactions with the environment. However, offline RL faces critical challenges in environments characterised by sparse rewards and datasets dominated by suboptimal trajectories.
Jiaye Wei +4 more
wiley +1 more source
This paper is concerned with the abstract evolution equation with delay. Firstly, we establish some sufficient conditions to ensure the existence results for the S-asymptotically periodic solutions by means of the compact semigroup. Secondly, we consider
Hong Qiao, Qiang Li, Tianjiao Yuan
doaj +1 more source
Some operator Bellman type inequalities
In this paper, we employ the Mond--Pe\v{c}ari\'c method to establish some reverses of the operator Bellman inequality under certain conditions. In particular, we show \begin{equation*} \delta I_{\mathscr K}+\sum_{j=1}^n\omega_j\Phi_j\left((I_{\mathscr H}-
Bakherad, Mojtaba, Morassaei, Ali
core +1 more source
A Model for Optimal Human Navigation with Stochastic Effects
We present a method for optimal path planning of human walking paths in mountainous terrain, using a control theoretic formulation and a Hamilton-Jacobi-Bellman equation.
Arnold, David +3 more
core +1 more source
The Consequences of Soil Organic Carbon for Crop Yield, Farm Productivity and Profit
ABSTRACT Crop choices affect soil organic carbon (SOC) stocks, allowing farmers to manipulate the amount of carbon sequestered in the soil over time. This paper examines the private and public benefits of crop rotations that sequester additional carbon across the province of Saskatchewan, Canada using a novel field‐level dataset from the Saskatchewan ...
Devin Allen Serfas
wiley +1 more source
Cryptocurrency Bubbles and Costly Mining
ABSTRACT This paper develops a model of a cryptocurrency by incorporating mining into the otherwise standard search‐theoretic monetary framework. As usual, multiple equilibria exist. To obtain a sharp prediction on whether a cryptocurrency' s value will last in the future, I propose a notion of equilibrium refinement based on the feature that mining ...
Kohei Iwasaki
wiley +1 more source
Monotone concave operators: An application to the existence and uniqueness of solutions to the Bellman equation [PDF]
We propose a new approach to the issue of existence and uniqueness of solutions to the Bellman equation, exploiting an emerging class of methods, called monotone map methods, pioneered in the work of Krasnosel’skii (1964) and Krasnosel’skii-Zabreiko ...
Cuong Le Van +2 more
core +3 more sources
Ambiguity Aversion, Portfolio Choice, and Life Expectancy
ABSTRACT This paper studies how wealth and aging affect portfolio choices in a life‐cycle model with ambiguity aversion. Ambiguity aversion implies wealthier and older agents are endogenously more optimistic about risky asset returns, relative to poorer/younger agents. As life expectancy grows, old agents become even more optimistic, while young agents
Alistair Macaulay, Chenchuan Shi
wiley +1 more source
Search and Inventory in Over‐the‐Counter Markets
ABSTRACT We investigate the sources of the dealer centrality premium in the over‐the‐counter market for corporate bonds. We model dealer heterogeneity by allowing the dealer's status in the network to determine search effort and inventory costs when choosing to conduct riskless principal or principal trades.
Evan Dudley, Hongfei Sun, Chengjie Diao
wiley +1 more source
Nonlinear Optimal Control for Stochastic Dynamical Systems
This paper presents a comprehensive framework addressing optimal nonlinear analysis and feedback control synthesis for nonlinear stochastic dynamical systems.
Manuel Lanchares, Wassim M. Haddad
doaj +1 more source

