
AGT: Efficient Offline Reinforcement Learning With Advantage‐Guided Transformer

open access: yes, CAAI Transactions on Intelligence Technology, EarlyView.
ABSTRACT Offline reinforcement learning (RL) is a paradigm that seeks to train policies directly based on fixed datasets derived from previous interactions with the environment. However, offline RL faces critical challenges in environments characterised by sparse rewards and datasets dominated by suboptimal trajectories.
Jiaye Wei   +4 more
wiley   +1 more source

Existence and Global Asymptotic Behavior of S-Asymptotically ω-Periodic Solutions for Evolution Equation with Delay

open access: yes, Journal of Function Spaces, 2020
This paper is concerned with the abstract evolution equation with delay. Firstly, we establish some sufficient conditions to ensure the existence results for the S-asymptotically periodic solutions by means of the compact semigroup. Secondly, we consider
Hong Qiao, Qiang Li, Tianjiao Yuan
doaj   +1 more source

Some operator Bellman type inequalities

open access: yes, 2015
In this paper, we employ the Mond–Pečarić method to establish some reverses of the operator Bellman inequality under certain conditions. In particular, we show \begin{equation*} \delta I_{\mathscr K}+\sum_{j=1}^n\omega_j\Phi_j\left((I_{\mathscr H}-
Mojtaba Bakherad, Ali Morassaei
core   +1 more source

A Model for Optimal Human Navigation with Stochastic Effects

open access: yes, 2020
We present a method for optimal path planning of human walking paths in mountainous terrain, using a control theoretic formulation and a Hamilton-Jacobi-Bellman equation.
David Arnold   +3 more
core   +1 more source

The Consequences of Soil Organic Carbon for Crop Yield, Farm Productivity and Profit

open access: yes, Australian Journal of Agricultural and Resource Economics, EarlyView.
ABSTRACT Crop choices affect soil organic carbon (SOC) stocks, allowing farmers to manipulate the amount of carbon sequestered in the soil over time. This paper examines the private and public benefits of crop rotations that sequester additional carbon across the province of Saskatchewan, Canada using a novel field‐level dataset from the Saskatchewan ...
Devin Allen Serfas
wiley   +1 more source

Cryptocurrency Bubbles and Costly Mining

open access: yes, International Economic Review, EarlyView.
ABSTRACT This paper develops a model of a cryptocurrency by incorporating mining into the otherwise standard search‐theoretic monetary framework. As usual, multiple equilibria exist. To obtain a sharp prediction on whether a cryptocurrency's value will last in the future, I propose a notion of equilibrium refinement based on the feature that mining ...
Kohei Iwasaki
wiley   +1 more source

Monotone concave operators: An application to the existence and uniqueness of solutions to the Bellman equation [PDF]

open access: yes
We propose a new approach to the issue of existence and uniqueness of solutions to the Bellman equation, exploiting an emerging class of methods, called monotone map methods, pioneered in the work of Krasnosel’skii (1964) and Krasnosel’skii-Zabreiko ...
Cuong Le Van   +2 more
core   +3 more sources

Ambiguity Aversion, Portfolio Choice, and Life Expectancy

open access: yes, International Economic Review, EarlyView.
ABSTRACT This paper studies how wealth and aging affect portfolio choices in a life‐cycle model with ambiguity aversion. Ambiguity aversion implies wealthier and older agents are endogenously more optimistic about risky asset returns, relative to poorer/younger agents. As life expectancy grows, old agents become even more optimistic, while young agents
Alistair Macaulay, Chenchuan Shi
wiley   +1 more source

Search and Inventory in Over‐the‐Counter Markets

open access: yes, International Economic Review, EarlyView.
ABSTRACT We investigate the sources of the dealer centrality premium in the over‐the‐counter market for corporate bonds. We model dealer heterogeneity by allowing the dealer's status in the network to determine search effort and inventory costs when choosing to conduct riskless principal or principal trades.
Evan Dudley, Hongfei Sun, Chengjie Diao
wiley   +1 more source

Nonlinear Optimal Control for Stochastic Dynamical Systems

open access: yes, Mathematics
This paper presents a comprehensive framework addressing optimal nonlinear analysis and feedback control synthesis for nonlinear stochastic dynamical systems.
Manuel Lanchares, Wassim M. Haddad
doaj   +1 more source
