Results 111 to 120 of about 154,449 (307)

Risk-Sensitive Reinforcement Learning: A Constrained Optimization Viewpoint

open access: yes, 2018
The classic objective in a reinforcement learning (RL) problem is to find a policy that minimizes, in expectation, a long-run objective such as the infinite-horizon discounted or long-run average cost.
A., Prashanth L., Fu, Michael
core  

Robust Reinforcement Learning Control Framework for a Quadrotor Unmanned Aerial Vehicle Using Critic Neural Network

open access: yesAdvanced Intelligent Systems, Volume 7, Issue 3, March 2025.
Quadrotor unmanned aerial vehicle control is critical to maintain flight safety and efficiency, especially when facing external disturbances and model uncertainties. This article presents a robust reinforcement learning control scheme to deal with these challenges.
Yu Cai   +3 more
wiley   +1 more source

Bounding Rationality by Discounting Time [PDF]

open access: yes, 2009
Consider a game where Alice generates an integer and Bob wins if he can factor that integer. Traditional game theory tells us that Bob will always win this game even though in practice Alice will win given our usual assumptions about the hardness of ...
Fortnow, Lance, Santhanam, Rahul
core   +4 more sources

Multiobjective Environmental Cleanup with Autonomous Surface Vehicle Fleets Using Multitask Multiagent Deep Reinforcement Learning

open access: yesAdvanced Intelligent Systems, EarlyView.
This study presents a multitask strategy for plastic cleanup with autonomous surface vehicles, combining exploration and cleaning phases. A two‐headed Deep Q‐Network shared by all agents is traineded via multiobjective reinforcement learning, producing a Pareto front of trade‐offs.
Dame Seck   +4 more
wiley   +1 more source

Public infrastructures, public consumption and welfare in a new open economy macro model [PDF]

open access: yes
This paper focuses on the trade-off faced by governments in deciding the allocation of public expenditures between productivity-enhancing public infrastructures and utility-enhancing public consumption in a two-country model.
Ganelli , Giovanni, Tervala, Juha
core  

Discounting and Future Selves [PDF]

open access: yes, 2002
Is discounting of future instantaneous utilities consistent with altruism towards future selves? More precisely, can temporal preferences, expressed as a sum of discounted instantaneous utilities, be derived from a representation in the form of a sum of ...
Sáez-Martí, María   +1 more
core  

Adaptive Autonomy in Microrobot Motion Control via Deep Reinforcement Learning and Path Planning Synergy

open access: yesAdvanced Intelligent Systems, EarlyView.
This study introduces a data‐driven framework that combines deep reinforcement learning with classical path planning to achieve adaptive microrobot navigation. By training a surrogate neural network to emulate microrobot dynamics, the approach improves learning efficiency, reduces training time, and enables robust real‐time obstacle avoidance in ...
Amar Salehi   +3 more
wiley   +1 more source

Characterization of a Cournot–Nash Equilibrium for a Fishery Model with Fuzzy Utilities

open access: yesJournal of Mathematics
The article deals with the extensions of discrete-time games with infinite time horizon and their application in a fuzzy context to fishery models. The criteria for these games are the total discounted utility and the average utility in a fishing problem.
R. Israel Ortega-Gutiérrez   +2 more
doaj   +1 more source

Money Metrics Welfare Measures in Imperfect Markets under Growth [PDF]

open access: yes
This paper shows how utility based welfare measures in dynamic general equilibrium under imperfect markets can be transferred into a money metrics. In order to do this, we need to price forward looking components measured in units of utility. The typical
Li, Chuan Zhong, Löfgren, Karl-Gustaf
core   +1 more source

Collaborative Multiagent Closed‐Loop Motion Planning for Multimanipulator Systems

open access: yesAdvanced Intelligent Systems, EarlyView.
This work presents a hierarchical multi‐manipulator planner, emphasizing highly overlapping space. The proposed method leverages an enhanced Dynamic Movement Primitive based planner along with an improvised Multi‐Agent Reinforcement Learning approach to ensure regulatory and mediatory control while ensuring low‐level autonomy. Experiments across varied
Tian Xu, Siddharth Singh, Qing Chang
wiley   +1 more source

Home - About - Disclaimer - Privacy