Some of the following articles may not be open access.
Asymptotic Optimality of Semi-Open-Loop Policies in Markov Decision Processes with Large Lead Times
Operations Research, 2020
A generic way to verify asymptotic optimality of semi-open-loop policies for a wide class of MDPs with large lead times. In many real-life inventory models, order lead times can result in uncertain effects of inventory decisions. However, as the lead time grows large, one would naturally postulate that the effect of the delayed order depends weakly ...
Xingyu Bai +3 more
openaire +2 more sources
Optimality of Quasi-Open-Loop Policies for Discounted Semi-Markov Decision Processes
Mathematics of Operations Research, 2016
Quasi-open-loop policies consist of sequences of Markovian decision rules that are insensitive to one component of the state space. Given a semi-Markov decision process (SMDP), we distinguish between exogenous and endogenous state components as follows: (i) the decision-maker’s actions do not impact the evolution of an exogenous state component, and ...
Adelman, Daniel, Mancini, Angelo J.
openaire +1 more source
Optimal Open-Loop Control Policies for a Class of Nonlinear Actuators
2019 18th European Control Conference (ECC), 2019
This paper deals with the design and analysis of open-loop soft-landing control policies for a class of nonlinear actuators. A third-order nonlinear parametric model is firstly presented and the particularities of the systems under study are highlighted.
Edgar Ramirez-Laboreo +2 more
openaire +1 more source
Open-loop policies in Bayesian dynamic pricing: Some counter-intuitive observations and insights
Operations Research Letters, 2019
Chen, Li, Wu, Chengyu
openaire +1 more source
Sensitivity analysis and stochastic optimization for open-loop batch operating policy determination
Proceedings of the 40th IEEE Conference on Decision and Control (Cat. No.01CH37228), 2002
The determination of the optimal open-loop operating policy for batch reaction systems under uncertainty is considered. Adjustment of the nominal optimal open-loop operating policy based on worst-case and chance-constrained stochastic optimization approaches is discussed and demonstrated using a commercial pharmaceutical synthesis reaction system ...
K.R. Muske, M. Badlani
openaire +1 more source
2023
In classic reinforcement learning (RL) for continuous control, agents make decisions at discrete and fixed time intervals. The duration between decisions becomes a crucial hyperparameter. Setting it too short may increase the problem’s difficulty by requiring the agent to make numerous decisions to achieve its goal, while setting it too long can result ...
openaire +1 more source
AIAA Infotech@Aerospace Conference, 2009
We examine the complementary strengths of value-function-based policy learning and guided search. Our work unifies rollout-based open-loop feedback control (outlined by Bertsekas²) and plan-space approximate dynamic programming (studied by Boyan³). We exploit the strengths of this unified space by finding a natural order and metric for considering ...
Lawrence Bush +2 more
openaire +1 more source
IEEE Power Engineering Review, 2002
Stochastic dynamic programming has been extensively used in the optimization of long-term hydrothermal scheduling problems due to its ability to cope with the nonlinear and stochastic characteristics of such problems and the fact that it provides a closed-loop feedback control policy.
L. Martinez, S. Soares
openaire +1 more source
Closed and Open Loop Oil Taxation Policies in New Mexico
SSRN Electronic Journal, 2023
Saeed Langarudi +2 more
openaire +1 more source

