Results 241 to 250 of about 391,445 (269)
Some of the next articles are maybe not open access.
2023
In classic reinforcement learning(RL) for continuous control, agents make decisions at discrete and fixed time intervals. The duration between decisions becomes a crucial hyperparameter. Setting it too short may increase the problem’s difficulty by requiring the agent to make numerous decisions to achieve its goal, while setting it too long can result ...
openaire +1 more source
In classic reinforcement learning(RL) for continuous control, agents make decisions at discrete and fixed time intervals. The duration between decisions becomes a crucial hyperparameter. Setting it too short may increase the problem’s difficulty by requiring the agent to make numerous decisions to achieve its goal, while setting it too long can result ...
openaire +1 more source
AIAA Infotech@Aerospace Conference, 2009
We examine the complementary strengths of value function based policy learning and guided search. Our work unifies rollout based open-loop feedback control (outlined by Bertsekas 2 ) and plan-space approximate dynamic programming (studied by Boyan 3 ). We exploit the strengths of this unified space by finding a natural order and metric for considering ...
Lawrence Bush +2 more
openaire +1 more source
We examine the complementary strengths of value function based policy learning and guided search. Our work unifies rollout based open-loop feedback control (outlined by Bertsekas 2 ) and plan-space approximate dynamic programming (studied by Boyan 3 ). We exploit the strengths of this unified space by finding a natural order and metric for considering ...
Lawrence Bush +2 more
openaire +1 more source
IEEE Power Engineering Review, 2002
Stochastic dynamic programming has been extensively used in the optimization of long-term hydrothermal scheduling problems due to its ability to cope with the nonlinear and stochastic characteristics of such problems and the fact that it provides a closed-loop feedback control policy.
L. Martinez, S. Soares
openaire +1 more source
Stochastic dynamic programming has been extensively used in the optimization of long-term hydrothermal scheduling problems due to its ability to cope with the nonlinear and stochastic characteristics of such problems and the fact that it provides a closed-loop feedback control policy.
L. Martinez, S. Soares
openaire +1 more source
Closed and Open Loop Oil Taxation Policies in New Mexico
SSRN Electronic Journal, 2023Saeed Langarudi +2 more
openaire +1 more source
American Cancer Society nutrition and physical activity guideline for cancer survivors
Ca-A Cancer Journal for Clinicians, 2022Cheryl L Rock +2 more
exaly
American Cancer Society's report on the status of cancer disparities in the United States, 2021
Ca-A Cancer Journal for Clinicians, 2022Farhad Islami +2 more
exaly
Navigating financial toxicity in patients with cancer: A multidisciplinary management approach
Ca-A Cancer Journal for Clinicians, 2022Grace Li Smith +2 more
exaly

