Results 221 to 230 of about 522,323 (249)
Some of the next articles are maybe not open access.
AIAA Infotech@Aerospace Conference, 2009
We examine the complementary strengths of value function based policy learning and guided search. Our work unifies rollout based open-loop feedback control (outlined by Bertsekas 2 ) and plan-space approximate dynamic programming (studied by Boyan 3 ). We exploit the strengths of this unified space by finding a natural order and metric for considering ...
Lawrence Bush +2 more
openaire +2 more sources
We examine the complementary strengths of value function based policy learning and guided search. Our work unifies rollout based open-loop feedback control (outlined by Bertsekas 2 ) and plan-space approximate dynamic programming (studied by Boyan 3 ). We exploit the strengths of this unified space by finding a natural order and metric for considering ...
Lawrence Bush +2 more
openaire +2 more sources
IEEE Transactions on Vehicular Technology, 2021
To support ultra-reliable and low-latency communication (URLLC) in vehicular networks, the virtual cells, where multiple access points (APs) cooperatively serve one mobile node, have been proposed to reduce the end-to-end latency in the downlink.
Yaoyuan Zhang +5 more
semanticscholar +1 more source
To support ultra-reliable and low-latency communication (URLLC) in vehicular networks, the virtual cells, where multiple access points (APs) cooperatively serve one mobile node, have been proposed to reduce the end-to-end latency in the downlink.
Yaoyuan Zhang +5 more
semanticscholar +1 more source
Sensitivity analysis and stochastic optimization for open-loop batch operating policy determination
Proceedings of the 40th IEEE Conference on Decision and Control (Cat. No.01CH37228), 2002The determination of the optimal open-loop operating policy for batch reaction systems under uncertainty is considered. Adjustment of the nominal optimal open-loop operating policy based on worst-case and chance-constrained stochastic optimization approaches are discussed and demonstrated using a commercial pharmaceutical synthesis reaction system ...
K.R. Muske, M. Badlani
openaire +1 more source
AI/ML Data-driven Control Loop for Managing O-RAN SDR-based RANs
Conference on Computer Communications Workshops, 2023Open Radio Access Network (O-RAN) introduced a common control and management overlay, allowing mobile network operators to embed networking intelligence using different types of third-party applications: xApps for real-time control loops, and rApps for ...
Jaswanth S. R. Mallu +5 more
semanticscholar +1 more source
Evolving Dynamic Locomotion Policies in Minutes
International Conference on Information, Intelligence, Systems and Applications, 2023Many effective evolutionary methods have been proposed that allow robots to learn how to walk. Most of the proposed methods have one or more of the following drawbacks: (a) utilization of hand designed open loop policies that cannot scale to different ...
Konstantinos I. Chatzilygeroudis +2 more
semanticscholar +1 more source
2023
In classic reinforcement learning(RL) for continuous control, agents make decisions at discrete and fixed time intervals. The duration between decisions becomes a crucial hyperparameter. Setting it too short may increase the problem’s difficulty by requiring the agent to make numerous decisions to achieve its goal, while setting it too long can result ...
openaire +1 more source
In classic reinforcement learning(RL) for continuous control, agents make decisions at discrete and fixed time intervals. The duration between decisions becomes a crucial hyperparameter. Setting it too short may increase the problem’s difficulty by requiring the agent to make numerous decisions to achieve its goal, while setting it too long can result ...
openaire +1 more source
EquAct: An SE(3)-Equivariant Multi-Task Transformer for Open-Loop Robotic Manipulation
arXiv.orgTransformer architectures can effectively learn language-conditioned, multi-task 3D open-loop manipulation policies from demonstrations by jointly processing natural language instructions and 3D observations.
Xu Zhu +4 more
semanticscholar +1 more source
IEEE Control Systems Letters
Dynamic games offer a versatile framework for modeling the evolving interactions of strategic agents, whose steady-state behavior can be captured by the Nash equilibria of the games. Nash equilibria are often computed in feedback, with policies depending
Chih-Yuan Chiu +3 more
semanticscholar +1 more source
Dynamic games offer a versatile framework for modeling the evolving interactions of strategic agents, whose steady-state behavior can be captured by the Nash equilibria of the games. Nash equilibria are often computed in feedback, with policies depending
Chih-Yuan Chiu +3 more
semanticscholar +1 more source
Open-loop response of Fischer–Tropsch reactions to manipulation of temperature and pressure
International journal of Chemical Reactor EngineeringIn the present work, the Fischer–Tropsch synthesis (FTS) is carried out through simulation. This reaction uses a gas mixture, called synthesis gas, composed of carbon monoxide rich in hydrogen (H2/CO > 2.5), to form medium and long chain hydrocarbons (C5
Salvador Piña-Contreras +3 more
semanticscholar +1 more source

