Learning to schedule job-shop problems: representation and policy learning using graph neural network and reinforcement learning [PDF]
We propose a framework to learn to schedule a job-shop problem (JSSP) using a graph neural network (GNN) and reinforcement learning (RL). We formulate the scheduling process of JSSP as a sequential decision-making problem with graph representation of the
Junyoung Park +4 more
semanticscholar +1 more source
For Pre-Trained Vision Models in Motor Control, Not All Policy Learning Methods are Created Equal [PDF]
In recent years, increasing attention has been directed to leveraging pre-trained vision models for motor control. While existing works mainly emphasize the importance of this pre-training phase, the arguably equally important role played by downstream ...
Yingdong Hu +3 more
semanticscholar +1 more source
VISTA 2.0: An Open, Data-driven Simulator for Multimodal Sensing and Policy Learning for Autonomous Vehicles [PDF]
Simulation has the potential to transform the development of robust algorithms for mobile agents deployed in safety-critical scenarios. However, the poor photorealism and lack of diverse sensor modalities of existing simulation engines remain key hurdles
Alexander Amini +7 more
semanticscholar +1 more source
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning [PDF]
Dialogue policy learning (DPL) is a key component in a task-oriented dialogue (TOD) system. Its goal is to decide the next action of the dialogue system, given the dialogue state at each turn based on a learned dialogue policy. Reinforcement learning (RL)
Wai-Chung Kwan +3 more
semanticscholar +1 more source
Affordance Learning from Play for Sample-Efficient Policy Learning [PDF]
Robots operating in human-centered environments should have the ability to understand how objects function: what can be done with each object, where this interaction may occur, and how the object is used to achieve a goal. To this end, we propose a novel
Jessica Borja-Diaz +5 more
semanticscholar +1 more source
For students to advance beyond arithmetic, they must learn how to attend to the structure of math notation. This process can be challenging due to students' left-to-right computing tendencies.
Vy Ngo +3 more
doaj +1 more source
Meta Policy Learning for Cold-Start Conversational Recommendation [PDF]
Conversational recommender systems (CRS) explicitly solicit users' preferences for improved recommendations on the fly. Most existing CRS solutions count on a single policy trained by reinforcement learning for a population of users.
Zhendong Chu +4 more
semanticscholar +1 more source
Policy Learning for Nonlinear Model Predictive Control With Application to USVs [PDF]
The unaffordable computation load of nonlinear model predictive control (NMPC) has prevented it from being used in robots with high sampling rates for decades.
Rizhong Wang +4 more
semanticscholar +1 more source
Learning nullspace policies [PDF]
Many everyday tasks performed by people, such as reaching, pointing or drawing, resolve redundant degrees of freedom in the arm in a similar way. In this paper we present a novel method for learning the strategy used to resolve redundancy by exploiting the variability in multiple observations of different tasks. We demonstrate the effectiveness of this
Towell, Chris +2 more
openaire +3 more sources
A Survey of Deep RL and IL for Autonomous Driving Policy Learning [PDF]
Autonomous driving (AD) agents generate driving policies based on online perception results, which are obtained at multiple levels of abstraction, e.g., behavior planning, motion planning and control.
Zeyu Zhu, Huijing Zhao
semanticscholar +1 more source

