Results 11 to 20 of about 580,002 (195)
Deep Reinforcement Learning for Autonomous Driving: A Survey [PDF]
With the development of deep representation learning, the domain of reinforcement learning (RL) has become a powerful learning framework now capable of learning complex policies in high dimensional environments.
B. R. Kiran+6 more
semanticscholar +1 more source
A well-designed demand response (DR) program is essential in smart home to optimize energy usage according to user preferences. In this study, we proposed a multiobjective reinforcement learning (MORL) algorithm to design a DR program.
Song-Jen Chen, Wei-Yu Chiu, Wei-Jen Liu
doaj +1 more source
Memory-two strategies forming symmetric mutual reinforcement learning equilibrium in repeated prisoners' dilemma game [PDF]
We investigate symmetric equilibria of mutual reinforcement learning when both players alternately learn the optimal memory-two strategies against the opponent in the repeated prisoners' dilemma game. We provide a necessary condition for memory-two deterministic strategies to form symmetric equilibria.
arxiv +1 more source
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox [PDF]
With the breakthrough of AlphaGo, deep reinforcement learning becomes a recognized technique for solving sequential decision-making problems. Despite its reputation, data inefficiency caused by its trial and error learning mechanism makes deep reinforcement learning hard to be practical in a wide range of areas.
arxiv +1 more source
Rainbow: Combining Improvements in Deep Reinforcement Learning [PDF]
The deep reinforcement learning community has made several independent improvements to the DQN algorithm. However, it is unclear which of these extensions are complementary and can be fruitfully combined. This paper examines six extensions to the DQN
Matteo Hessel+9 more
semanticscholar +1 more source
Experimental Study on Axial Compression of an Insulating Layer through a Composite Shear Wall
Based on the research of composite walls at home and abroad, a construction method of continuous opening of the insulation layer in the specimen is proposed.
Yuliang Wang, Congcong Wang, Zhixing Cao
doaj +1 more source
Snake-like modular robots (MRs) are highly flexible, but, to traverse a challenging terrain or explore a region of interest, MR needs to attain efficient locomotion depending on a tradeoff between objectives like forward velocity and power consumption of
Akash Singh+3 more
doaj +1 more source
Seismic Performance Evaluation of Highway Bridges under Scour and Chloride Ion Corrosion
Cross-river bridges located in seismically active areas are exposed to two major natural hazards, namely earthquakes and flooding. As the scour depth increases, more parts of the bridge substructure will inevitably be exposed to unfavorable conditions ...
Mi Zhou+3 more
doaj +1 more source
Learning an Accurate State Transition Dynamics Model by Fitting Both a Function and its Derivative
Learning accurate state transition dynamics model in a sample-efficient way is important to predict the future states from the current states and actions of a system both accurately and efficiently in model-based reinforcement learning for many robotic ...
Youngho Kim, Hoosang Lee, Jeha Ryu
doaj +1 more source
Chapter in "A Guided Tour of Artificial Intelligence Research ...
Buffet, Olivier+2 more
openaire +4 more sources