Results 101 to 110 of about 171,350 (313)
The Mean-Squared Error of Double Q-Learning [PDF]
In this paper, we establish a theoretical comparison between the asymptotic mean-squared error of Double Q-learning and Q-learning. Our result builds upon an analysis for linear stochastic approximation based on Lyapunov equations and applies to both tabular setting and with linear function approximation, provided that the optimal policy is unique and ...
arxiv
The electroplating of a CrCoNi medium‐entropy alloy is achieved using a mixture of an ionic liquid and an aqueous solution containing metal salts. The CrCoNi medium‐entropy alloy thin film exhibits high wear and corrosion resistance superior to conventional hard chromium coatings. Abstract High‐entropy alloys (HEAs) and medium‐entropy alloys (MEAs) are
Yuki Murakami+7 more
wiley +1 more source
Reinforcement learning (RL) approaches, particularly Q-learning, have emerged as strong tools for autonomous agent training, allowing agents to acquire optimum decision-making rules through interaction with their surroundings.
Biplov Paneru+3 more
doaj +1 more source
A materials and device design concept that comprises a self‐assembled ultra‐thin epitaxial ion‐transporting layer, an amorphous oxide overcoat oxygen‐blocking layer, and a partial filament formed during an electroforming step is proposed for low‐current multilevel resistive switching devices.
Ming Xiao+17 more
wiley +1 more source
This work addresses bi-objective hybrid flow shop scheduling problems considering consistent sublots (Bi-HFSP_CS). The objectives are to minimize the makespan and total energy consumption.
Benxue Lu+3 more
doaj +1 more source
Hierarchical clustering with deep Q-learning
Abstract Following up on our previous study on applying hierarchical clustering algorithms to high energy particle physics, this paper explores the possibilities to use deep learning to generate models capable of processing the clusterization themselves.
Forster, R., Fülöp, Ágnes
openaire +6 more sources
This review synthesizes the evolution of radiative heat transfer, emphasizing the transition from far‐field to near‐field regimes. Traditional frameworks, such as Planck's law, are revisited alongside modern innovations like fluctuational electrodynamics. Applications span nanoscale thermal management, energy harvesting, and thermophotovoltaic systems.
Ambali Alade Odebowale+6 more
wiley +1 more source
Tin‐Based 2D/3D Perovskite Vertical Heterojunction for High‐Performance Synaptic Phototransistors
Phototransistors based on tin‐based 2D/3D perovskite heterostructures show an ultrahigh responsivity and detectivity at a low gate voltage across a broad wavelength region from ultraviolet to near‐infrared. The devices can replicate neuromorphic learning and remembering behaviors to light stimuli, in addition to electric depression and memory erasure ...
Hok‐Leung Loi+10 more
wiley +1 more source
Robot control using Q-Learning [PDF]
This paper focuses on machine learning, where an agent learns how to solve a specific problem. The learning process will take place in a simulated environment, so the effectiveness can be measured without any potential damage to real the robot. QLearning
Szántó Zoltán
doaj
Assessing the Potential of Classical Q-learning in General Game Playing [PDF]
After the recent groundbreaking results of AlphaGo and AlphaZero, we have seen strong interests in deep reinforcement learning and artificial general intelligence (AGI) in game playing. However, deep learning is resource-intensive and the theory is not yet well developed.
arxiv