Results 111 to 120 of about 171,350 (313)
A novel stratum corneum‐inspired zwitterionic hydrogel is developed for intelligent, flexible sensors, featuring intrinsic water retention and anti‐freezing properties. The quasi‐gel, composed of hygroscopic polymers and bound water, maintains its softness across a wide range of humidity.
Meng Wu+8 more
wiley +1 more source
Stochastic Primal-Dual Q-Learning [PDF]
In this work, we present a new model-free and off-policy reinforcement learning (RL) algorithm, that is capable of finding a near-optimal policy with state-action observations from arbitrary behavior policies. Our algorithm, called the stochastic primal-dual Q-learning (SPD Q-learning), hinges upon a new linear programming formulation and a dual ...
arxiv
A simulation technique for assessing both the fabrication and operation of a solid‐state Si battery is demonstrated by integrating particle dynamics with mass/charge transport. Although, the fabrication pressure (Pfab) increased the inter‐particle contacts and reduced the concentration (ηconc) and Li‐ion (ηLi+) overpotentials during discharging, it ...
Magnus So+4 more
wiley +1 more source
DQL: A New Updating Strategy for Reinforcement Learning Based on Q-Learning [PDF]
Carlos E. Mariano, Eduardo F. Morales
openalex +1 more source
Faster Q-Learning Algorithms for Restless Bandits [PDF]
We study the Whittle index learning algorithm for restless multi-armed bandits (RMAB). We first present Q-learning algorithm and its variants -- speedy Q-learning (SQL), generalized speedy Q-learning (GSQL) and phase Q-learning (PhaseQL). We also discuss exploration policies -- $\epsilon$-greedy and Upper confidence bound (UCB).
arxiv
Reinforcement Learning for Traffic Control with Adaptive Horizon [PDF]
This paper proposes a reinforcement learning approach for traffic control with the adaptive horizon. To build the controller for the traffic network, a Q-learning-based strategy that controls the green light passing time at the network intersections is applied.
arxiv
Temperature‐Resilient Polymeric Memristors for Effective Deblurring in Static and Dynamic Imaging
A thermally stable organic memristor based on a thiadiazolobenzotriazole (TBZ) and 2,5‐Dioctyl‐3,6‐di(thiophen‐2‐yl)pyrrolo[3,4‐c]pyrrole‐1,4(2H,5H)‐dione (DPP)‐based conjugated polymer is presented, demonstrating reliable, gradual resistance switching across a wide temperature range (153–573 K).
Ziyu Lv+15 more
wiley +1 more source
A Unified Switching System Perspective and O.D.E. Analysis of Q-Learning Algorithms [PDF]
In this paper, we introduce a unified framework for analyzing a large family of Q-learning algorithms, based on switching system perspectives and ODE-based stochastic approximation. We show that the nonlinear ODE models associated with these Q-learning algorithms can be formulated as switched linear systems, and analyze their asymptotic stability by ...
arxiv
Quantum Emitters in Hexagonal Boron Nitride: Principles, Engineering and Applications
Quantum emitters in hexagonal boron nitride have emerged as a promising candidate for quantum information science. This review examines the fundamentals of these quantum emitters, including their level structures, defect engineering, and their possible chemical structures.
Thi Ngoc Anh Mai+8 more
wiley +1 more source
This study considers a discrete-time, linear state feedback control strategy rooted in Q-learning, one of the Reinforcement Learning (RL) approaches, to address an adaptive Linear Quadratic (LQ) problem under stochastic disturbances. Q-learning optimizes
Vina Putri Virgiani, Shiro Masuda
doaj +1 more source