Results 1 to 10 of about 199,848 (147)
Constrained Deep Q-Learning Gradually Approaching Ordinary Q-Learning [PDF]
A deep Q network (DQN) (Mnih et al., 2013) is an extension of Q learning, which is a typical deep reinforcement learning method. In DQN, a Q function expresses all action values under all states, and it is approximated using a convolutional neural ...
Shota Ohnishi +6 more
doaj +3 more sources
Using reinforcement learning in genome assembly: in-depth analysis of a Q-learning assembler [PDF]
Genome assembly remains an unsolved problem, and de novo strategies (i.e., those run without a reference) are relevant but computationally complex tasks in genomics.
Kleber Padovani +7 more
doaj +2 more sources
Reducing the Prevalence of Coronavirus (COVID-19) in Airlines Based on and the Reinforcement Artificial Intelligence [PDF]
This paper proposes a method based on the artificial intelligence reinforcement Q-learning algorithm and paired comparison technique to solve the problem of health monitoring devices shortage in airlines.
Iman Shafieenejad +3 more
doaj +1 more source
Generalized Speedy Q-Learning [PDF]
In this paper, we derive a generalization of the Speedy Q-learning (SQL) algorithm that was proposed in the Reinforcement Learning (RL) literature to handle slow convergence of Watkins' Q-learning. In most RL algorithms such as Q-learning, the Bellman equation and the Bellman operator play an important role.
Indu John +2 more
openaire +2 more sources
Background Supervised deep learning in radiology suffers from notorious inherent limitations: 1) It requires large, hand-annotated data sets; (2) It is non-generalizable; and (3) It lacks explainability and intuition.
J. N. Stember, H. Shalu
doaj +1 more source
Vehicular and flying ad hoc networks (VANETs and FANETs) are becoming increasingly important with the development of smart cities and intelligent transportation systems (ITSs).
Pavle Bugarčić +2 more
doaj +1 more source
QSPCA: A two-stage efficient power control approach in D2D communication for 5G networks
The existing literature on device-to-device (D2D) architecture suffers from a dearth of analysis under imperfect channel conditions. There is a need for rigorous analyses on the policy improvement and evaluation of network performance. Accordingly, a two-
Saurabh Chandra +4 more
doaj +1 more source
Cooperative Output Regulation By Q-learning For Discrete Multi-agent Systems In Finite-time
This article studies the output regulation of discrete-time multi-agent systems with an unknown model by a finite-time optimal control algorithm based on Q-learning that uses the method of the linear quadratic regulator (LQR).
Wenjun Wei, Jingyuan Tang
doaj +1 more source
QIBMRMN: Design of a Q-Learning based Iterative sleep-scheduling & hybrid Bioinspired Multipath Routing model for Multimedia Networks [PDF]
Multimedia networks utilize low-power scalar nodes to modify wakeup cycles of high-performance multimedia nodes, which assists in optimizing the power-toperformance ratios.
Minaxi Doorwar, P Malathi
doaj +1 more source
Methods and software for solar power plant cluster management
Object is solar power plant management software. Nowadays, solar panel production technologies are developing rapidly, investments in solar energy are growing, so users are interested in increasing energy production for faster return on investment.
А. Мокрий, І. Баклан
doaj +1 more source

