
Q Learning Regression Neural Network

open access: yes | Neural Network World, 2018
In this work, a Nadaraya-Watson kernel-based learning system with a general regression neural network topology is adapted to the Q-learning method to derive a quick and efficient action-selection policy for reinforcement learning problems. With the proposed method, the Q-value function is generalized and the learning speed of the Q agent is accelerated ...
Sarigül M., Avci M.

Type-2-Soft-Set Based Uncertainty Aware Task Offloading Framework for Fog Computing Using Apprenticeship Learning

open access: yes | Cybernetics and Information Technologies, 2023
Fog computing is an emerging form of cloud computing that aims to satisfy the ever-increasing computation demands of mobile applications. Effective offloading of tasks increases the efficiency of the fog network, but at the same time ...
Bhargavi K.   +2 more

Learning an Efficient Gait Cycle of a Biped Robot Based on Reinforcement Learning and Artificial Neural Networks

open access: yes | Applied Sciences, 2019
Programming robots to perform different activities requires calculating sequences of joint values while taking into account many factors, such as stability and efficiency, at the same time. Particularly for walking, state-of-the-art techniques ...
Cristyan R. Gil   +2 more

Traffic Light Cycle Configuration of Single Intersection Based on Modified Q-Learning

open access: yes | Applied Sciences, 2019
In recent years, traffic congestion in large cities with high population density has grown more serious, raising vehicle emissions and reducing the efficiency of urban operations.
Hung-Chi Chu   +3 more

Aircraft Maintenance Check Scheduling Using Reinforcement Learning

open access: yes | Aerospace, 2021
This paper presents a Reinforcement Learning (RL) approach to optimize the long-term scheduling of maintenance for an aircraft fleet. The problem considers fleet status, maintenance capacity, and other maintenance constraints to schedule hangar checks ...
Pedro Andrade   +3 more

A Q-Learning Proposal for Tuning Genetic Algorithms in Flexible Job Shop Scheduling Problems

open access: yes | Proceedings of the International Florida Artificial Intelligence Research Society Conference, 2023
Genetic algorithms (GAs) are evolutionary algorithms that are frequently used to solve challenging combinatorial problems.
Christian Perez   +2 more

Maximum Power Point Tracking Based on Reinforcement Learning Using Evolutionary Optimization Algorithms

open access: yes | Energies, 2021
In this paper, two universal reinforcement learning methods are considered to solve the problem of maximum power point (MPP) tracking for photovoltaics. Both methods reach the MPP quickly under varying environmental conditions and are applicable ...
Kostas Bavarinos   +2 more

Q-learning based strategy analysis of cyber-physical systems considering unequal cost

open access: yes | Intelligent and Converged Networks, 2023
This paper proposes a cyber security strategy for cyber-physical systems (CPS) based on Q-learning under unequal cost to obtain a more efficient and low-cost cyber security defense strategy with misclassification interference.
Xin Chen   +5 more

Q-LVS: A Q-Learning-based Algorithm for Video Streaming in Peer-to-Peer Networks Considering a Token-Based Incentive Mechanism

open access: yes | Journal of Artificial Intelligence and Data Mining, 2022
Peer-to-peer video streaming has attracted great attention in recent years. Video streaming in peer-to-peer networks is a good way to stream video on the Internet because of its high scalability, high video quality, and low bandwidth requirements.
Z. Imanimehr

Output Feedback Control for Deterministic Unknown Dynamics Discrete-Time System Using Deep Recurrent Q-Networks

open access: yes | IEEE Access, 2023
Control theory is currently applied mostly to systems with a model or known system dynamics. In practice, however, this is difficult to achieve because not all state information can be known. The use of Output Feedback (OPFB) ...
Adi Novitarini Putri   +3 more
