Results 61 to 70 of about 199,848 (147)
Adaptive Q-Learning Grey Wolf Optimizer for UAV Path Planning
Path planning is crucial for safely and efficiently navigating unmanned aerial vehicles (UAVs) toward operational goals. Often, this is a complex, multi-constraint, and non-linear optimization problem, and metaheuristic algorithms are frequently used to ...
Golam Moktader Nayeem +2 more
doaj +1 more source
We develop methodology for a multistage-decision problem with flexible number of stages in which the rewards are survival times that are subject to censoring. We present a novel Q-learning algorithm that is adjusted for censored data and allows a flexible number of stages.
Goldberg, Yair, Kosorok, Michael R.
openaire +4 more sources
Maximum Power Point Tracking of Photovoltaic System Based on Reinforcement Learning
The maximum power point tracking (MPPT) technique is often used in photovoltaic (PV) systems to extract the maximum power in various environmental conditions.
Kuan-Yu Chou +2 more
doaj +1 more source
Deep Q-learning From Demonstrations
Deep reinforcement learning (RL) has achieved several high profile successes in difficult decision-making problems. However, these algorithms typically require a huge amount of data before they reach reasonable performance. In fact, their performance during learning can be extremely poor.
Hester, Todd +13 more
openaire +2 more sources
CUBIC-Learn: A Reinforcement Learning Approach to CUBIC Congestion Control
Managing congestion effectively enables reliable and fast data transfer over networks. CUBIC delivers reliable results under normal circumstances but cannot adapt effectively to changing network scenarios.
Ehsan Abedini, Mohsen Nickray
doaj +1 more source
Energy and economy are increasing the relationship over the years, where the energy becomes a significant resource to keep a country developing, and it supports its economy.
Lucas Roberto Ferreira +2 more
doaj +1 more source
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
ICLR ...
Lan, Qingfeng +3 more
openaire +2 more sources
Developing an effective multi-stage treatment strategy over time is one of the essential goals of modern medical research. Developing statistical inference, including constructing confidence intervals for parameters, is of key interest in studies applying dynamic treatment regimens.
Goldberg, Yair +2 more
openaire +2 more sources
Este trabajo introduce una estrategia híbrida de planificación de caminos para vehículos robóticos tipo diferencial, combinando métodos de aprendizaje por refuerzo con técnicas de muestreo aleatorio.
Sebastian Zapata +5 more
doaj +1 more source
Q-learning Based Meta-Heuristics for Scheduling Bi-Objective Surgery Problems with Setup Time
Since the increasing demand for surgeries in hospitals, the surgery scheduling problems have attracted extensive attention. This study focuses on solving a surgery scheduling problem with setup time. First, a mathematical model is created to minimize the
Ruixue Zhang +3 more
doaj +1 more source

