Results 51 to 60 of about 241,763 (280)
Ramp Metering Control Based on the Q-Learning Algorithm
Modern urban highways are under the influence of increased traffic demand and cannot fulfill the desired level of service anymore. In most of the cases there is no space available for any infrastructure building.
Ivanjko Edouard +5 more
doaj +1 more source
This work has received funding from the EU Horizon 2020 research and innovation program under project DOMINOES (grant agreement No 771066) and from FEDER Funds through COMPETE program and from National Funds through FCT under projects CEECIND/01811/2017 and UIDB/00760 ...
Vale, Zita, Pinto, Tiago
openaire +2 more sources
Fragile X syndrome (FXS) is a neurodevelopmental disorder caused by hypermethylation of expanded CGG repeats (>200) in the FMR1 gene leading to gene silencing and loss of Fragile X Messenger Ribonucleoprotein (FMRP) expression. FMRP plays important roles
James J. Fink +20 more
doaj +1 more source
Complexification through gradual involvement and reward Providing in deep reinforcement learning
Training a relatively big neural network within the framework of deep reinforcement learning that has enough capacity for complex tasks is challenging. In real life the process of task solving requires system of knowledge, where more complex skills are ...
E. V. Rulko,
doaj +1 more source
Time-inhomogeneous finite-horizon Markov decision processes (MDP) are frequently employed to model decision-making in dynamic treatment regimes and other statistical reinforcement learning (RL) scenarios. These fields, especially healthcare and business, often face challenges such as high-dimensional state spaces and time-inhomogeneity of the MDP ...
Chen, Elynn, Li, Sai, Jordan, Michael I.
openaire +2 more sources
EMBEDDED LEARNING ROBOT WITH FUZZY Q-LEARNING FOR OBSTACLE AVOIDANCE BEHAVIOR [PDF]
Fuzzy Q-learning is extending of Q-learning algorithm that uses fuzzy inference system to enable Q-learning holding continuous action and state. This learning has been implemented in various robot learning application like obstacle avoidance and target ...
Anam, Khairul
core
Training task-completion dialogue agents with reinforcement learning usually requires a large number of real user experiences. The Dyna-Q algorithm extends Q-learning by integrating a world model, and thus can effectively boost training efficiency using ...
Gao, Jianfeng +4 more
core +1 more source
Subtype‐specific enhancer RNAs define transcriptional regulators and prognosis in breast cancers
This study employed machine learning methodologies to perform the subtype‐specific classification of RNA‐seq data sets, which are mapped on enhancers from TCGA‐derived breast cancer patients. Their integration with gene expression (referred to as ProxCReAM eRNAs) and chromatin accessibility profiles has the potential to identify lineage‐specific and ...
Aamena Y. Patel +6 more
wiley +1 more source
Offloading decision algorithm based on reinforcement learning for mobile edge computing
For the problem of computing offloading decision in mobile edge computing, this paper proposes an offloading decision algorithm based on enhanced learning in multiuser MEC system.
Yang Ge, Zhang Heng
doaj +1 more source
The use of target networks is a common practice in deep reinforcement learning for stabilizing the training; however, theoretical understanding of this technique is still limited. In this paper, we study the so-called periodic Q-learning algorithm (PQ-learning for short), which resembles the technique used in deep Q-learning for solving infinite ...
Lee, Donghwan, He, Niao
openaire +2 more sources

