Q-learning - Open Access .click


The impact of exploration on convergence and performance of multi-agent Q-learning dynamics

, 2023
Understanding the impact of exploration on the behaviour of multi-agent learning has, so far, benefited from the restriction to potential games or network zero-sum games, in which convergence to an equilibrium can be shown.
Hussain, A, Paccagnan, D, Belardinelli, F +2 more
core

Subtype‐specific enhancer RNAs define transcriptional regulators and prognosis in breast cancers

Molecular Oncology, EarlyView.
This study employed machine learning methodologies to perform the subtype‐specific classification of RNA‐seq data sets, which are mapped on enhancers from TCGA‐derived breast cancer patients. Their integration with gene expression (referred to as ProxCReAM eRNAs) and chromatin accessibility profiles has the potential to identify lineage‐specific and ...
Aamena Y. Patel +6 more
wiley +1 more source

Offloading decision algorithm based on reinforcement learning for mobile edge computing

Dianzi Jishu Yingyong, 2021
To address the computation offloading decision problem in mobile edge computing (MEC), this paper proposes an offloading decision algorithm based on reinforcement learning for multiuser MEC systems.
Yang Ge, Zhang Heng
doaj +1 more source

Developmental programmes drive cellular plasticity, disease progression and therapy resistance in lung adenocarcinoma

Molecular Oncology, EarlyView.
This study shows that lung adenocarcinomas exploit developmental branching morphogenesis to acquire a therapy resistant basal‐like tumour cell state. This process was found to be regulated by combined TP53 loss‐of‐function and type‐I interferon signalling, identifying a novel axis for biomarker and therapeutic target discovery.
Kamila J Bienkowska +13 more
wiley +1 more source

Long‐Term Follow‐Up of Chemotherapy‐Associated Biological Aging in Women With Early Breast Cancer

Aging and Cancer, EarlyView.
Women treated with adjuvant chemotherapy for early breast cancer have a sustained long‐term increase in p16INK4a, a robust marker of cell senescence, suggesting a chemotherapy‐associated age acceleration. p16INK4a, as well as other biomarkers, may identify patients at greatest risk for senescence‐related diseases of aging.
Hyman B. Muss +12 more
wiley +1 more source

Q-learning for history-based reinforcement learning [PDF]

, 2013
We extend the Q-learning algorithm from the Markov Decision Process setting to problems where observations are non-Markov and do not reveal the full state of the world, i.e., to POMDPs. We do this in a
Daswani, Mayank, Hutter, Marcus, Sunehag, Peter +2 more
core

Value of MRI Outcomes for Preventive and Early‐Stage Trials in Spinocerebellar Ataxias 1 and 3

Annals of Clinical and Translational Neurology, EarlyView.
ABSTRACT Objective To examine the value of MRI outcomes as endpoints for preventive and early‐stage trials of two polyglutamine spinocerebellar ataxias (SCAs). Methods A cohort of 100 participants (23 SCA1, 63 SCA3, median Scale for the Assessment and Rating of Ataxia (SARA) score = 5, 42% preataxic, and 14 gene‐negative controls) was scanned at 3T up ...
Thiago J. R. Rezende +26 more
wiley +1 more source

Efficacy of Intermittent Theta‐Burst Stimulation for Prolonged Disorders of Consciousness: A Prospective, Randomized, Controlled Trial

Annals of Clinical and Translational Neurology, EarlyView.
ABSTRACT Background Emerging evidence suggests that low‐frequency neural oscillations are dynamically regulated by consciousness levels, with the recovery of low cortical activity potentially serving as a neurophysiological substrate for conscious emergence. Targeted enhancement of these low‐frequency rhythms in patients with disorders of consciousness
Chuan Xu +10 more
wiley +1 more source

A Q-Learning Based Hybrid Meta-Heuristic for Integrated Scheduling of Disassembly and Reprocessing Processes Considering Product Structures and Stochasticity

Complex System Modeling and Simulation
Remanufacturing is regarded as a sustainable manufacturing paradigm of energy conservation and environment protection. To improve the efficiency of the remanufacturing process, this work investigates an integrated scheduling problem for disassembly and ...
Fuquan Wang +4 more
doaj +1 more source

Selectively decentralized q-learning

, 2023
In this paper, we explore the capability of the selectively decentralized Q-learning approach to learn how to optimally stabilize control systems, as compared to the centralized approach.
S Mukhopadhyay, T Nguyen +1 more
core
