Partially observable markov decision process

Results 131 to 140 of about 35,395 (320)

Fourier Mass Lower Bounds for Batchelor‐Regime Passive Scalars

Communications on Pure and Applied Mathematics, EarlyView.
ABSTRACT Batchelor predicted that a passive scalar ψν$\psi ^\nu$ with diffusivity ν$\nu$, advected by a smooth fluid velocity, should typically have Fourier mass distributed as |ψ̂ν|2(k)≈|k|−d$|\widehat{\psi }^\nu |^2(k) \approx |k|^{-d}$ for |k|≪ν−1/2$|k| \ll \nu ^{-1/2}$.
William Cooperman, Keefer Rowan
wiley +1 more source

Using Graph-Enhanced Deep Reinforcement Learning for Distribution Network Fault Recovery

Machines
Fault recovery in distribution networks is a complex, high-dimensional decision-making task characterized by partial observability, dynamic topology, and strong interdependencies among components.
Yueran Liu, Peng Liao, Yang Wang
doaj +1 more source

Ambiguous partially observable Markov decision processes: Structural results and applications

Journal of Economic Theory, 2014
zbMATH Open Web Interface contents unavailable due to conflicting licenses.
openaire +2 more sources

Approximate Linear Programming for Constrained Partially Observable Markov Decision Processes

, 2015
Pascal Poupart +5 more
openalex +2 more sources

When Dihedral Angles Mask Denticity in Molecular Conductance

ChemPhysChem, EarlyView.
This study tests whether higher molecule–electrode denticity increases conductance in single‐molecule junctions. Using nonequilibrium Green's function technique in conjunction with density functional theory simulations and mechanically controlled break‐junction experiments on a tetradentate N‐heterohexacene, it is found that conductance depends mainly ...
Kevin Batzinger +8 more
wiley +1 more source

Optimizing prescription of chinese herbal medicine for unstable angina based on partially observable markov decision process. [PDF]

Evid Based Complement Alternat Med, 2013
Feng Y, Qiu Y, Zhou X, Wang Y, Xu H, Liu B. +5 more
europepmc +1 more source

Straight to Phase III: Model‐Informed Approach Speeds Depemokimab Clinical Development in Interleukin‐5‐Driven Diseases

Clinical Pharmacology &Therapeutics, EarlyView.
Straight to Phase III: Model‐informed approach speeds depemokimab clinical development in interleukin‐5‐driven diseases. IL‐5, a key mediator of type 2 inflammation, underlies various diseases, including severe asthma, CRSwNP, EGPA, and HES. Reduction in blood eosinophil count (BEC), a biomarker of IL‐5 activity, is commonly used to evaluate the ...
Chiara Zecchin +6 more
wiley +1 more source

Recorded recurrent deep reinforcement learning guidance laws for intercepting endoatmospheric maneuvering missiles

Defence Technology
This work proposes a recorded recurrent twin delayed deep deterministic (RRTD3) policy gradient algorithm to solve the challenge of constructing guidance laws for intercepting endoatmospheric maneuvering missiles with uncertainties and observation noise.
Xiaoqi Qiu, Peng Lai, Changsheng Gao, Wuxing Jing +3 more
doaj +1 more source

A nonlinear programming model for partially observable Markov decision processes: Finite horizon case

, 1995
Yasemin Serin
openalex +2 more sources

The Prevention of Eating Disorders in Australian Adolescents: A Modeled Cost‐Effectiveness Study

International Journal of Eating Disorders, EarlyView.
ABSTRACT Objective Prevention programs for eating disorders (EDs) have the potential to reduce the onset of these diseases and improve the mental health and well‐being of the general population. However, there is mixed evidence on whether routine implementation of such programs at the population level is cost‐effective.
Long Khanh‐Dao Le +5 more
wiley +1 more source

markov and semi-markov decision processes
deep reinforcement learning
dynamic programming

pomdp
markov decision process
fos: computer and information sciences

reinforcement learning
artificial intelligence cs.ai
partially observable markov decision processes