Markov decision process - Open Access .click

Results 261 to 270 of about 57,303 (303)

A Comparison of Methods for Modeling Multistate Cancer Progression Using Screening Data with Censoring after Intervention. [PDF]

Med Decis Making
Akwiwu EU, Coupé VMH, Berkhof J, Klausch T. +3 more
europepmc +1 more source

In Silico Evaluation of Algorithm-Based Clinical Decision Support Systems Based on Care Pathway Simulation Models: Scoping Review. [PDF]

JMIR AI
Dorosan M, Chen YL, He Y, Zhuang Q, Lam SSW. +4 more
europepmc +1 more source

An Entropy-Based Framework for Hybrid Coalitions in Game Theory-Part I: Human Arbitration. [PDF]

Entropy (Basel)
Sepúlveda-Fontaine SA, Amigó JM.
europepmc +1 more source

A Bumblebee-Inspired Spatial Memory Navigation Framework for Robotic Odor Source Localization. [PDF]

Biomimetics (Basel)
Xu T, Guo Y, Wu Z, Wu J.
europepmc +1 more source

Implementing a CYP2C19-guided approach for prescribing dual antiplatelet therapy in acute coronary syndrome for patients undergoing percutaneous coronary intervention: a cost-effectiveness analysis

Mahboub-Ahari A +6 more
europepmc +1 more source

Some of the next articles are maybe not open access.

Related searches:

reinforcement learning
q-learning

MARKOV DECISION PROCESSES

Statistica Neerlandica, 1985
AbstractA review is presented of the development over the years of the theory and practical use of Markov decision processes. To this purpose three periods are considered: before 1966, from 1966 till 1972, and after 1973. In all 3 periods there has been some contribution from the Netherlands, but particularly in the last period the research in the ...
Wal, van der, J., Wessels, J.
openaire +1 more source

On the Generation of Markov Decision Processes

Journal of the Operational Research Society, 1995
Summary: Comparisons of the performance of solution algorithms for Markov decision processes rely heavily on problem generators to provide sizeable sets of test problems. Existing generation techniques allow little control over the properties of the test problems and often result in problems which are not typical of real-world examples.
Archibald, T. W., McKinnon, K. I. M., Thomas, L. C. +2 more
openaire +2 more sources

Online Markov Decision Processes

Mathematics of Operations Research, 2009
We consider a Markov decision process (MDP) setting in which the reward function is allowed to change after each time step (possibly in an adversarial manner), yet the dynamics remain fixed. Similar to the experts setting, we address the question of how well an agent can do when compared to the reward achieved under the best stationary policy over ...
Eyal Even-Dar, Sham M. Kakade, Yishay Mansour +2 more
openaire +1 more source

Monotonicity in a Markov Decision Process

Mathematics of Operations Research, 1988
Concavity of optimal costs and monotonicity of optimal actions are established for a Markov decision problem in which state space and action space are ordered, but in which the cost functions do not possess properties commonly used to establish monotonicity.
openaire +2 more sources

Risk-Constrained Markov Decision Processes

IEEE Transactions on Automatic Control, 2010
zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Vivek S. Borkar, Rahul Jain 0002
openaire +3 more sources

reinforcement learning
q-learning