
Cost-utility analysis of botulinum toxin type A versus oral drug treatment in patients with severe blepharospasm in Thailand. [PDF]

PLoS One (open access)
Hirunwiwatkul P   +4 more
europepmc   +1 more source

A Generic Markov Decision Process Model and Reinforcement Learning Method for Scheduling Agile Earth Observation Satellites

IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2022
We investigate a general solution based on reinforcement learning for the agile satellite scheduling problem. The core idea of this method is to determine a value function for evaluating the long-term benefit under a certain state by training from ...
Yongming He   +5 more
semanticscholar   +1 more source

Offloading Time Optimization via Markov Decision Process in Mobile-Edge Computing

IEEE Internet of Things Journal, 2021
Computation offloading from a mobile device to the edge server is an emerging paradigm to reduce completion latency of intensive computations in mobile-edge computing (MEC).
Guisong Yang   +5 more
semanticscholar   +1 more source

Markov Decision Processes.

The Statistician, 1995
(1995). Markov Decision Processes. Journal of the Operational Research Society: Vol. 46, No. 6, pp. 792-793.
Stephen Brooks, Douglas J. White
openaire   +3 more sources

MARKOV DECISION PROCESSES

Statistica Neerlandica, 1985
A review is presented of the development over the years of the theory and practical use of Markov decision processes. To this purpose three periods are considered: before 1966, from 1966 till 1972, and after 1973. In all three periods there has been some contribution from the Netherlands, but particularly in the last period the research in the ...
Jaap Wessels, J. van der Wal
openaire   +2 more sources

Quantile Markov Decision Processes

Operations Research, 2022
Title: Sequential Decision Making Using Quantiles. The goal of a traditional Markov decision process (MDP) is to maximize the expectation of cumulative reward over a finite or infinite horizon. In many applications, however, a decision maker may be interested in optimizing a specific quantile of the cumulative reward. For example, a physician may want ...
Xiaocheng Li   +2 more
openaire   +3 more sources
