Results 31 to 40 of about 236 (54)
REINFORCEMENT LEARNING FOR INDIVIDUAL OPTIMAL POLICY FROM HETEROGENEOUS DATA. [PDF]
Miao BR, Shahbaba B, Qu A.
europepmc +1 more source
ON THE FIRST PASSAGE g-MEAN-VARIANCE OPTIMALITY FOR DISCOUNTED CONTINUOUS-TIME MARKOV DECISION PROCESSES [PDF]
Guo, Xianping +2 more
core +2 more sources
Some of the next articles are maybe not open access.
Distributionally Robust Inventory Control When Demand Is a Martingale
Mathematics of Operations Research, 2022Linwei Xin, David Goldberg
exaly
Robust Markov Decision Processes: Beyond Rectangularity
Mathematics of Operations Research, 2023Vineet Goyal, Julien Grand-Clement
exaly
Finite-Memory Strategies in POMDPs with Long-Run Average Objectives
Mathematics of Operations Research, 2022Krishnendu Chatterjee +2 more
exaly
Provably Efficient Reinforcement Learning with Linear Function Approximation
Mathematics of Operations Research, 2023Chi Jin, Zhuoran Yang, Zhaoran Wang
exaly
Adaptive Bin Packing with Overflow
Mathematics of Operations Research, 2022Sebastian Perez-Salazar +2 more
exaly
Analyzing Approximate Value Iteration Algorithms
Mathematics of Operations Research, 2022Arunselvan Ramaswamy, Shalabh Bhatnagar
exaly
Stochastic Comparative Statics in Markov Decision Processes
Mathematics of Operations Research, 2021Bar Light
exaly

