Optimal Control of Partially Observable Piecewise Deterministic Markov Processes
In this paper we consider a control problem for a Partially Observable Piecewise Deterministic Markov Process of the following type: After the jump of the process the controller receives a noisy signal about the state and the aim is to control the ...
Bäuerle, Nicole, Lange, Dirk
core +1 more source
This study introduces a data‐driven framework that combines deep reinforcement learning with classical path planning to achieve adaptive microrobot navigation. By training a surrogate neural network to emulate microrobot dynamics, the approach improves learning efficiency, reduces training time, and enables robust real‐time obstacle avoidance in ...
Amar Salehi +3 more
wiley +1 more source
Update or Wait: How to Keep Your Data Fresh
In this work, we study how to optimally manage the freshness of information updates sent from a source node to a destination via a channel. A proper metric for data freshness at the destination is the age-of-information, or simply age, which is defined ...
Koksal, C. Emre +4 more
core +1 more source
Inference Strategies for Solving Semi-Markov Decision Processes
Semi-Markov decision processes are used to formulate many control problems and also play a key role in hierarchical reinforcement learning. In this chapter we show how to translate the decision making problem into a form that can instead be solved by inference and learning techniques.
Hoffman, M, de Freitas, N
openaire +2 more sources
This review explores the transformative impact of artificial intelligence on multiscale modeling in materials research. It highlights advancements such as machine learning force fields and graph neural networks, which enhance predictive capabilities while reducing computational costs in various applications.
Artem Maevskiy +2 more
wiley +1 more source
Infrastructure assets, such as pavements, naturally deteriorate over time due to traffic loads, environmental conditions, and other external factors. Traditionally, deterministic models have been employed to predict performance, aiding in work planning ...
Che Shobry Shahid +5 more
doaj +1 more source
Wide sense one-dependent processes with embedded Harris chains and their applications in inventory management
In this paper we consider stochastic processes with an embedded Harris chain. The embedded Harris chain describes the dependence structure of the stochastic process.
Bazsa-Oldenkamp, E.M. (Emö) +1 more
core
Verification of Uncertain POMDPs Using Barrier Certificates
We consider a class of partially observable Markov decision processes (POMDPs) with uncertain transition and/or observation probabilities. The uncertainty takes the form of probability intervals.
Ahmadi, Mohamadreza +3 more
core +1 more source
Artificial intelligence (AI) is reshaping autonomous mobile robot navigation beyond classical pipelines. This review analyzes how AI techniques are integrated into core navigation tasks, including path planning and control, localization and mapping, perception, and context‐aware decision‐making. Learning‐based, probabilistic, and soft‐computing methods ...
Giovanna Guaragnella +5 more
wiley +1 more source
Continuous-time Markov decision processes under the risk-sensitive average cost criterion
This paper studies continuous-time Markov decision processes under the risk-sensitive average cost criterion. The state space is a finite set, the action space is a Borel space, the cost and transition rates are bounded, and the risk-sensitivity ...
Chen, Xian, Wei, Qingda
core +1 more source

