Results 91 to 100 of about 401,816 (284)
Quadrotor unmanned aerial vehicle control is critical to maintain flight safety and efficiency, especially when facing external disturbances and model uncertainties. This article presents a robust reinforcement learning control scheme to deal with these challenges.
Yu Cai +3 more
wiley +1 more source
Robust Markov Decision Processes [PDF]
Markov decision processes (MDPs) are powerful tools for decision making in uncertain dynamic environments. However, the solutions of MDPs are of limited practical use due to their sensitivity to distributional model parameters, which are typically ...
Berç Rustem +2 more
core
Large Language Model‐Based Chatbots in Higher Education
The use of large language models (LLMs) in higher education can facilitate personalized learning experiences, advance asynchronized learning, and support instructors, students, and researchers across diverse fields. The development of regulations and guidelines that address ethical and legal issues is essential to ensure safe and responsible adaptation
Defne Yigci +4 more
wiley +1 more source
ON THE INFINITE ORDER MARKOV PROCESSES [PDF]
The notion of infinite order Markov process is introduced and the Markov property of the flow of information is established.
doaj
Variational Autoencoder+Deep Deterministic Policy Gradient addresses low‐light failures of infrared depth sensing for indoor robot navigation. Stage 1 pretrains an attention‐enhanced Variational Autoencoder (Convolutional Block Attention Module+Feature Pyramid Network) to map dark depth frames to a well‐lit reconstruction, yielding a 128‐D latent code ...
Uiseok Lee +7 more
wiley +1 more source
This study introduces a data‐driven framework that combines deep reinforcement learning with classical path planning to achieve adaptive microrobot navigation. By training a surrogate neural network to emulate microrobot dynamics, the approach improves learning efficiency, reduces training time, and enables robust real‐time obstacle avoidance in ...
Amar Salehi +3 more
wiley +1 more source
The linear framework II: using graph theory to analyse the transient regime of Markov processes. [PDF]
Nam KM, Gunawardena J.
europepmc +1 more source
Deep Reinforcement Learning Approaches for Sensor Data Collection by a Swarm of UAVs
This article presents four decentralized reinforcement learning algorithms for autonomous data harvesting and investigates how collaboration improves collection efficiency. It also presents strategies to minimize training times by improving model flexibility, enabling algorithms to operate with varying number of agents and sensors.
Thiago de Souza Lamenza +2 more
wiley +1 more source
The issues concerning the potential application of Markov processes theory for efficient management decision making based on the production enterprise business modeling are considered.
Evgeniya R. Khabibullina
doaj
Markov Processes and Related Topics (III). [PDF]
Hong W, Mao Y, Zhang Y.
europepmc +1 more source

