Markov decision processes - Open Access .click

Results 41 to 50 of about 184,437 (258)

Safe Q-Learning Method Based on Constrained Markov Decision Processes

IEEE Access, 2019
The application of reinforcement learning in industrial fields makes the safety problem of the agent a research hotspot. Traditional methods mainly alter the objective function and the exploration process of the agent to address the safety problem. Those
Yangyang Ge, Fei Zhu, Xinghong Ling, Quan Liu +3 more
doaj +1 more source

Absorbing Markov decision processes

ESAIM: Control, Optimisation and Calculus of Variations
In this paper, we study discrete-time absorbing Markov Decision Processes (MDP) with measurable state space and Borel action space with a given initial distribution. For such models, solutions to the characteristic equation that are not occupation measures may exist.
Dufour, François, Prieto-Rumeau, Tomás
openaire +3 more sources

Fostering Innovation: Streamlining Magnetocaloric Materials Research by Digitalization

Advanced Engineering Materials, EarlyView.
Magnetocaloric cooling (MCE) is an environmentally friendly refrigeration method with great potential. Optimizing MCE materials involves the preparation and screening of large quantities of samples, which in turn generates a large amount of data. A digitalization approach is presented that uses ontologies, knowledge graphs, and digital workflows to ...
Simon Bekemeier +17 more
wiley +1 more source

Dynamic Watermarking for Finite Markov Decision Processes

IEEE Open Journal of Control Systems
Dynamic watermarking is an active intrusion detection technique that can potentially detect replay attacks, spoofing attacks, and deception attacks in the feedback channel for control systems.
Jiacheng Tang, Jiguo Song, Abhishek Gupta +2 more
doaj +1 more source

Plasmonic Enhancement of Fluorescence and Protein Dynamics in Living Mammalian Cells

Advanced Materials, EarlyView.
This study demonstrates plasmonic enhancement of the function of fluorescent voltage sensing proteins (genetically encoded voltage indicators, (GEVIs), QuasAr6) in live mammalian cells. Coupling to plasmonic nanoparticles does not just increase fluorescence, but influences the protein photocycle, creating a hybrid sensor with its response speed to ...
Marco Locarno +16 more
wiley +1 more source

Nonapproximability Results for Partially Observable Markov Decision Processes

, 2011
We show that for several variations of partially observable Markov decision processes, polynomial-time algorithms for finding control policies are unlikely to or simply don't have guarantees of finding policies within a constant factor or a constant ...
Goldsmith, J., Lusena, C., Mundhenk, M.
core +1 more source

Recent Advances of Slip Sensors for Smart Robotics

Advanced Materials Technologies, EarlyView.
This review summarizes recent progress in robotic slip sensors across mechanical, electrical, thermal, optical, magnetic, and acoustic mechanisms, offering a comprehensive reference for the selection of slip sensors in robotic applications. In addition, current challenges and emerging trends are identified to advance the development of robust, adaptive,
Xingyu Zhang +8 more
wiley +1 more source

The Evolution of Laser‐Induced Damage Patterns in Polymer Stabilized Liquid Crystals: Insights From Morphological Characterization and Thermo‐Driven Simulations

Advanced Optical Materials, EarlyView.
A dual‐domain PSLC architecture enables direct comparison of alignment‐dependent laser damage within a single device. Crack‐like and seal‐like morphologies emerge under different damage conditions, and their evolution is interpreted through quantitative image analysis and heat‐driven simulations.
Dengcheng Chen +4 more
wiley +1 more source

Learning Highly Dynamic Skills Transition for Quadruped Jumping Through Constrained Space

Advanced Robotics Research, EarlyView.
A quadruped robot masters dynamic jumps through constrained spaces with animal‐inspired moves and intelligent vision control. This hierarchical learning approach combines imitation of biological agility with real‐time trajectory planning. Although legged animals are capable of performing explosive motions while traversing confined spaces, replicating ...
Zeren Luo +6 more
wiley +1 more source

Factored Beliefs for Machine Agents in Decentralized Partially Observable Markov Decision Processes

Proceedings of the International Florida Artificial Intelligence Research Society Conference, 2022
A shared mental model (SMM) is a foundational structure in high performing, task-oriented teams and aid humans in determining their teammate's goals and intentions.
Joshua Lapso, Gilbert Peterson
doaj +1 more source

mathematics
computer science - logic in computer science
markov and semi-markov decision processes

markov decision process
computer science - artificial intelligence
fos: computer and information sciences

dynamic programming
reinforcement learning