Results 41 to 50 of about 184,437 (258)
Safe Q-Learning Method Based on Constrained Markov Decision Processes
The application of reinforcement learning in industrial fields makes the safety problem of the agent a research hotspot. Traditional methods mainly alter the objective function and the exploration process of the agent to address the safety problem. Those
Yangyang Ge +3 more
doaj +1 more source
Absorbing Markov decision processes
In this paper, we study discrete-time absorbing Markov Decision Processes (MDP) with measurable state space and Borel action space with a given initial distribution. For such models, solutions to the characteristic equation that are not occupation measures may exist.
Dufour, François, Prieto-Rumeau, Tomás
openaire +3 more sources
Fostering Innovation: Streamlining Magnetocaloric Materials Research by Digitalization
Magnetocaloric cooling (MCE) is an environmentally friendly refrigeration method with great potential. Optimizing MCE materials involves the preparation and screening of large quantities of samples, which in turn generates a large amount of data. A digitalization approach is presented that uses ontologies, knowledge graphs, and digital workflows to ...
Simon Bekemeier +17 more
wiley +1 more source
Dynamic Watermarking for Finite Markov Decision Processes
Dynamic watermarking is an active intrusion detection technique that can potentially detect replay attacks, spoofing attacks, and deception attacks in the feedback channel for control systems.
Jiacheng Tang +2 more
doaj +1 more source
Plasmonic Enhancement of Fluorescence and Protein Dynamics in Living Mammalian Cells
This study demonstrates plasmonic enhancement of the function of fluorescent voltage sensing proteins (genetically encoded voltage indicators, (GEVIs), QuasAr6) in live mammalian cells. Coupling to plasmonic nanoparticles does not just increase fluorescence, but influences the protein photocycle, creating a hybrid sensor with its response speed to ...
Marco Locarno +16 more
wiley +1 more source
Nonapproximability Results for Partially Observable Markov Decision Processes
We show that for several variations of partially observable Markov decision processes, polynomial-time algorithms for finding control policies are unlikely to or simply don't have guarantees of finding policies within a constant factor or a constant ...
Goldsmith, J., Lusena, C., Mundhenk, M.
core +1 more source
Recent Advances of Slip Sensors for Smart Robotics
This review summarizes recent progress in robotic slip sensors across mechanical, electrical, thermal, optical, magnetic, and acoustic mechanisms, offering a comprehensive reference for the selection of slip sensors in robotic applications. In addition, current challenges and emerging trends are identified to advance the development of robust, adaptive,
Xingyu Zhang +8 more
wiley +1 more source
A dual‐domain PSLC architecture enables direct comparison of alignment‐dependent laser damage within a single device. Crack‐like and seal‐like morphologies emerge under different damage conditions, and their evolution is interpreted through quantitative image analysis and heat‐driven simulations.
Dengcheng Chen +4 more
wiley +1 more source
Learning Highly Dynamic Skills Transition for Quadruped Jumping Through Constrained Space
A quadruped robot masters dynamic jumps through constrained spaces with animal‐inspired moves and intelligent vision control. This hierarchical learning approach combines imitation of biological agility with real‐time trajectory planning. Although legged animals are capable of performing explosive motions while traversing confined spaces, replicating ...
Zeren Luo +6 more
wiley +1 more source
Factored Beliefs for Machine Agents in Decentralized Partially Observable Markov Decision Processes
A shared mental model (SMM) is a foundational structure in high performing, task-oriented teams and aid humans in determining their teammate's goals and intentions.
Joshua Lapso, Gilbert Peterson
doaj +1 more source

