Multimodal Human–Robot Interaction Using Human Pose Estimation and Local Large Language Models
A multimodal human–robot interaction framework integrates human pose estimation (HPE) and a large language model (LLM) for gesture‐ and voice‐based robot control. Speech‐to‐text (STT) enables voice command interpretation, while a safety‐aware arbitration mechanism prioritizes gesture input for rapid intervention.
Nasiru Aboki +2 more
wiley +1 more source
Verification of the factors of individuality through avatar's speech generation system. [PDF]
Komai Y, Uchida T, Kamide H, Ishiguro H.
europepmc +1 more source
LLM‐Integrated Human–Robot Interaction System for Microrobots
This paper proposes an LLM‐based control framework for guiding microrobots using natural language. The framework converts natural human speech into safe, executable command sets for reliable navigation in complex environments. The experimental results show high accuracy and robustness in task performance, demonstrating the potential of ...
Bairong Zhu, Amar Salehi, Tingting Yu
wiley +1 more source
Cognitive Outcomes After Cochlear Implantation in Older Adults: A Narrative Review of Current Evidence, Mechanisms, and Long-Term Perspectives. [PDF]
Falchetta L +9 more
europepmc +1 more source
Voice quality modelling for expressive speech synthesis. [PDF]
Monzo C, Iriondo I, Socoró JC.
europepmc +1 more source
3D‐Printing Aided Rapid Prototyping of Pretensioned Tensegrity Structures for Robotic Applications
Printing, injection molding, and assembly (PMA) is a method for rapidly prototyping mesoscale, topologically complex, tensioned tensegrity structures. Combined with the PMA method, two mold design strategies, a modular mold and a compact channel layout, make tensegrity fabrication efficient and scalable.
Yi Sun +3 more
wiley +1 more source
DRIVE‐SAFE evaluates learning‐based, black‐box autonomous driving policies against evolving temporal safety requirements using Signal Temporal Logic robustness metrics. It aggregates distributional robustness measures with domain‐informed weights to guide iterative retraining.
Kristy Sakano +3 more
wiley +1 more source
Immediate Effects of Delayed Auditory Feedback on Stuttering: A Systematic Review and Meta-Analysis of Literature Published 2000-2024. [PDF]
Iimura D, Yamamoto T, Ishida O.
europepmc +1 more source
This review maps methods for monitoring robot health by fusing vibration, sound, control signals, vision, force, and oil information with artificial intelligence. It identifies deep learning, transfer learning, digital twins, and physics‐informed models as key methodological pathways enabling earlier diagnosis, safer human–robot collaboration, and ...
Yuting Qiao +6 more
wiley +1 more source
A speech-to-video synthesis approach using spatio-temporal diffusion for vocal tract MRI. [PDF]
Pérez-Toro PA +11 more
europepmc +1 more source

