Speech synthesis - Open Access .click

Results 201 to 210 of about 50,727 (310)

Multimodal Human–Robot Interaction Using Human Pose Estimation and Local Large Language Models

Advanced Robotics Research, EarlyView.
A multimodal human–robot interaction framework integrates human pose estimation (HPE) and a large language model (LLM) for gesture‐ and voice‐based robot control. Speech‐to‐text (STT) enables voice command interpretation, while a safety‐aware arbitration mechanism prioritizes gesture input for rapid intervention.
Nasiru Aboki, Ilche Georgievski, Marco Aiello +2 more
wiley +1 more source

Verification of the factors of individuality through avatar's speech generation system. [PDF]

Sci Rep
Komai Y, Uchida T, Kamide H, Ishiguro H.
europepmc +1 more source

LLM‐Integrated Human–Robot Interaction System for Microrobots

Advanced Robotics Research, EarlyView.
This paper proposes an LLM‐based control framework for guiding microrobots using human natural language. This framework can convert the natural human speech into safe and executable command sets for reliable navigation in complex environments. The experimental results show high accuracy and robustness in task performance, demonstrating the potential of
Bairong Zhu, Amar Salehi, Tingting Yu
wiley +1 more source

Cognitive Outcomes After Cochlear Implantation in Older Adults: A Narrative Review of Current Evidence, Mechanisms, and Long-Term Perspectives. [PDF]

Audiol Res
Falchetta L +9 more
europepmc +1 more source

Voice quality modelling for expressive speech synthesis. [PDF]

ScientificWorldJournal, 2014
Monzo C, Iriondo I, Socoró JC.
europepmc +1 more source

3D‐Printing Aided Rapid Prototyping of Pretensioned Tensegrity Structures for Robotic Applications

Advanced Robotics Research, EarlyView.
Printing, injection molding, and assembly (PMA) is a method for rapid prototyping mesoscale, topologically complex, and tensioned tensegrity structures. In combination with PMA method, two mold design strategies: modular mold and compact channel layout, enable efficiency and scalability for tensegrity fabrication.
Yi Sun +3 more
wiley +1 more source

DRIVE‐SAFE: Data‐Driven Robustness and Informed Validation for Evolving Specifications via Formal Evaluation

Advanced Robotics Research, EarlyView.
DRIVE‐SAFE evaluates learning‐based, black‐box autonomous driving policies against evolving temporal safety requirements using Signal Temporal Logic robustness metrics. It aggregates distributional robustness measures with domain‐informed weights to guide iterative retraining.
Kristy Sakano, Jianyu An, Dinesh Manocha, Huan Xu +3 more
wiley +1 more source

Immediate Effects of Delayed Auditory Feedback on Stuttering: A Systematic Review and Meta-Analysis of Literature Published 2000-2024. [PDF]

Int J Lang Commun Disord
Iimura D, Yamamoto T, Ishida O.
europepmc +1 more source

Intelligent Maintenance Review for Robots: Multimodal Information, Deep Diagnosis and Embodied Artificial Intelligence

Advanced Robotics Research, EarlyView.
This review maps the methods to monitor robots’ health by fusing vibration, sound, control signals, vision, force, and oil information with artificial intelligence. It identifies deep learning, transfer learning, digital twins, and physics‐informed models as key methodological pathways enabling earlier diagnosis, safer human–robot collaboration, and ...
Yuting Qiao +6 more
wiley +1 more source

A speech-to-video synthesis approach using spatio-temporal diffusion for vocal tract MRI. [PDF]

Med Image Anal
Pérez-Toro PA +11 more
europepmc +1 more source

sound
acoustics
fos: computer and information sciences

deep learning
computation and language cs.cl
computer science - computation and language

audio and speech processing eess.as