Children Sustain Their Attention on Spatial Scenes When Planning to Describe Spatial Relations Multimodally in Speech and Gesture. [PDF]
Ünal E, Karadöller DZ, Özyürek A.
europepmc +1 more source
A Dataset of Hindi-English Code-Mixed Social Media Text for Hate Speech Detection [PDF]
Aditya Bohra +4 more
openalex +1 more source
The Future of Research in Cognitive Robotics: Foundation Models or Developmental Cognitive Models?
Research in cognitive robotics founded on principles of developmental psychology and enactive cognitive science would yield what we seek in autonomous robots: the ability to perceive its environment, learn from experience, anticipate the outcome of events, act to pursue goals, and adapt to changing circumstances without resorting to training with ...
David Vernon
wiley +1 more source
Code Breaking for Automatic Speech Recognition [PDF]
Frederick Jelinek
openalex +1 more source
TunSwitch: Code-Switched Tunisian Arabic Speech Dataset
Salah Zaiem, Ahmed Abdallah
openalex +1 more source
Grounding Large Language Models for Robot Task Planning Using Closed‐Loop State Feedback
BrainBody‐Large Language Model (LLM) introduces a hierarchical, feedback‐driven planning framework where two LLMs coordinate high‐level reasoning and low‐level control for robotic tasks. By grounding decisions in real‐time state feedback, it reduces hallucinations and improves task reliability.
Vineet Bhat +4 more
wiley +1 more source
Neural Coding of Fundamental Frequency and Processing of Discrete Pitch Accents in Middle Age. [PDF]
McHaney JR +4 more
europepmc +1 more source
Continual Learning for Multimodal Data Fusion of a Soft Gripper
Models trained on a single data modality often struggle to generalize when exposed to a different modality. This work introduces a continual learning algorithm capable of incrementally learning different data modalities by leveraging both class‐incremental and domain‐incremental learning scenarios in an artificial environment where labeled data is ...
Nilay Kushawaha, Egidio Falotico
wiley +1 more source
Developing an AI-driven multimodal approach to visualising resilient team performance: joint attentional engagement with gaze and speech in simulated emergency scenarios. [PDF]
Miyazaki A +10 more
europepmc +1 more source
THE PREPROCESSING PROCEDURE OF DIGITAL STREAMS OF CODED SPEECH MESSAGES
A.N. Gomonov, D.V. Gerasin
openalex +1 more source

