AVCLNet: Multimodal Multispeaker Tracking Network Using Audio‐Visual Contrastive Learning
ABSTRACT Audio‐visual speaker tracking aims to determine the locations of multiple speakers in the scene by leveraging signals captured from multisensor platforms. Multimodal fusion methods can improve both the accuracy and robustness of speaker tracking.
Yihan Li +5 more
wiley +1 more source
An Analysis of Indigenisation Dynamics of Kenya Army Band Martial Music
David Ekal +2 more
openalex +2 more sources
Games and gamification projects in the Australian public sector
Abstract This article surveys the arrival of gameful government into Australian public sector practice. Gameful government is a shorthand, descriptive term denoting the interpenetration of (video)games, and design elements and thinking from them, into public sector work.
David Threlfall, Catherine Althaus
wiley +1 more source
Temporal Predictions in Music and Language: The Case of Autism Spectrum Disorder. [PDF]
Denis M +3 more
europepmc +1 more source
Editorial: Interpersonal synchrony and network dynamics in social interaction, volume II. [PDF]
Müller V +3 more
europepmc +1 more source
Computational modeling of rhythmic expectations: Perspectives, pitfalls, and prospects. [PDF]
Damsma A +8 more
europepmc +1 more source
A music source separation method integrating time-frequency decoupling and mamba-based state space modeling. [PDF]
Zhang C, Zheng J, Cao M.
europepmc +1 more source
Temporal prediction and feedforward control in cerebellar ataxia during spontaneous, instructed, and adaptive auditory-motor coupling while walking. [PDF]
Moumdjian L +7 more
europepmc +1 more source

