Results 1 to 10 of about 12,622 (268)
Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers [PDF]
Video captioning is an essential technology to understand scenes and describe events in natural language. To apply it to real-time monitoring, a system needs not only to describe events accurately but also to produce the captions as soon as possible. Low-latency captioning is needed to realize such functionality, but this research area for online video
Hori, Chiori +2 more
openaire +2 more sources
Active Middle Ear Implant Evoked Auditory Brainstem Response Intensity-Latency Characteristics
ObjectiveTo analyze intensity-latency functions of intraoperative auditory evoked brainstem responses (ABRs) to stimulation by the Vibrant Soundbridge (VSB) active middle ear implant with respect to coupling efficiency, VSB evoked ABR thresholds, and ...
Laura Fröhlich +7 more
doaj +1 more source
audiomath: A neuroscientist's sound toolkit
In neuroscientific experiments and applications, working with auditory stimuli demands software tools for generation and acquisition of raw audio, for composition and tailoring of that material into finished stimuli, for precisely timed presentation of ...
N. Jeremy Hill +2 more
doaj +1 more source
Synchronization of ear-EEG and audio streams in a portable research hearing device
Recent advancements in neuroscientific research and miniaturized ear-electroencephalography (EEG) technologies have led to the idea of employing brain signals as additional input to hearing aid algorithms.
Steffen Dasenbrock +11 more
doaj +1 more source
Energy-efficient low-latency audio on android [PDF]
Abstract Counting more than two billion devices, Android is nowadays one of the most popular open-source general-purpose operating systems, based on Linux. Because of the diversity of applications that can be installed, it manages a number of different workloads, many of them requiring performance/QoS guarantees.
Alessio Balsini +6 more
openaire +1 more source
FaSNet: Low-Latency Adaptive Beamforming for Multi-Microphone Audio Processing [PDF]
Accepted to ASRU ...
Lou, Y +4 more
openaire +2 more sources
Online Spectrogram Inversion for Low-Latency Audio Source Separation [PDF]
Audio source separation is usually achieved by estimating the short-time Fourier transform (STFT) magnitude of each source, and then applying a spectrogram inversion algorithm to retrieve time-domain signals. In particular, the multiple input spectrogram inversion (MISI) algorithm has been exploited successfully in several recent works.
Paul Magron, Tuomas Virtanen
openaire +3 more sources
Background: The contingent negative variation (CNV) is a negative shift in electroencephalography (EEG) related to the planning and execution of an externally cued movement task.
Sharon Olsen +7 more
doaj +1 more source
VoViT: Low Latency Graph-Based Audio-Visual Voice Separation Transformer
This paper presents an audio-visual approach for voice separation which produces state-of-the-art results at a low latency in two scenarios: speech and singing voice. The model is based on a two-stage network. Motion cues are obtained with a lightweight graph convolutional network that processes face landmarks.
Juan F. Montesinos +2 more
openaire +3 more sources
Visual intensity-dependent response latencies predict perceived audio–visual simultaneity
To form a coherent presentation of the world, the brain needs to combine multiple sensory modalities accurately together in the temporal domain. Judgements on the relative timing of audio–visual stimuli are complex, due to the differing propagation speeds of light and sound through the environment and the nervous system, and the dependence of ...
Ryan Horsfall +2 more
openaire +3 more sources

