Results 1 to 10 of about 12,622 (268)

Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers [PDF]

open access: yesInterspeech 2021, 2021
Video captioning is an essential technology to understand scenes and describe events in natural language. To apply it to real-time monitoring, a system needs not only to describe events accurately but also to produce the captions as soon as possible. Low-latency captioning is needed to realize such functionality, but this research area for online video
Hori, Chiori   +2 more
openaire   +2 more sources

Active Middle Ear Implant Evoked Auditory Brainstem Response Intensity-Latency Characteristics

open access: yesFrontiers in Neurology, 2022
ObjectiveTo analyze intensity-latency functions of intraoperative auditory evoked brainstem responses (ABRs) to stimulation by the Vibrant Soundbridge (VSB) active middle ear implant with respect to coupling efficiency, VSB evoked ABR thresholds, and ...
Laura Fröhlich   +7 more
doaj   +1 more source

audiomath: A neuroscientist's sound toolkit

open access: yesHeliyon, 2021
In neuroscientific experiments and applications, working with auditory stimuli demands software tools for generation and acquisition of raw audio, for composition and tailoring of that material into finished stimuli, for precisely timed presentation of ...
N. Jeremy Hill   +2 more
doaj   +1 more source

Synchronization of ear-EEG and audio streams in a portable research hearing device

open access: yesFrontiers in Neuroscience, 2022
Recent advancements in neuroscientific research and miniaturized ear-electroencephalography (EEG) technologies have led to the idea of employing brain signals as additional input to hearing aid algorithms.
Steffen Dasenbrock   +11 more
doaj   +1 more source

Energy-efficient low-latency audio on android [PDF]

open access: yesJournal of Systems and Software, 2019
Abstract Counting more than two billion devices, Android is nowadays one of the most popular open-source general-purpose operating systems, based on Linux. Because of the diversity of applications that can be installed, it manages a number of different workloads, many of them requiring performance/QoS guarantees.
Alessio Balsini   +6 more
openaire   +1 more source

FaSNet: Low-Latency Adaptive Beamforming for Multi-Microphone Audio Processing [PDF]

open access: yes2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2019
Accepted to ASRU ...
Lou, Y   +4 more
openaire   +2 more sources

Online Spectrogram Inversion for Low-Latency Audio Source Separation [PDF]

open access: yesIEEE Signal Processing Letters, 2020
Audio source separation is usually achieved by estimating the short-time Fourier transform (STFT) magnitude of each source, and then applying a spectrogram inversion algorithm to retrieve time-domain signals. In particular, the multiple input spectrogram inversion (MISI) algorithm has been exploited successfully in several recent works.
Paul Magron, Tuomas Virtanen
openaire   +3 more sources

Characteristics of the Contingent Negative Variation during Lower Limb Functional Movement with an Audio-Visual Cue

open access: yesApplied Sciences, 2023
Background: The contingent negative variation (CNV) is a negative shift in electroencephalography (EEG) related to the planning and execution of an externally cued movement task.
Sharon Olsen   +7 more
doaj   +1 more source

VoViT: Low Latency Graph-Based Audio-Visual Voice Separation Transformer

open access: yes, 2022
This paper presents an audio-visual approach for voice separation which produces state-of-the-art results at a low latency in two scenarios: speech and singing voice. The model is based on a two-stage network. Motion cues are obtained with a lightweight graph convolutional network that processes face landmarks.
Juan F. Montesinos   +2 more
openaire   +3 more sources

Visual intensity-dependent response latencies predict perceived audio–visual simultaneity

open access: yesJournal of Mathematical Psychology, 2021
To form a coherent presentation of the world, the brain needs to combine multiple sensory modalities accurately together in the temporal domain. Judgements on the relative timing of audio–visual stimuli are complex, due to the differing propagation speeds of light and sound through the environment and the nervous system, and the dependence of ...
Ryan Horsfall   +2 more
openaire   +3 more sources

Home - About - Disclaimer - Privacy