Particle-Velocity-Based Mixed-Source Sound Field Translation for Binaural Reproduction
Following the rise of virtual reality is a demand for sound field reproduction techniques that allow the user to interact and move within acoustic reproductions with six degrees of freedom.
Huanyu Zuo +4 more
doaj +1 more source
Differentiable Signal Processing With Black-Box Audio Effects
We present a data-driven approach to automate audio signal processing by incorporating stateful third-party audio effects as layers within a deep neural network.
Marco A. Martínez Ramírez +3 more
semanticscholar +1 more source
Multi-channel speech coding and enhancement is an indispensable technology in speech communication. In order to verify the effectiveness of multi-channel speech coding and enhancement methods in research and development, a microphone array speech ...
Rui Cheng, Changchun Bao, Zihao Cui
doaj +1 more source
In this paper, a quadratic convolution neural network (QCNN) using both audio and vibration signals is utilized for bearing fault diagnosis. Specifically, to make use of multi-modal information for bearing fault diagnosis, the audio and vibration signals
Jin Yan +5 more
doaj +1 more source
AudioDec: An Open-Source Streaming High-Fidelity Neural Audio Codec
A good audio codec for live applications such as telecommunication is characterized by three key properties: (1) compression, i.e. the bitrate that is required to transmit the signal should be as low as possible; (2) latency, i.e.
Yi-Chiao Wu +3 more
semanticscholar +1 more source
AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis
Generating high-fidelity talking head video by fitting with the input audio sequence is a challenging problem that has received considerable attention recently. In this paper, we address this problem with the aid of neural scene representation networks. Our
Yudong Guo +5 more
semanticscholar +1 more source
Feature Augmenting Networks for Improving Depression Severity Estimation From Speech Signals
Depression disorder has become one of the major psychological diseases endangering human health. Researchers in the affective computing community are supporting the development of reliable depression severity estimation systems from multiple modalities ...
Le Yang, Dongmei Jiang, Hichem Sahli
doaj +1 more source
AudioSR: Versatile Audio Super-Resolution at Scale
Audio super-resolution is a fundamental task that predicts high-frequency components for low-resolution audio, enhancing audio quality in digital applications.
Haohe Liu +4 more
semanticscholar +1 more source
BAVS: Bootstrapping Audio-Visual Segmentation by Integrating Foundation Knowledge
Given an audio-visual pair, audio-visual segmentation (AVS) aims to locate sounding sources by predicting pixel-wise maps. Previous methods assume that each sound component in an audio signal always has a visual counterpart in the image.
Chen Liu +6 more
semanticscholar +1 more source
A Comparison of Audio Signal Preprocessing Methods for Deep Neural Networks on Music Tagging
In this paper, we empirically investigate the effect of audio preprocessing on music tagging with deep neural networks. We perform comprehensive experiments involving audio preprocessing using different time-frequency representations, logarithmic ...
Keunwoo Choi +3 more
semanticscholar +1 more source

