In this paper, a quadratic convolution neural network (QCNN) using both audio and vibration signals is utilized for bearing fault diagnosis. Specifically, to make use of multi-modal information for bearing fault diagnosis, the audio and vibration signals
Jin Yan +5 more
doaj +1 more source
UAVM: Towards Unifying Audio and Visual Models [PDF]
Conventional audio-visual models have independent audio and video branches. In this work, we unify the audio and visual branches by designing a Unified Audio-Visual Model (UAVM). The UAVM achieves a new state-of-the-art audio-visual event classification accuracy of 65.8% on VGGSound.
arxiv +1 more source
Audio signal encryption using chaotic Hénon map and lifting wavelet transforms [PDF]
We propose an audio signal encryption scheme based on the chaotic Hénon map. The scheme mainly comprises two phases: one is the preprocessing stage where the audio signal is transformed into data by the lifting wavelet scheme and the other in which the ...
A. Roy, A. P. Misra
semanticscholar +1 more source
A Comparison of Audio Signal Preprocessing Methods for Deep Neural Networks on Music Tagging [PDF]
In this paper, we empirically investigate the effect of audio preprocessing on music tagging with deep neural networks. We perform comprehensive experiments involving audio preprocessing using different time-frequency representations, logarithmic ...
Keunwoo Choi +3 more
semanticscholar +1 more source
Feature Augmenting Networks for Improving Depression Severity Estimation From Speech Signals
Depression disorder has become one of the major psychological diseases endangering human health. Researchers in the affective computing community are supporting the development of reliable depression severity estimation systems, from multiple modalities ...
Le Yang, Dongmei Jiang, Hichem Sahli
doaj +1 more source
Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering [PDF]
Audio question answering (AQA) is a multimodal translation task in which a system analyzes an audio signal and a natural language question to generate a desirable natural language answer. In this paper, we introduce Clotho-AQA, a dataset for Audio question
Samuel Lipping +3 more
semanticscholar +1 more source
Quantum-assisted distortion-free audio signal sensing
High sensitivity in quantum sensing often comes at the expense of other figures of merit, usually resulting in distortion. Here, the authors propose a protocol with good sensitivity, readout linearity and high frequency resolution, and benchmark it ...
Chen Zhang +11 more
doaj +1 more source
Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers [PDF]
Audio signals are often stored and transmitted in compressed formats. Among the many available audio compression schemes, MPEG-1 Audio Layer III (MP3) is very popular and widely used. Since MP3 is lossy, it leaves characteristic traces in the compressed audio, which can be used forensically to expose the history of an audio file.
arxiv +1 more source
Twin neural network regression
We propose to reformulate a regression problem into predicting differences between target values. This reformulation makes it possible to leverage consistency conditions, which can be used as uncertainty estimates, and to produce an ensemble of predictions while training only a single neural network.
Sebastian Johann Wetzel +3 more
wiley +1 more source
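The difference-based reformulation described in the abstract above can be sketched in a few lines. This is only an illustrative sketch under stated assumptions, not the authors' implementation: a linear least-squares model stands in for the neural network, and the function names (`make_pairs`, `fit_difference_model`, `predict`) are hypothetical. The model is trained on pairs of inputs to predict the difference of their targets; a new point is then compared against every training point, which yields an ensemble of estimates whose spread can serve as a rough uncertainty indicator.

```python
import numpy as np

def make_pairs(X, y):
    """Build all ordered pairs (i, j) with the target difference y[i] - y[j]."""
    n = len(X)
    i, j = np.meshgrid(np.arange(n), np.arange(n), indexing="ij")
    i, j = i.ravel(), j.ravel()
    feats = np.hstack([X[i], X[j]])
    return feats, y[i] - y[j]

def fit_difference_model(X, y):
    """Fit a linear stand-in for the twin network: F(x_i, x_j) ~ y_i - y_j."""
    feats, diffs = make_pairs(X, y)
    A = np.hstack([feats, np.ones((len(feats), 1))])  # append a bias column
    w, *_ = np.linalg.lstsq(A, diffs, rcond=None)
    return w

def predict(w, X_train, y_train, x_new):
    """Estimate y(x_new) once per training anchor, then aggregate."""
    n = len(X_train)
    feats = np.hstack([np.tile(x_new, (n, 1)), X_train, np.ones((n, 1))])
    # Each anchor gives one estimate: predicted difference + known target.
    estimates = feats @ w + y_train
    return estimates.mean(), estimates.std()
```

A single fitted model therefore produces as many estimates as there are training anchors; disagreement among them (the returned standard deviation) is one simple way to read the consistency idea mentioned in the abstract.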
Emergency Locator Transmitter Signal Receiver Using the Direct Receiver Method at 121.5 MHz [Penerima Sinyal Emergency Locator Transmitter dengan Metode Direct Receiver pada Frekuensi 121,5 MHz]
To locate an aircraft accident from a 121.5 MHz ELT signal, search and rescue (SAR) officers or civil aviation authorities need an ELT signal receiver that operates at the same frequency.
Rustamaji +2 more
doaj +1 more source