Results 21 to 30 of about 4,974,855 (283)

Fusion of Audio and Vibration Signals for Bearing Fault Diagnosis Based on a Quadratic Convolution Neural Network

open access: yesSensors, 2023
In this paper, a quadratic convolution neural network (QCNN) using both audio and vibration signals is utilized for bearing fault diagnosis. Specifically, to make use of multi-modal information for bearing fault diagnosis, the audio and vibration signals
Jin Yan   +5 more
doaj   +1 more source

UAVM: Towards Unifying Audio and Visual Models [PDF]

open access: yesIEEE Signal Processing Letters, vol. 29, pp. 2437-2441, 2022, 2022
Conventional audio-visual models have independent audio and video branches. In this work, we unify the audio and visual branches by designing a Unified Audio-Visual Model (UAVM). The UAVM achieves a new state-of-the-art audio-visual event classification accuracy of 65.8% on VGGSound.
arxiv   +1 more source

Audio signal encryption using chaotic Hénon map and lifting wavelet transforms [PDF]

open access: yesThe European Physical Journal Plus, 2017
We propose an audio signal encryption scheme based on the chaotic Hénon map. The scheme mainly comprises two phases: one is the preprocessing stage where the audio signal is transformed into data by the lifting wavelet scheme and the other in which the ...
A. Roy, A. P. Misra
semanticscholar   +1 more source

A Comparison of Audio Signal Preprocessing Methods for Deep Neural Networks on Music Tagging [PDF]

open access: yesEuropean Signal Processing Conference, 2017
In this paper, we empirically investigate the effect of audio preprocessing on music tagging with deep neural networks. We perform comprehensive experiments involving audio preprocessing using different time-frequency representations, logarithmic ...
Keunwoo Choi   +3 more
semanticscholar   +1 more source

Feature Augmenting Networks for Improving Depression Severity Estimation From Speech Signals

open access: yesIEEE Access, 2020
Depression disorder has become one of the major psychological diseases endangering human health. Researcher in the affective computing community is supporting the development of reliable depression severity estimation system, from multiple modalities ...
Le Yang, Dongmei Jiang, Hichem Sahli
doaj   +1 more source

Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering [PDF]

open access: yesEuropean Signal Processing Conference, 2022
Audio question answering (AQA) is a multimodal translation task where a system analyzes an audio signal and a natural language question, to generate a desirable natural language answer. In this paper, we introduce Clotho-AQA, a dataset for Audio question
Samuel Lipping   +3 more
semanticscholar   +1 more source

Quantum-assisted distortion-free audio signal sensing

open access: yesNature Communications, 2022
High sensitivity in quantum sensing comes often at the expense of other figures of merit, usually resulting in distortion. Here, the authors propose a protocol with good sensitivity, readout linearity and high frequency resolution, and benchmark it ...
Chen Zhang   +11 more
doaj   +1 more source

Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers [PDF]

open access: yes, 2022
Audio signals are often stored and transmitted in compressed formats. Among the many available audio compression schemes, MPEG-1 Audio Layer III (MP3) is very popular and widely used. Since MP3 is lossy it leaves characteristic traces in the compressed audio which can be used forensically to expose the past history of an audio file.
arxiv   +1 more source

Twin neural network regression

open access: yesApplied AI Letters, Volume 3, Issue 4, December 2022., 2022
We propose to reformulate a regression problem into predicting differences between target values. This allows for leveraging consistency conditions which can be used as uncertainty estimates and enable the production of an ensemble of predictions while training only a single neural network.
Sebastian Johann Wetzel   +3 more
wiley   +1 more source

Penerima Sinyal Emergency Locator Transmitter dengan Metode Direct Receiver pada Frekuensi 121,5 MHz

open access: yesJurnal Teknik Elektro, 2020
To find out the location of an aircraft accident from an ELT signal beam of 121.5 MHz, search and rescue (SAR) officers or civil aviation authorities need an ELT signal receiver device that works at the same frequency, which is 121.5 MHz.
Rustamaji   +2 more
doaj   +1 more source

Home - About - Disclaimer - Privacy