Results 31 to 40 of about 5,344,036 (399)
Feature Augmenting Networks for Improving Depression Severity Estimation From Speech Signals
Depression disorder has become one of the major psychological diseases endangering human health. Researcher in the affective computing community is supporting the development of reliable depression severity estimation system, from multiple modalities ...
Le Yang, Dongmei Jiang, Hichem Sahli
doaj +1 more source
Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering [PDF]
Audio question answering (AQA) is a multimodal translation task where a system analyzes an audio signal and a natural language question, to generate a desirable natural language answer. In this paper, we introduce Clotho-AQA, a dataset for Audio question
Samuel Lipping+3 more
semanticscholar +1 more source
A Simple Prior for Audio Signals [PDF]
We propose a simple prior for restoration problems involving oscillatory signals. The prior makes use of an underlying analytic frame decomposition with narrow subbands. Other than this, the prior does not have any other parameters, which makes it simple to use and apply.
Bayram, Ilker, Kamasak, Mustafa E.
openaire +3 more sources
Signal processing & audio processors [PDF]
Signal processing algorithms are the hidden components in the audio processor that converts the received acoustic signal into electrical impulses while maintaining as much relevant information as possible. Signal processing algorithms should be smart enough to mimic the functionality of external, middle and the inner-ear to provide the cochlear implant
Ingeborg Hochmair, Anandhan Dhanasingh
openaire +2 more sources
Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers [PDF]
Audio signals are often stored and transmitted in compressed formats. Among the many available audio compression schemes, MPEG-1 Audio Layer III (MP3) is very popular and widely used. Since MP3 is lossy it leaves characteristic traces in the compressed audio which can be used forensically to expose the past history of an audio file.
arxiv +1 more source
Audio-visual speech recognition with background music using single-channel source separation [PDF]
In this paper, we consider audio-visual speech recognition with background music. The proposed algorithm is an integration of audio-visual speech recognition and single channel source separation (SCSS). We apply the proposed algorithm to recognize spoken
Erdogan, Hakan+4 more
core +1 more source
Penerima Sinyal Emergency Locator Transmitter dengan Metode Direct Receiver pada Frekuensi 121,5 MHz
To find out the location of an aircraft accident from an ELT signal beam of 121.5 MHz, search and rescue (SAR) officers or civil aviation authorities need an ELT signal receiver device that works at the same frequency, which is 121.5 MHz.
Rustamaji+2 more
doaj +1 more source
Active Noise Control over Space: A Subspace Method for Performance Analysis
In this paper, we investigate the maximum active noise control performance over a three-dimensional (3-D) spatial space, for a given set of secondary sources in a particular environment.
Jihui Zhang+3 more
doaj +1 more source
Multi-Temporal Lip-Audio Memory for Visual Speech Recognition [PDF]
Visual Speech Recognition (VSR) is a task to predict a sentence or word from lip movements. Some works have been recently presented which use audio signals to supplement visual information. However, existing methods utilize only limited information such as phoneme-level features and soft labels of Automatic Speech Recognition (ASR) networks.
arxiv
Atomic decompositions of audio signals [PDF]
Signal modeling techniques ranging from basis expansions to parametric approaches have been applied to audio signal processing. Motivated by the fundamental limitations of basis expansions for representing arbitrary signal features and providing means for signal modifications, we consider decompositions in terms of functions that are both signal ...
Goodwin, Michael, Vetterli, Martin
openaire +2 more sources