
Feature Augmenting Networks for Improving Depression Severity Estimation From Speech Signals

open access: yes, IEEE Access, 2020
Depression disorder has become one of the major psychological diseases endangering human health. Researchers in the affective computing community are supporting the development of reliable depression severity estimation systems from multiple modalities ...
Le Yang, Dongmei Jiang, Hichem Sahli
doaj   +1 more source

Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering [PDF]

open access: yes, European Signal Processing Conference, 2022
Audio question answering (AQA) is a multimodal translation task where a system analyzes an audio signal and a natural language question to generate a desirable natural language answer. In this paper, we introduce Clotho-AQA, a dataset for Audio question ...
Samuel Lipping   +3 more
semanticscholar   +1 more source

A Simple Prior for Audio Signals [PDF]

open access: yes, IEEE Transactions on Audio, Speech, and Language Processing, 2013
We propose a simple prior for restoration problems involving oscillatory signals. The prior makes use of an underlying analytic frame decomposition with narrow subbands. Beyond this, the prior has no other parameters, which makes it simple to use and apply.
Bayram, Ilker, Kamasak, Mustafa E.
openaire   +3 more sources

Signal processing & audio processors [PDF]

open access: yes, Acta Oto-Laryngologica, 2021
Signal processing algorithms are the hidden components in the audio processor that converts the received acoustic signal into electrical impulses while maintaining as much relevant information as possible. Signal processing algorithms should be smart enough to mimic the functionality of the external, middle, and inner ear to provide the cochlear implant ...
Ingeborg Hochmair, Anandhan Dhanasingh
openaire   +2 more sources

Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers [PDF]

open access: yes, 2022
Audio signals are often stored and transmitted in compressed formats. Among the many available audio compression schemes, MPEG-1 Audio Layer III (MP3) is very popular and widely used. Since MP3 is lossy, it leaves characteristic traces in the compressed audio, which can be used forensically to expose the history of an audio file.
arxiv   +1 more source

Audio-visual speech recognition with background music using single-channel source separation [PDF]

open access: yes, 2012
In this paper, we consider audio-visual speech recognition with background music. The proposed algorithm is an integration of audio-visual speech recognition and single channel source separation (SCSS). We apply the proposed algorithm to recognize spoken ...
Erdogan, Hakan   +4 more
core   +1 more source

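Emergency Locator Transmitter Signal Receiver Using the Direct Receiver Method at a Frequency of 121.5 MHz (Penerima Sinyal Emergency Locator Transmitter dengan Metode Direct Receiver pada Frekuensi 121,5 MHz)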

open access: yes, Jurnal Teknik Elektro, 2020
To locate an aircraft accident from a 121.5 MHz ELT signal, search and rescue (SAR) officers or civil aviation authorities need an ELT signal receiver that operates at the same frequency.
Rustamaji   +2 more
doaj   +1 more source

Active Noise Control over Space: A Subspace Method for Performance Analysis

open access: yes, Applied Sciences, 2019
In this paper, we investigate the maximum active noise control performance over a three-dimensional (3-D) space, for a given set of secondary sources in a particular environment.
Jihui Zhang   +3 more
doaj   +1 more source

Multi-Temporal Lip-Audio Memory for Visual Speech Recognition [PDF]

open access: yes, arXiv, 2023
Visual Speech Recognition (VSR) is the task of predicting a sentence or word from lip movements. Some recent works use audio signals to supplement visual information. However, existing methods utilize only limited information such as phoneme-level features and soft labels of Automatic Speech Recognition (ASR) networks.
arxiv  

Atomic decompositions of audio signals [PDF]

open access: yes, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics, 2002
Signal modeling techniques ranging from basis expansions to parametric approaches have been applied to audio signal processing. Motivated by the fundamental limitations of basis expansions for representing arbitrary signal features and providing means for signal modifications, we consider decompositions in terms of functions that are both signal ...
Goodwin, Michael, Vetterli, Martin
openaire   +2 more sources
