Results 11 to 20 of about 41,235 (305)
Vector-quantized Variational Autoencoder for Phase-aware Speech Enhancement [PDF]
Speech-enhancement methods based on the complex ideal ratio mask (cIRM) have achieved promising results. These methods often deploy a deep neural network to jointly estimate the real and imaginary components of the cIRM defined in the complex domain ...
Unoki, Masashi +3 more
core +1 more source
Enhancing Speech Privacy with Slicing
Privacy preservation calls for speech anonymization methods which hide the speaker's identity while minimizing the impact on downstream tasks such as automatic speech recognition (ASR) training or decoding. In the recent VoicePrivacy 2020 Challenge, several anonymization methods have been proposed to transform speech utterances in a way that preserves ...
Maouche, Mohamed +5 more
openaire +3 more sources
Visual Speech Enhancement [PDF]
When video is shot in noisy environment, the voice of a speaker seen in the video can be enhanced using the visible mouth movements, reducing background noise. While most existing methods use audio-only inputs, improved performance is obtained with our visual speech enhancement, based on an audio-visual neural network.
Aviv Gabbay, Asaph Shamir, Shmuel Peleg
openaire +2 more sources
Speech Enhancement Method Based on Quasi Recurrent Neural Network [PDF]
In the deep learning based speech enhancement model,the Long Short-Term Memory Network(LSTM) can well handle the sequence speech enhancement problem,but the training speed of the model is slow when dealing with speech enhancement problems based on large ...
LOU Yingxi, YUAN Wenhao, PENG Rongqun
doaj +1 more source
Speech Enhancement of Noisy and Reverberant Speech for Text-to-Speech [PDF]
Text-to-speech voices created from noisy and reverberant recordings are of lower quality. A simple way to improve this is to increase the quality of the recordings prior to text-to-speech training with speech enhancement methods such as noise suppression and dereverberation.
Cassia Valentini-Botinhao +1 more
openaire +2 more sources
Multi-channel speech coding and enhancement is an indispensable technology in speech communication. In order to verify the effectiveness of multi-channel speech coding and enhancement methods in the research and development, a microphone array speech ...
Rui Cheng, Changchun Bao, Zihao Cui
doaj +1 more source
Speech Enhancement via EMD [PDF]
In this study, two new approaches for speech signal noise reduction based on the empirical mode decomposition (EMD) recently introduced by Huang et al. (1998) are proposed. Based on the EMD, both reduction schemes are fully data-driven approaches. Noise signal is decomposed adaptively into oscillatory components called intrinsic mode functions (IMFs ...
Kais Khaldi +3 more
openaire +3 more sources
Speech enhancement is an important preprocessing step in a wide diversity of practical fields related to speech signals, and many signal‐processing methods have already been proposed for speech enhancement.
Yangjie Wei +3 more
doaj +1 more source
Multi‐stage attention network for monaural speech enhancement
Although current attention‐based speech enhancement methods have been proven to be capable of significantly improving the noise reduction performance, a bottleneck has arisen in juggling both detailed features and high‐level features: The more attention ...
Kunpeng Wang +4 more
doaj +1 more source
Hearing-impaired people face numerous challenges with speech perception in the presence of interfering background noise. To suppress interfering background noise, the common approach widely used is speech enhancement.
Hemangi Shinde, Vibha Vyas
doaj +1 more source

