Results 11 to 20 of about 41,235 (305)

Vector-quantized Variational Autoencoder for Phase-aware Speech Enhancement [PDF]

open access: yes, 2022
Speech-enhancement methods based on the complex ideal ratio mask (cIRM) have achieved promising results. These methods often deploy a deep neural network to jointly estimate the real and imaginary components of the cIRM defined in the complex domain ...
Unoki, Masashi   +3 more
core   +1 more source

Enhancing Speech Privacy with Slicing

open access: yesInterspeech 2022, 2022
Privacy preservation calls for speech anonymization methods which hide the speaker's identity while minimizing the impact on downstream tasks such as automatic speech recognition (ASR) training or decoding. In the recent VoicePrivacy 2020 Challenge, several anonymization methods have been proposed to transform speech utterances in a way that preserves ...
Maouche, Mohamed   +5 more
openaire   +3 more sources

Visual Speech Enhancement [PDF]

open access: yesInterspeech 2018, 2018
When video is shot in noisy environment, the voice of a speaker seen in the video can be enhanced using the visible mouth movements, reducing background noise. While most existing methods use audio-only inputs, improved performance is obtained with our visual speech enhancement, based on an audio-visual neural network.
Aviv Gabbay, Asaph Shamir, Shmuel Peleg
openaire   +2 more sources

Speech Enhancement Method Based on Quasi Recurrent Neural Network [PDF]

open access: yesJisuanji gongcheng, 2020
In the deep learning based speech enhancement model,the Long Short-Term Memory Network(LSTM) can well handle the sequence speech enhancement problem,but the training speed of the model is slow when dealing with speech enhancement problems based on large ...
LOU Yingxi, YUAN Wenhao, PENG Rongqun
doaj   +1 more source

Speech Enhancement of Noisy and Reverberant Speech for Text-to-Speech [PDF]

open access: yesIEEE/ACM Transactions on Audio, Speech, and Language Processing, 2018
Text-to-speech voices created from noisy and reverberant recordings are of lower quality. A simple way to improve this is to increase the quality of the recordings prior to text-to-speech training with speech enhancement methods such as noise suppression and dereverberation.
Cassia Valentini-Botinhao   +1 more
openaire   +2 more sources

MASS: Microphone Array Speech Simulator in Room Acoustic Environment for Multi-Channel Speech Coding and Enhancement

open access: yesApplied Sciences, 2020
Multi-channel speech coding and enhancement is an indispensable technology in speech communication. In order to verify the effectiveness of multi-channel speech coding and enhancement methods in the research and development, a microphone array speech ...
Rui Cheng, Changchun Bao, Zihao Cui
doaj   +1 more source

Speech Enhancement via EMD [PDF]

open access: yesEURASIP Journal on Advances in Signal Processing, 2008
In this study, two new approaches for speech signal noise reduction based on the empirical mode decomposition (EMD) recently introduced by Huang et al. (1998) are proposed. Based on the EMD, both reduction schemes are fully data-driven approaches. Noise signal is decomposed adaptively into oscillatory components called intrinsic mode functions (IMFs ...
Kais Khaldi   +3 more
openaire   +3 more sources

Exploring conventional enhancement and separation methods for multi‐speech enhancement in indoor environments

open access: yesCognitive Computation and Systems, 2021
Speech enhancement is an important preprocessing step in a wide diversity of practical fields related to speech signals, and many signal‐processing methods have already been proposed for speech enhancement.
Yangjie Wei   +3 more
doaj   +1 more source

Multi‐stage attention network for monaural speech enhancement

open access: yesIET Signal Processing, 2023
Although current attention‐based speech enhancement methods have been proven to be capable of significantly improving the noise reduction performance, a bottleneck has arisen in juggling both detailed features and high‐level features: The more attention ...
Kunpeng Wang   +4 more
doaj   +1 more source

Robust intelligibility and quality evaluation of combined temporal and spectral processing for hearing impaired

open access: yesIntelligent Systems with Applications, 2022
Hearing-impaired people face numerous challenges with speech perception in the presence of interfering background noise. To suppress interfering background noise, the common approach widely used is speech enhancement.
Hemangi Shinde, Vibha Vyas
doaj   +1 more source

Home - About - Disclaimer - Privacy