This paper describes the Tencent Ethereal Audio Lab – Northwestern Polytechnical University personalized speech enhancement (TEA-PSE) system submitted to track 2 of the ICASSP 2022 Deep Noise Suppression (DNS) challenge.
Yukai Ju+8 more
semanticscholar +1 more source
A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI [PDF]
Generative AI has demonstrated impressive performance in various fields, among which speech synthesis is an interesting direction. With the diffusion model as the most popular generative model, numerous works have attempted two active tasks: text to ...
Chenshuang Zhang+6 more
semanticscholar +1 more source
This paper proposes a deep learning speech enhancement algorithm based on a dynamic hybrid feature and an adaptive mask, together with a DSP implementation. The algorithm addresses the problem of feature loss and improves speech enhancement performance.
Jie Yang, Yachun Tang
doaj +1 more source
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation [PDF]
Audio-visual speech enhancement (AV-SE) aims to enhance degraded speech along with extra visual information such as lip videos, and has been shown to be more effective than audio-only speech enhancement. This paper proposes further incorporating ultrasound tongue images to improve lip-based AV-SE systems' performance. Knowledge distillation is employed
arxiv +1 more source
DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement [PDF]
The dual-path RNN (DPRNN) was proposed to more effectively model extremely long sequences for speech separation in the time domain. By splitting long sequences into smaller chunks and applying intra-chunk and inter-chunk RNNs, the DPRNN reached promising ...
Xiaohuai Le+3 more
semanticscholar +1 more source
An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation [PDF]
Speech enhancement and speech separation are two related tasks, whose purpose is to extract either one or more target speech signals, respectively, from a mixture of sounds generated by several sources.
Daniel Michelsanti+6 more
semanticscholar +1 more source
Multiple sclerosis clinical decision support system based on projection to reference datasets
Objective: Multiple sclerosis (MS) is a multifactorial disease with increasingly complicated management. Our objective is to use on‐demand computational power to address the challenges of dynamically managing MS. Methods: Data from a phase 3 clinical trial (NCT00906399) were used to contextualize the medication efficacy of peg‐interferon beta‐1a vs ...
Chadia Ed‐driouch+13 more
wiley +1 more source
Speech Enhancement Using Deep Learning Methods: A Review
Speech enhancement, which aims to recover clean speech from a corrupted signal, plays an important role in digital speech signal processing. According to the type of degradation and noise in the speech signal, approaches to speech enhancement ...
Asri Rizki Yuliani+4 more
doaj +1 more source
Cycle-Consistent Speech Enhancement [PDF]
5 pages, 2 figures. Interspeech 2018.
Meng, Zhong+4 more
openaire +2 more sources
Exploring WavLM on Speech Enhancement
Accepted by IEEE SLT ...
Song, Hyungchan+7 more
openaire +2 more sources