Results 61 to 70 of about 7,706,916 (368)

TEA-PSE: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System for ICASSP 2022 DNS Challenge

open access: yesIEEE International Conference on Acoustics, Speech, and Signal Processing, 2022
This paper describes Tencent Ethereal Audio Lab – Northwestern Polytechnical University personalized speech enhancement (TEA-PSE) system submitted to track 2 of the ICASSP 2022 Deep Noise Suppression (DNS) challenge.
Yukai Ju   +8 more
semanticscholar   +1 more source

A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI [PDF]

open access: yesarXiv.org, 2023
Generative AI has demonstrated impressive performance in various fields, among which speech synthesis is an interesting direction. With the diffusion model as the most popular generative model, numerous works have attempted two active tasks: text to ...
Chenshuang Zhang   +6 more
semanticscholar   +1 more source

Research and DSP Implementation of Speech Enhancement Technology Based on Dynamic Mixed Features and Adaptive Mask

open access: yesJournal of Electrical and Computer Engineering, 2022
A deep learning speech enhancement algorithm based on dynamic hybrid feature and adaptive mask and DSP implementation is proposed in this paper, which solves the problem of feature loss and improves the performance of speech enhancement.
Jie Yang, Yachun Tang
doaj   +1 more source

Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation [PDF]

open access: yesProc. INTERSPEECH 2023, 844-848 (2023), 2023
Audio-visual speech enhancement (AV-SE) aims to enhance degraded speech along with extra visual information such as lip videos, and has been shown to be more effective than audio-only speech enhancement. This paper proposes further incorporating ultrasound tongue images to improve lip-based AV-SE systems' performance. Knowledge distillation is employed
arxiv   +1 more source

DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement [PDF]

open access: yesInterspeech, 2021
The dual-path RNN (DPRNN) was proposed to more effectively model extremely long sequences for speech separation in the time domain. By splitting long sequences to smaller chunks and applying intra-chunk and inter-chunk RNNs, the DPRNN reached promising ...
Xiaohuai Le   +3 more
semanticscholar   +1 more source

An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation [PDF]

open access: yesIEEE/ACM Transactions on Audio Speech and Language Processing, 2020
Speech enhancement and speech separation are two related tasks, whose purpose is to extract either one or more target speech signals, respectively, from a mixture of sounds generated by several sources.
Daniel Michelsanti   +6 more
semanticscholar   +1 more source

Multiple sclerosis clinical decision support system based on projection to reference datasets

open access: yesAnnals of Clinical and Translational Neurology, Volume 9, Issue 12, Page 1863-1873, December 2022., 2022
Abstract Objective Multiple sclerosis (MS) is a multifactorial disease with increasingly complicated management. Our objective is to use on‐demand computational power to address the challenges of dynamically managing MS. Methods A phase 3 clinical trial data (NCT00906399) were used to contextualize the medication efficacy of peg‐interferon beta‐1a vs ...
Chadia Ed‐driouch   +13 more
wiley   +1 more source

Speech Enhancement Using Deep Learning Methods: A Review

open access: yesJurnal Elektronika dan Telekomunikasi, 2021
Speech enhancement, which aims to recover the clean speech of the corrupted signal, plays an important role in the digital speech signal processing. According to the type of degradation and noise in the speech signal, approaches to speech enhancement ...
Asri Rizki Yuliani   +4 more
doaj   +1 more source

Cycle-Consistent Speech Enhancement [PDF]

open access: yesInterspeech 2018, 2018
5 pages, 2 figures. Interspeech 2018.
Meng, Zhong   +4 more
openaire   +2 more sources

Exploring WavLM on Speech Enhancement

open access: yes2022 IEEE Spoken Language Technology Workshop (SLT), 2023
Accepted by IEEE SLT ...
Song, Hyungchan   +7 more
openaire   +2 more sources

Home - About - Disclaimer - Privacy