Results 11 to 20 of about 132,260 (194)
Temporal Convolution Network Based Joint Optimization of Acoustic-to-Articulatory Inversion
Articulatory features are proved to be efficient in the area of speech recognition and speech synthesis. However, acquiring articulatory features has always been a difficult research hotspot.
Guolun Sun +3 more
doaj +1 more source
Vowel reduction is a common pronunciation phenomenon in stress-timed languages like English. Native speakers tend to weaken unstressed vowels into a schwa-like sound.
Zongming Liu +3 more
doaj +1 more source
Nowadays, most end-to-end task-oriented dialog systems are based on sequence-to-sequence (Seq2seq), which is an encoder-decoder framework. These systems sometimes produce grammatically correct, but logically incorrect responses.
Junqing He +4 more
doaj +1 more source
GuidedMix: An on‐the‐fly data augmentation approach for robust speaker recognition system
Data augmentation is an essential technique for building a high‐robustness speaker recognition system. this letter proposes a novel on‐the‐fly data augmentation strategy called GuidedMix.
Runqiu Xiao +4 more
doaj +1 more source
A Complementary Effect in Active Control of Powertrain and Road Noise in the Vehicle Interior
This study shows that a concurrent active noise control strategy for engine harmonics and road noise has a complementary effect. In particular, we found that engine booming noise is additionally attenuated when road noise control is concurrently used ...
Seonghyeon Kim, M. Ercan Altinsoy
doaj +1 more source
AUTOMATIC MUSIC TRANSCRIPTION USING ROW WEIGHTED DECOMPOSITIONS [PDF]
(c) 2013 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or ...
IEEE, O'Hanlon, K, Plumbley, MD
core +1 more source
ACCOUNTING FOR PHASE CANCELLATIONS IN NON-NEGATIVE MATRIX FACTORIZATION USING WEIGHTED DISTANCES [PDF]
(c)2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or ...
Ewert, S, IEEE, Plumbley, MD, Sandler, M
core +1 more source
Restricted Boltzmann Machine-Based Approaches for Link Prediction in Dynamic Networks
Link prediction in dynamic networks aims to predict edges according to historical linkage status. It is inherently difficult because of the linear/non-linear transformation of underlying structures.
Taisong Li +4 more
doaj +1 more source
Parallel global convolutional network for semantic image segmentation
In this paper, a novel convolutional neural network for fast semantic segmentation is presented. Deep convolutional neural networks have achieved great progress in the task of vision scene understanding.
Xing Bai, Jun Zhou
doaj +1 more source
Acoustic voice variation in spontaneous speech
This study replicates and extends the recent findings of Lee, Keating, and Kreiman [J. Acoust. Soc. Am. 146(3), 1568–1579 (2019)] on acoustic voice variation in read speech, which showed remarkably similar acoustic voice spaces for groups of female and male talkers and the individual talkers within these groups. Principal component analysis was applied
Yoonjeong Lee, Jody Kreiman
openaire +4 more sources

