Speech acoustics: How much science?
Human vocalizations are sounds made exclusively by a human vocal tract. Among vocalizations, which also include laughs and screams, speech is the most important. Speech is the primary medium of that supremely human symbolic communication system called language.
Tiwari M.
europepmc +5 more sources
The human voice is a directional sound source. This property has been explored for more than 200 years, mainly using measurements of human participants. Some efforts have been made to understand the anatomical parameters that influence speech directivity,
Blandin Rémi +2 more
doaj +1 more source
End-to-end text-to-speech (TTS) models that directly generate waveforms from text are gaining popularity. However, existing end-to-end models are still not natural enough in their prosodic expressiveness.
Zengqiang Shang +4 more
doaj +1 more source
Explore Long-Range Context Features for Speaker Verification
Multi-scale context information, especially long-range dependency, has been shown to be beneficial for speaker verification (SV) tasks. In this paper, we propose three methods to systematically explore long-range context SV feature extraction based on ResNet ...
Zhuo Li +4 more
doaj +1 more source
An individualization approach for head-related transfer function in arbitrary directions based on deep learning [PDF]
This paper provides an individualization approach for head-related transfer function (HRTF) in arbitrary directions based on deep learning by utilizing a dual-autoencoder architecture to establish the relationship between the HRTF magnitude spectrum and ...
Dingding Yao +6 more
doaj +1 more source
A recurrent neural network (RNN) based attention model has been used in code-switching speech recognition (CSSR). However, due to the sequential computation constraint of RNNs, there are stronger short-range dependencies and weaker long-range ...
Zheying Huang +5 more
doaj +1 more source
Deep learning based methods have achieved state-of-the-art results on the task of ship type classification. However, most existing ship type classification algorithms take time–frequency (TF) features as input; the underlying discriminative information ...
Chen Li +4 more
doaj +1 more source
Confidence Learning for Semi-Supervised Acoustic Event Detection
In recent years, the involvement of synthetic strongly labeled data, weakly labeled data, and unlabeled data has drawn much research attention in semi-supervised acoustic event detection (SAED).
Yuzhuo Liu +4 more
doaj +1 more source
Acoustic Analysis of PD Speech [PDF]
According to the U.S. National Institutes of Health, approximately 500,000 Americans have Parkinson's disease (PD), with roughly another 50,000 receiving new diagnoses each year. 70%–90% of these people also have the hypokinetic dysarthria associated with PD.
Karen Chenausky +2 more
openaire +3 more sources
Temporal Convolution Network Based Joint Optimization of Acoustic-to-Articulatory Inversion
Articulatory features have proved to be effective in speech recognition and speech synthesis. However, acquiring articulatory features has always been a difficult research problem.
Guolun Sun +3 more
doaj +1 more source

