Right Hemisphere Regions Critical for Expression of Emotion Through Prosody
Impaired expression of emotion through pitch, loudness, rate, and rhythm of speech (affective prosody) is common and disabling after right hemisphere (RH) stroke. These deficits impede all social interactions.
Sona Patel +6 more
doaj +2 more sources
Visual Gestures of the Head and Eyebrows Support Prosody Perception for Individuals with Cochlear Implants [PDF]
Visual gestures, especially head movements and eyebrow raises, are time-locked to acoustic cues during the expression of spoken prosody. The present study examined the role these visual cues play in prosody perception, particularly for individuals with ...
Justin T. Fleming +2 more
doaj +2 more sources
Yazhou Jin,* Zhiqi Mao,* Zhipei Ling, Xin Xu, Guang Xie, Xinguang Yu. Department of Neurosurgery, People’s Liberation Army General Hospital, Beijing, People’s Republic of China. *These authors contributed equally to this work. Background ...
Jin Y, Mao Z, Ling Z, Xu X, Xie G, Yu X
doaj +1 more source
Text-Free Prosody-Aware Generative Spoken Language Modeling [PDF]
Speech pre-training has primarily demonstrated efficacy on classification tasks, while its capability of generating novel speech, similar to how GPT-2 can generate coherent paragraphs, has barely been explored. Generative Spoken Language Modeling (GSLM) (
E. Kharitonov +10 more
semanticscholar +1 more source
A New Approach to the Persian Prosodies based on Music Tetrachords [PDF]
There is no doubt that the quantitative meter is the essence of music and prosodies, providing the link between poetry and music. In addition to the time priority that music has over prosodies, the similarity between these elements makes it apparent that
Mehran Mhboobi moqadam, Ali Heydari
doaj +1 more source
On the Utility of Self-Supervised Models for Prosody-Related Tasks [PDF]
Self-Supervised Learning (SSL) from speech data has produced models that have achieved remarkable performance in many tasks, and that are known to implicitly represent many aspects of information latently present in speech signals.
Guan-Ting Lin +7 more
semanticscholar +1 more source
ProsoSpeech: Enhancing Prosody with Quantized Vector Pre-Training in Text-To-Speech [PDF]
Expressive text-to-speech (TTS) has become a hot research topic recently, mainly focusing on modeling prosody in speech. Prosody modeling has several challenges: 1) the extracted pitch used in previous prosody modeling works has inevitable errors, which
Yi Ren +6 more
semanticscholar +1 more source
Emotional prosody perception and production are linked in prelingually deaf children with cochlear implants [PDF]
Links between perception and production of emotional prosody by children with cochlear implants (CIs) have not been extensively explored. In this study, production and perception of emotional prosody were measured in 20 prelingually deaf school-age ...
Monita Chatterjee +3 more
doaj +1 more source
Prosody Is Not Identity: A Speaker Anonymization Approach Using Prosody Cloning
Prosody is closely linked to the identity of a speaker, leading to individual pitch and intonation patterns. Therefore, it is challenging in speaker anonymization to generate speech utterances that both keep the original audio’s main prosodic structure ...
S. Meyer +5 more
semanticscholar +1 more source
Fully-Hierarchical Fine-Grained Prosody Modeling For Interpretable Speech Synthesis [PDF]
This paper proposes a hierarchical, fine-grained and interpretable latent variable model for prosody based on the Tacotron 2 text-to-speech model. It achieves multi-resolution modeling of prosody by conditioning finer level representations on coarser ...
Guangzhi Sun +5 more
semanticscholar +1 more source

