
Prosody Is Not Identity: A Speaker Anonymization Approach Using Prosody Cloning

open access: yes
IEEE International Conference on Acoustics, Speech, and Signal Processing, 2023
Prosody is closely linked to the identity of a speaker, leading to individual pitch and intonation patterns. Therefore, it is challenging in speaker anonymization to generate speech utterances that both keep the original audio’s main prosodic structure ...
Sarina Meyer   +5 more
semanticscholar   +1 more source

Fully-Hierarchical Fine-Grained Prosody Modeling For Interpretable Speech Synthesis [PDF]

open access: yes
IEEE International Conference on Acoustics, Speech, and Signal Processing, 2020
This paper proposes a hierarchical, fine-grained and interpretable latent variable model for prosody based on the Tacotron 2 text-to-speech model. It achieves multi-resolution modeling of prosody by conditioning finer level representations on coarser ...
Guangzhi Sun   +5 more
semanticscholar   +1 more source

Generating Diverse and Natural Text-to-Speech Samples Using a Quantized Fine-Grained VAE and Autoregressive Prosody Prior [PDF]

open access: yes
IEEE International Conference on Acoustics, Speech, and Signal Processing, 2020
Recent neural text-to-speech (TTS) models with fine-grained latent features enable precise control of the prosody of synthesized speech. Such models typically incorporate a fine-grained variational autoencoder (VAE) structure, extracting latent features ...
Guangzhi Sun   +7 more
semanticscholar   +1 more source

CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech [PDF]

open access: yes
Interspeech, 2020
Prosody Transfer (PT) is a technique that aims to use the prosody from a source audio as a reference while synthesising speech. Fine-grained PT aims at capturing prosodic aspects like rhythm, emphasis, melody, duration, and loudness, from a source audio ...
S. Karlapati   +5 more
semanticscholar   +1 more source

Mark my words: tone of voice changes affective word representations in memory. [PDF]

open access: yes
PLoS ONE, 2010
The present study explored the effect of speaker prosody on the representation of words in memory. To this end, participants were presented with a series of words and asked to remember the words for a subsequent recognition test. During study, words were ...
Annett Schirmer
doaj   +1 more source

Hierarchical Prosody Modeling for Non-Autoregressive Speech Synthesis [PDF]

open access: yes
Spoken Language Technology Workshop, 2020
Prosody modeling is an essential component in modern text-to-speech (TTS) frameworks. By explicitly providing prosody features to the TTS model, the style of synthesized utterances can thus be controlled.
C. Chien, Hung-yi Lee
semanticscholar   +1 more source

Prosodie et prosodie musicale

open access: yes
La Bretagne linguistique, 1993
How do the language and its accentuation adapt to the constraints of the melody? Which of these two elements takes precedence in the development of a song? The aim of musical prosody is to highlight the relationship between the accentuation of the lyrics and the melody.
Laurent, Donatien, Goyat, Gilles
openaire   +4 more sources

“Textual Prosody” Can Change Impressions of Reading in People With Normal Hearing and Hearing Loss

open access: yes
Frontiers in Psychology, 2020
Recently, dynamic text presentation, such as scrolling text, has been widely used. Texts are often presented at constant timing and speed in conventional dynamic text presentation.
Miki Uetsuki   +2 more
doaj   +1 more source

Camp: A Two-Stage Approach to Modelling Prosody in Context [PDF]

open access: yes
IEEE International Conference on Acoustics, Speech, and Signal Processing, 2020
Prosody is an integral part of communication, but remains an open problem in state-of-the-art speech synthesis. There are two major issues faced when modelling prosody: (1) prosody varies at a slower rate compared with other content in the acoustic ...
Zack Hodari   +8 more
semanticscholar   +1 more source

Perception of Prosodic Modulations of Linguistic and Paralinguistic Origin: Evidence From Early Auditory Event-Related Potentials

open access: yes
Frontiers in Neuroscience, 2021
How listeners handle prosodic cues of linguistic and paralinguistic origin is a central question for spoken communication. In the present EEG study, we addressed this question by examining neural responses to variations in pitch accent (linguistic) and ...
Hatice Zora, Valéria Csépe
doaj   +1 more source
