Results 11 to 20 of about 50,727 (310)
Prosodic Prominence and Boundaries in Sequence-to-Sequence Speech Synthesis [PDF]
Recent advances in deep learning methods have elevated synthetic speech quality to human level, and the field is now moving towards addressing prosodic variation in synthetic speech.Despite successes in this effort, the state-of-the-art systems fall ...
Juraj Šimko +7 more
core +1 more source
Analysis of speech prosody using WaveNet embeddings : The Lombard effect [PDF]
We present a novel methodology for speech prosody research based on the analysis of embeddings used to condition a convolutional WaveNet speech synthesis system.
Juraj Šimko +5 more
core +1 more source
Document-Level Neural TTS Using Curriculum Learning and Attention Masking
Speech synthesis has been developed to the level of natural human-level speech synthesized through an attention-based end-to-end text-to-speech synthesis (TTS) model. However, it is difficult to generate attention when synthesizing a text longer than the
Sung-Woong Hwang, Joon-Hyuk Chang
doaj +1 more source
Improved Speech Synthesis Algorithm Based on Harmonic Plus Noise Excitation Model [PDF]
The excitation signal used in the traditional Hidden Markov Model(HMM)-based speech synthesis algorithm is either a pulse train or white gaussian noise,and the synthesis speech sounds buzzy.An improved speech synthesis algorithm based on harmonic plus ...
GE Yongkan,YU Fengqin
doaj +1 more source
Chinese personalised text‐to‐speech synthesis for robot human–machine interaction
Speech interaction is an important means of robot interaction. With the rapid development of deep learning, end‐to‐end speech synthesis methods based on this technique have gradually become mainstream.
Bao Pang +5 more
doaj +1 more source
An Emotion Speech Synthesis Method Based on VITS
People and things can be connected through the Internet of Things (IoT), and speech synthesis is one of the key technologies. At this stage, end-to-end speech synthesis systems are capable of synthesizing relatively realistic human voices, but the ...
Wei Zhao, Zheng Yang
doaj +1 more source
Speech Synthesis With Mixed Emotions
Emotional speech synthesis aims to synthesize human voices with various emotional effects. The current studies are mostly focused on imitating an averaged style belonging to a specific emotion type. In this paper, we seek to generate speech with a mixture of emotions at run-time.
Kun Zhou 0003 +4 more
openaire +4 more sources
Computer-Implemented Articulatory Models for Speech Production: A Review
Modeling speech production and speech articulation is still an evolving research topic. Some current core questions are: What is the underlying (neural) organization for controlling speech articulation?
Bernd J. Kröger
doaj +1 more source
Statistical parametric speech synthesis [PDF]
This paper gives a general overview of techniques in statistical parametric speech synthesis. One of the instances of these techniques, called HMM-based generation synthesis (or simply HMM-based synthesis), has recently been shown to be very effective in generating acceptable speech synthesis.
Alan W. Black, Heiga Zen, Keiichi Tokuda
openaire +1 more source
A survey of expressive speech synthesis
Speech synthesis is a hot research topic in the field of speech, language and machine learning, which aims to synthesize understandable and natural speech for a given text.It has a wide range of applications in industry.One of the goals of speech ...
Haobin TANG +4 more
doaj

