Results 11 to 20 of about 50,727 (310)

Prosodic Prominence and Boundaries in Sequence-to-Sequence Speech Synthesis [PDF]

open access: yes, 2020
Recent advances in deep learning methods have elevated synthetic speech quality to human level, and the field is now moving towards addressing prosodic variation in synthetic speech.Despite successes in this effort, the state-of-the-art systems fall ...
Juraj Šimko   +7 more
core   +1 more source

Analysis of speech prosody using WaveNet embeddings : The Lombard effect [PDF]

open access: yes, 2020
We present a novel methodology for speech prosody research based on the analysis of embeddings used to condition a convolutional WaveNet speech synthesis system.
Juraj Šimko   +5 more
core   +1 more source

Document-Level Neural TTS Using Curriculum Learning and Attention Masking

open access: yesIEEE Access, 2021
Speech synthesis has been developed to the level of natural human-level speech synthesized through an attention-based end-to-end text-to-speech synthesis (TTS) model. However, it is difficult to generate attention when synthesizing a text longer than the
Sung-Woong Hwang, Joon-Hyuk Chang
doaj   +1 more source

Improved Speech Synthesis Algorithm Based on Harmonic Plus Noise Excitation Model [PDF]

open access: yesJisuanji gongcheng, 2016
The excitation signal used in the traditional Hidden Markov Model(HMM)-based speech synthesis algorithm is either a pulse train or white gaussian noise,and the synthesis speech sounds buzzy.An improved speech synthesis algorithm based on harmonic plus ...
GE Yongkan,YU Fengqin
doaj   +1 more source

Chinese personalised text‐to‐speech synthesis for robot human–machine interaction

open access: yesIET Cyber-systems and Robotics, 2023
Speech interaction is an important means of robot interaction. With the rapid development of deep learning, end‐to‐end speech synthesis methods based on this technique have gradually become mainstream.
Bao Pang   +5 more
doaj   +1 more source

An Emotion Speech Synthesis Method Based on VITS

open access: yesApplied Sciences, 2023
People and things can be connected through the Internet of Things (IoT), and speech synthesis is one of the key technologies. At this stage, end-to-end speech synthesis systems are capable of synthesizing relatively realistic human voices, but the ...
Wei Zhao, Zheng Yang
doaj   +1 more source

Speech Synthesis With Mixed Emotions

open access: yesIEEE Transactions on Affective Computing, 2023
Emotional speech synthesis aims to synthesize human voices with various emotional effects. The current studies are mostly focused on imitating an averaged style belonging to a specific emotion type. In this paper, we seek to generate speech with a mixture of emotions at run-time.
Kun Zhou 0003   +4 more
openaire   +4 more sources

Computer-Implemented Articulatory Models for Speech Production: A Review

open access: yesFrontiers in Robotics and AI, 2022
Modeling speech production and speech articulation is still an evolving research topic. Some current core questions are: What is the underlying (neural) organization for controlling speech articulation?
Bernd J. Kröger
doaj   +1 more source

Statistical parametric speech synthesis [PDF]

open access: yesSpeech Communication, 2007
This paper gives a general overview of techniques in statistical parametric speech synthesis. One of the instances of these techniques, called HMM-based generation synthesis (or simply HMM-based synthesis), has recently been shown to be very effective in generating acceptable speech synthesis.
Alan W. Black, Heiga Zen, Keiichi Tokuda
openaire   +1 more source

A survey of expressive speech synthesis

open access: yes大数据, 2023
Speech synthesis is a hot research topic in the field of speech, language and machine learning, which aims to synthesize understandable and natural speech for a given text.It has a wide range of applications in industry.One of the goals of speech ...
Haobin TANG   +4 more
doaj  

Home - About - Disclaimer - Privacy