Results 1 to 10 of about 371,452 (184)

Exploring Prosodic Features Modelling for Secondary Emotions Needed for Empathetic Speech Synthesis [PDF]

open access: yesSensors, 2023
A low-resource emotional speech synthesis system for empathetic speech synthesis based on modelling prosody features is presented here. Secondary emotions, identified to be needed for empathetic speech, are modelled and synthesised in this investigation.
Jesin James   +3 more
doaj   +2 more sources

Speech synthesis, Speech simulation and speech science [PDF]

open access: yes7th International Conference on Spoken Language Processing (ICSLP 2002), 2002
Speech synthesis research has been transformed in recent years through the exploitation of speech corpora - both for statistical modelling and as a source of signals for concatenative synthesis.
Huckvale, M
core   +2 more sources

Methods of countering speech synthesis attacks on voice biometric systems in banking (review article) [PDF]

open access: yesНаучно-технический вестник информационных технологий, механики и оптики, 2021
The paper considers methods of countering speech synthesis attacks on voice biometric systems in banking. Voicebiometrics security is a large-scale problem significantly raised over the past few years.
Aleksandr Yu. Kuznetsov   +5 more
doaj   +1 more source

Sound-Action Symbolism

open access: yesFrontiers in Psychology, 2021
Recent evidence has shown linkages between actions and segmental elements of speech. For instance, close-front vowels are sound symbolically associated with the precision grip, and front vowels are associated with forward-directed limb movements.
Lari Vainio, Lari Vainio, Martti Vainio
doaj   +1 more source

Tibetan speech synthesis based on an improved neural network [PDF]

open access: yesMATEC Web of Conferences, 2021
Nowadays, Tibetan speech synthesis based on neural network has become the mainstream synthesis method. Among them, the griffin-lim vocoder is widely used in Tibetan speech synthesis because of its relatively simple synthesis.Aiming at the problem of low ...
Ding Yuntao, Cai Rangzhuoma, Gong Baojia
doaj   +1 more source

Effective Zero-Shot Multi-Speaker Text-to-Speech Technique Using Information Perturbation and a Speaker Encoder

open access: yesSensors, 2023
Speech synthesis is a technology that converts text into speech waveforms. With the development of deep learning, neural network-based speech synthesis technology is being researched in various fields, and the quality of synthesized speech has ...
Chae-Woon Bang, Chanjun Chun
doaj   +1 more source

A Situational Analysis of Current Speech-Synthesis Systems for Child Voices: A Scoping Review of Qualitative and Quantitative Evidence

open access: yesApplied Sciences, 2022
(1) Background: Speech synthesis has customarily focused on adult speech, but with the rapid development of speech-synthesis technology, it is now possible to create child voices with a limited amount of child-speech data.
Camryn Terblanche   +3 more
doaj   +1 more source

Semi-Supervised Learning for Robust Emotional Speech Synthesis with Limited Data

open access: yesApplied Sciences, 2023
Emotional speech synthesis is an important branch of human–computer interaction technology that aims to generate emotionally expressive and comprehensible speech based on the input text.
Jialin Zhang   +3 more
doaj   +1 more source

Vietnamese Speech Synthesis Based on Transfer Learning [PDF]

open access: yesJisuanji kexue, 2023
Vietnamese is the official language of the Socialist Republic of Vietnam.It belongs to the Vietnamese branch of the Viet Muang language family of the South Asian language family.In recent years,deep learning-based speech synthesis has been able to ...
YANG Lin, YANG Jian, CAI Haoran, LIU Cong
doaj   +1 more source

Document-Level Neural TTS Using Curriculum Learning and Attention Masking

open access: yesIEEE Access, 2021
Speech synthesis has been developed to the level of natural human-level speech synthesized through an attention-based end-to-end text-to-speech synthesis (TTS) model. However, it is difficult to generate attention when synthesizing a text longer than the
Sung-Woong Hwang, Joon-Hyuk Chang
doaj   +1 more source

Home - About - Disclaimer - Privacy