Cascade or Direct Speech Translation? A Case Study
Speech translation has been traditionally tackled under a cascade approach, chaining speech recognition and machine translation components to translate from an audio source in a given language into text or speech in a target language.
T. Etchegoyhen +7 more
Audio-Visual Target Speaker Enhancement on Multi-Talker Environment Using Event-Driven Cameras
We propose a method to address audio-visual target speaker enhancement in multi-talker environments using event-driven cameras. State-of-the-art audio-visual speech separation methods show that the crucial information is the movement of the facial landmarks ...
A. Arriandiaga +4 more
Enhancing Voice Cloning Quality through Data Selection and Alignment-Based Metrics
Voice cloning, an emerging field in the speech-processing area, aims to generate synthetic utterances that closely resemble the voices of specific individuals.
Ander González-Docasal, Aitor Álvarez
Changing Occupational Roles in Audit Society—The Case of Swedish Student Aid Officials
This article is about occupational change concerning a non-professional group of Street Level Bureaucrats—student aid officials at the Swedish Board for Study Support (SBSS).
A. Bruhn
Some of the following articles may not be open access.
Exploring the limits of neural voice cloning: A case study on two well-known personalities
IberSPEECH Conference, 2022
This work describes one successful and one failed voice cloning process for two famous personalities, intended for broadcast in a high-impact podcast and in a Spanish public television program.
Ander González-Docasal +2 more
Digital Voice Assistants: A new kind of user agent
IEEE International Conference on Control, Measurement and Instrumentation, 2020
Digital voice assistants provide a new way of browsing and interacting online. Now we can simply speak and listen to our devices, almost as if to a human assistant.
A. T. Christensen +2 more
Using Text-to-Speech to Prototype Game Dialog
Conference on Computability in Europe, 2018
Voice acting is common in computer games in many genres. The recording and processing of voice acting is a time-consuming process that involves, for instance, voice actors, directors, audio engineers, and game writers.
H. Engström, Per Anders Östblad
Sign-to-speech translation using machine-learning-assisted stretchable sensor arrays
Nature Electronics, 2020
Kyle Chen, Xiaoshi Li, Songlin Zhang

