Results 1 to 10 of about 31 (26)

Cascade or Direct Speech Translation? A Case Study

open access: yesApplied Sciences, 2022
Speech translation has been traditionally tackled under a cascade approach, chaining speech recognition and machine translation components to translate from an audio source in a given language into text or speech in a target language.
T. Etchegoyhen   +7 more
semanticscholar   +1 more source

Audio-Visual Target Speaker Enhancement on Multi-Talker Environment Using Event-Driven Cameras

open access: yesInternational Symposium on Circuits and Systems, 2021
We propose a method to address audio-visual target speaker enhancement in multi-talker environments using eventdriven cameras. State of the art audio-visual speech separation methods shows that crucial information is the movement of the facial landmarks ...
A. Arriandiaga   +4 more
semanticscholar   +1 more source

Enhancing Voice Cloning Quality through Data Selection and Alignment-Based Metrics

open access: yesApplied Sciences, 2023
Voice cloning, an emerging field in the speech-processing area, aims to generate synthetic utterances that closely resemble the voices of specific individuals.
Ander González-Docasal, Aitor Álvarez
semanticscholar   +1 more source

Changing Occupational Roles in Audit Society—The Case of Swedish Student Aid Officials

open access: yes, 2015
This article is about occupational change concerning a non-professional group of Street Level Bureaucrats—student aid officials at the Swedish Board for Study Support (SBSS).
A. Bruhn
semanticscholar   +1 more source
Some of the next articles are maybe not open access.

Exploring the limits of neural voice cloning: A case study on two well-known personalities

IberSPEECH Conference, 2022
This work describes one successful and one failed Voice Cloning processes of two famous personalities in order to be broadcast in a high-impact podcast and in a Spanish public tele-vision program.
Ander González-Docasal   +2 more
semanticscholar   +1 more source

Digital Voice Assistants: A new kind of user agent

IEEE International Conference on Control, Measurement and Instrumentation, 2020
Digital voice assistants provide a new way of browsing and interacting online. Now we can simply speak and listen to our devices, almost as if to a human assistant.
A. T. Christensen   +2 more
semanticscholar   +1 more source

Using Text-to-Speech to Prototype Game Dialog

Conference on Computability in Europe, 2018
Voice acting is common in computer games in many genres. The recording and processing of voice acting is a time-consuming process that involves, for instance, voice actors, directors, audio engineers, and game writers.
H. Engström, Per Anders Östblad
semanticscholar   +1 more source

A high-performance speech neuroprosthesis

Nature, 2023
Francis R Willett   +2 more
exaly  

Deep learning

Nature, 2015
Yann Lecun   +2 more
exaly  

Sign-to-speech translation using machine-learning-assisted stretchable sensor arrays

Nature Electronics, 2020
Kyle Chen, Xiaoshi Li, Songlin Zhang
exaly  

Home - About - Disclaimer - Privacy