Audio and speech processing eess.as

Results 1 to 10 of about 31 (26)

Cascade or Direct Speech Translation? A Case Study

Applied Sciences, 2022
Speech translation has been traditionally tackled under a cascade approach, chaining speech recognition and machine translation components to translate from an audio source in a given language into text or speech in a target language.
T. Etchegoyhen +7 more
semanticscholar +1 more source

Audio-Visual Target Speaker Enhancement on Multi-Talker Environment Using Event-Driven Cameras

International Symposium on Circuits and Systems, 2021
We propose a method to address audio-visual target speaker enhancement in multi-talker environments using eventdriven cameras. State of the art audio-visual speech separation methods shows that crucial information is the movement of the facial landmarks ...
A. Arriandiaga +4 more
semanticscholar +1 more source

Enhancing Voice Cloning Quality through Data Selection and Alignment-Based Metrics

Applied Sciences, 2023
Voice cloning, an emerging field in the speech-processing area, aims to generate synthetic utterances that closely resemble the voices of specific individuals.
Ander González-Docasal, Aitor Álvarez
semanticscholar +1 more source

Changing Occupational Roles in Audit Society—The Case of Swedish Student Aid Officials

, 2015
This article is about occupational change concerning a non-professional group of Street Level Bureaucrats—student aid officials at the Swedish Board for Study Support (SBSS).
A. Bruhn
semanticscholar +1 more source

Some of the next articles are maybe not open access.

Exploring the limits of neural voice cloning: A case study on two well-known personalities

IberSPEECH Conference, 2022
This work describes one successful and one failed Voice Cloning processes of two famous personalities in order to be broadcast in a high-impact podcast and in a Spanish public tele-vision program.
Ander González-Docasal, Aitor Álvarez, Haritz Arzelus +2 more
semanticscholar +1 more source

Digital Voice Assistants: A new kind of user agent

IEEE International Conference on Control, Measurement and Instrumentation, 2020
Digital voice assistants provide a new way of browsing and interacting online. Now we can simply speak and listen to our devices, almost as if to a human assistant.
A. T. Christensen, H. Olesen, L. Sørensen +2 more
semanticscholar +1 more source

Using Text-to-Speech to Prototype Game Dialog

Conference on Computability in Europe, 2018
Voice acting is common in computer games in many genres. The recording and processing of voice acting is a time-consuming process that involves, for instance, voice actors, directors, audio engineers, and game writers.
H. Engström, Per Anders Östblad
semanticscholar +1 more source

A high-performance speech neuroprosthesis

Nature, 2023
Francis R Willett, Guy H Wilson, Eun Young Choi +2 more
exaly

Deep learning

Nature, 2015
Yann Lecun, Yoshua Bengio, Geoffrey Hinton +2 more
exaly

Sign-to-speech translation using machine-learning-assisted stretchable sensor arrays

Nature Electronics, 2020
Kyle Chen, Xiaoshi Li, Songlin Zhang
exaly

computer science