Results 11 to 20 of about 1,117,962 (24)

Unsupervised Transfer Learning for Spoken Language Understanding in Intelligent Agents

open access: yes, 2018
User interaction with voice-powered agents generates large amounts of unlabeled utterances. In this paper, we explore techniques to efficiently transfer the knowledge from these unlabeled utterances to improve model performance on Spoken Language ...
Goyal, Anuj   +2 more
core   +1 more source

Reconstruction of Phonated Speech from Whispers Using Formant-Derived Plausible Pitch Modulation [PDF]

open access: yes, 2015
Whispering is a natural, unphonated, secondary aspect of speech communications for most people. However, it is the primary mechanism of communications for some speakers who have impaired voice production mechanisms, such as partial laryngectomees, as ...
Beigi Homayoon   +13 more
core   +1 more source

A Public Voice for Youth: The Audience Problem in Digital Media and Civic Education [PDF]

open access: yes, 2008
Part of the Volume on Civic Life Online: Learning How Digital Media Can Engage Youth.Students should have opportunities to create digital media in schools.
Peter Levine
core  

Visual Speech Enhancement

open access: yes, 2018
When video is shot in noisy environment, the voice of a speaker seen in the video can be enhanced using the visible mouth movements, reducing background noise.
Gabbay, Aviv   +2 more
core   +1 more source

State-of-the-art Speech Recognition With Sequence-to-Sequence Models

open access: yes, 2018
Attention-based encoder-decoder architectures such as Listen, Attend, and Spell (LAS), subsume the acoustic, pronunciation and language model components of a traditional automatic speech recognition (ASR) system into a single neural network.
Bacchiani, Michiel   +13 more
core   +1 more source

Formal and informal systems of VET: implications for employee involvement [PDF]

open access: yes, 2009
The age-old conundrum embodied in the skills challenge is this: if it is accepted that skills are a good thing, then why is it that the uptake of skills development practices, through, for example, training and lifelong learning agenda, are not ...
Chan, Paul, Moehler, Robert
core  

VoxCeleb2: Deep Speaker Recognition

open access: yes, 2018
The objective of this paper is speaker recognition under noisy and unconstrained conditions. We make two key contributions. First, we introduce a very large-scale audio-visual speaker recognition dataset collected from open-source media.
Chung, Joon Son   +2 more
core   +1 more source

Remote voice training: A case study on space shuttle applications, appendix C [PDF]

open access: yes
The Tile Automation System includes applications of automation and robotics technology to all aspects of the Shuttle tile processing and inspection system. An integrated set of rapid prototyping testbeds was developed which include speech recognition and
Hamid, Tamin, Mollakarimi, Cindy
core   +1 more source

Real-Time Statistical Speech Translation

open access: yes, 2014
This research investigates the Statistical Machine Translation approaches to translate speech in real time automatically. Such systems can be used in a pipeline with speech recognition and synthesis software in order to produce a real-time voice ...
A. Radziszewski, D. Cer, F. Holz
core   +1 more source

The Army word recognition system [PDF]

open access: yes
The application of speech recognition technology in the Army command and control area is presented. The problems associated with this program are described as well as as its relevance in terms of the man/machine interactions, voice inflexions, and the ...
Hadden, David R., Haratz, David
core   +1 more source

Home - About - Disclaimer - Privacy