Results 11 to 20 of about 1,117,962 (24)
Unsupervised Transfer Learning for Spoken Language Understanding in Intelligent Agents
User interaction with voice-powered agents generates large amounts of unlabeled utterances. In this paper, we explore techniques to efficiently transfer the knowledge from these unlabeled utterances to improve model performance on Spoken Language ...
Goyal, Anuj +2 more
core +1 more source
Reconstruction of Phonated Speech from Whispers Using Formant-Derived Plausible Pitch Modulation [PDF]
Whispering is a natural, unphonated, secondary aspect of speech communications for most people. However, it is the primary mechanism of communications for some speakers who have impaired voice production mechanisms, such as partial laryngectomees, as ...
Beigi Homayoon +13 more
core +1 more source
A Public Voice for Youth: The Audience Problem in Digital Media and Civic Education [PDF]
Part of the Volume on Civic Life Online: Learning How Digital Media Can Engage Youth.Students should have opportunities to create digital media in schools.
Peter Levine
core
When video is shot in noisy environment, the voice of a speaker seen in the video can be enhanced using the visible mouth movements, reducing background noise.
Gabbay, Aviv +2 more
core +1 more source
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Attention-based encoder-decoder architectures such as Listen, Attend, and Spell (LAS), subsume the acoustic, pronunciation and language model components of a traditional automatic speech recognition (ASR) system into a single neural network.
Bacchiani, Michiel +13 more
core +1 more source
Formal and informal systems of VET: implications for employee involvement [PDF]
The age-old conundrum embodied in the skills challenge is this: if it is accepted that skills are a good thing, then why is it that the uptake of skills development practices, through, for example, training and lifelong learning agenda, are not ...
Chan, Paul, Moehler, Robert
core
VoxCeleb2: Deep Speaker Recognition
The objective of this paper is speaker recognition under noisy and unconstrained conditions. We make two key contributions. First, we introduce a very large-scale audio-visual speaker recognition dataset collected from open-source media.
Chung, Joon Son +2 more
core +1 more source
Remote voice training: A case study on space shuttle applications, appendix C [PDF]
The Tile Automation System includes applications of automation and robotics technology to all aspects of the Shuttle tile processing and inspection system. An integrated set of rapid prototyping testbeds was developed which include speech recognition and
Hamid, Tamin, Mollakarimi, Cindy
core +1 more source
Real-Time Statistical Speech Translation
This research investigates the Statistical Machine Translation approaches to translate speech in real time automatically. Such systems can be used in a pipeline with speech recognition and synthesis software in order to produce a real-time voice ...
A. Radziszewski, D. Cer, F. Holz
core +1 more source
The Army word recognition system [PDF]
The application of speech recognition technology in the Army command and control area is presented. The problems associated with this program are described as well as as its relevance in terms of the man/machine interactions, voice inflexions, and the ...
Hadden, David R., Haratz, David
core +1 more source

