Results 1 to 10 of about 154,869 (269)
Speech Recognition with No Speech or with Noisy Speech [PDF]
The performance of automatic speech recognition systems(ASR) degrades in the presence of noisy speech. This paper demonstrates that using electroencephalography (EEG) can help automatic speech recognition systems overcome performance loss in the presence of noise.
Gautam Krishna +3 more
openaire +2 more sources
Learning speech rate in speech recognition [PDF]
A significant performance reduction is often observed in speech recognition when the rate of speech (ROS) is too low or too high. Most of present approaches to addressing the ROS variation focus on the change of speech signals in dynamic properties caused by ROS, and accordingly modify the dynamic model, e.g., the transition probabilities of the hidden
Xiangyu Zeng, Shi Yin, Dong Wang 0013
openaire +2 more sources
AbstractSpeech recognition can be formulated as the problem of guessing a sequence of words that produces a sequence of sounds. The human brain is remarkably good at solving this problem, even though the same words correspond to many different sounds, because of accents or characteristics of the voice. Moreover, the environment is always noisy, to that
openaire +2 more sources
Advancing Speech Recognition With No Speech Or With Noisy Speech [PDF]
In this paper we demonstrate end-to-end continuous speech recognition (CSR) using electroencephalography (EEG) signals with no speech signal as input. An attention model based automatic speech recognition (ASR) and connectionist temporal classification (CTC) based ASR systems were implemented for performing recognition.
Gautam Krishna +3 more
openaire +2 more sources
AbstractClassic research on the perception of speech sought to identify minimal acoustic correlates of each consonant and vowel. In explaining perception, this view designated momentary components of an acoustic spectrum as cues to the recognition of elementary phonemes.
Remez, Robert E, Thomas, Emily F
openaire +3 more sources
Band importance for speech-in-speech recognition [PDF]
Predicting masked speech perception typically relies on estimates of the spectral distribution of cues supporting recognition. Current methods for estimating band importance for speech-in-noise use filtered stimuli. These methods are not appropriate for speech-in-speech because filtering can modify stimulus features affecting auditory stream ...
Buss, Emily, Bosen, Adam
openaire +2 more sources
Speech recognition in parallel [PDF]
Concomitantly with recent advances in speech coding, recognition and production, parallel computer systems are now commonplace delivering raw computing power measured in hundreds of MIPS and Megaflops. It seems inevitable that within the next decade or so, gigaflop parallel processors will be achievable at modest cost.
Salvatore J. Stolfo +3 more
openaire +2 more sources
Unsupervised Speech Recognition
Despite rapid progress in the recent past, current speech recognition systems still require labeled training data which limits this technology to a small fraction of the languages spoken around the globe. This paper describes wav2vec-U, short for wav2vec Unsupervised, a method to train speech recognition models without any labeled data.
Alexei Baevski +3 more
openaire +3 more sources
Continuous speech recognition [PDF]
The authors focus on a tutorial description of the hybrid HMM/ANN method. The approach has been applied to large vocabulary continuous speech recognition, and variants are in use by many researchers, The method provides a mechanism for incorporating a range of sources of evidence without strong assumptions about their joint statistics, and may have ...
Nelson Morgan, Hervé Bourlard
openaire +1 more source
Speech Recognition with Augmented Synthesized Speech [PDF]
Recent success of the Tacotron speech synthesis architecture and its variants in producing natural sounding multi-speaker synthesized speech has raised the exciting possibility of replacing expensive, manually transcribed, domain-specific, human speech that is used to train speech recognizers.
Andrew Rosenberg +6 more
openaire +2 more sources

