Results 21 to 30 of about 19,322 (256)
Exploring discrete wavelet transforms for bimodal speech recognition [PDF]
Discrete Wavelet Transforms (DWTs) provide time–frequency representations that are well suited for nonstationary signals such as speech. This study presents a comparison of four wavelet families (Daubechies, Symlets, Coiflets, and Biorthogonal) for ...
Marković Branko +2 more
doaj +1 more source
IntroductionIn recent years, machines powered by deep learning have achieved near-human levels of performance in speech recognition. The fields of artificial intelligence and cognitive neuroscience have finally reached a similar level of performance ...
Cai Wingfield +9 more
doaj +1 more source
Speech recognition, which remained on the fringes of commercial interest for many years, came into prominence recently due to the support extended to this research area by the Advanced Research Projects Agency of theusa. More recently, this area of research has received added impetus due to the priority assigned to the development of fifth generation ...
P V S Rao, K K Paliwal
openaire +2 more sources
Audio-Visual Speech and Gesture Recognition by Sensors of Mobile Devices
Audio-visual speech recognition (AVSR) is one of the most promising solutions for reliable speech recognition, particularly when audio is corrupted by noise. Additional visual information can be used for both automatic lip-reading and gesture recognition.
Dmitry Ryumin +2 more
doaj +1 more source
Automatic Speech Recognition from Neural Signals: A Focused Review
Speech interfaces have become widely accepted and are nowadays integrated in various real-life applications and devices. They have become a part of our daily life. However, speech interfaces presume the ability to produce intelligible speech, which might
Christian Herff, Tanja Schultz
doaj +1 more source
Automatic testing of speech recognition [PDF]
Speech reception tests are commonly administered by manually scoring the oral response of the subject. This requires a test supervisor to be continuously present. To avoid this, a subject can type the response, after which it can be scored automatically.
Francart, Tom +2 more
openaire +3 more sources
Speech production knowledge in automatic speech recognition [PDF]
Although much is known about how speech is produced, and research into speech production has resulted in measured articulatory data, feature systems of different kinds, and numerous models, speech production knowledge is almost totally ignored in current mainstream approaches to automatic speech recognition.
King, Simon +5 more
openaire +3 more sources
A Method Improves Speech Recognition with Contrastive Learning in Low-Resource Languages
Building an effective automatic speech recognition system typically requires a large amount of high-quality labeled data; However, this can be challenging for low-resource languages.
Lixu Sun, Nurmemet Yolwas, Lina Jiang
doaj +1 more source
Likelihood-Maximizing-Based Multiband Spectral Subtraction for Robust Speech Recognition
Automatic speech recognition performance degrades significantly when speech is affected by environmental noise. Nowadays, the major challenge is to achieve good robustness in adverse noisy conditions so that automatic speech recognizers can be used in ...
Bagher BabaAli +2 more
doaj +1 more source
KsponSpeech: Korean Spontaneous Speech Corpus for Automatic Speech Recognition
This paper introduces a large-scale spontaneous speech corpus of Korean, named KsponSpeech. This corpus contains 969 h of general open-domain dialog utterances, spoken by about 2000 native Korean speakers in a clean environment. All data were constructed
Jeong-Uk Bang +9 more
doaj +1 more source

