A novel privacy-preserving speech recognition framework using bidirectional LSTM
Utilizing speech as the transmission medium in Internet of things (IoTs) is an effective way to reduce latency while improving the efficiency of human-machine interaction. In the field of speech recognition, Recurrent Neural Network (RNN) has significant
Qingren Wang +4 more
doaj +1 more source
Non-autoregressive Transformer Chinese Speech Recognition Incorporating Pronunciation- Character Representation Conversion [PDF]
The Transformer based on self-attention mechanism shows powerful model performance in speech recognition tasks,where the non-autoregressive Transformer automatic speech recognition model has a faster decoding speed compared with the autoregressive model ...
TENG Sihang, WANG Lie, LI Ya
doaj +1 more source
Combining Multiple Views for Visual Speech Recognition [PDF]
Visual speech recognition is a challenging research problem with a particular practical application of aiding audio speech recognition in noisy scenarios.
Ekenel, Hazım Kemal +3 more
core +2 more sources
Design and implementation of a user-oriented speech recognition interface: the synergy of technology and human factors [PDF]
The design and implementation of a user-oriented speech recognition interface are described. The interface enables the use of speech recognition in so-called interactive voice response systems which can be accessed via a telephone connection.
Kloosterman, Sietse H.
core +4 more sources
Selection of acoustic modeling unit for Tibetan speech recognition based on deep learning [PDF]
The selection of the speech recognition modeling unit is the primary problem of acoustic modeling in speech recognition, and different acoustic modeling units will directly affect the overall performance of speech recognition.
Gong Baojia +4 more
doaj +1 more source
Multilingual Speech Recognition
The speech-to-speech translation system Verbmobil requires a multilingual setting. This consists of recognition engines in the three languages German, English and Japanese that run in one common framework together with a language identification component which is able to switch between these recognizers.
Waibel, Alex +4 more
openaire +4 more sources
Recognition of English speech – using a deep learning algorithm
The accurate recognition of speech is beneficial to the fields of machine translation and intelligent human–computer interaction. After briefly introducing speech recognition algorithms, this study proposed to recognize speech with a recurrent neural ...
Wang Shuyan
doaj +1 more source
Streaming End-to-End Target-Speaker Automatic Speech Recognition and Activity Detection
Automatic speech recognition of a target speaker in the presence of interfering speakers remains a challenging issue. One approach to tackle this problem is target-speaker speech recognition, which conditions the recognition process on an embedding that ...
Takafumi Moriya +4 more
doaj +1 more source
Synopsis on Arabic speech recognition
With the advancement and increased usage of intelligible smart devices, researchers have an intensified interest in the field of large-vocabulary speaker-independent continuous speech recognition.
Fawaz S. Al-Anzi, Dia AbuZeina
doaj +1 more source
Towards a tool for the subjective assessment of speech system interfaces (SASSI) [PDF]
Applications of speech recognition are now widespread, but user-centred evaluation methods are necessary to ensure their success. Objective evaluation techniques are fairly well established, but previous subjective techniques have been unstructured and ...
Graham, R, Hone, KS
core +1 more source

