A novel privacy-preserving speech recognition framework using bidirectional LSTM
Utilizing speech as the transmission medium in the Internet of Things (IoT) is an effective way to reduce latency while improving the efficiency of human-machine interaction. In the field of speech recognition, Recurrent Neural Network (RNN) has significant
Qingren Wang+4 more
doaj +1 more source
Selection of acoustic modeling unit for Tibetan speech recognition based on deep learning [PDF]
The selection of the speech recognition modeling unit is the primary problem of acoustic modeling in speech recognition, and different acoustic modeling units will directly affect the overall performance of speech recognition.
Gong Baojia+4 more
doaj +1 more source
Streaming End-to-End Target-Speaker Automatic Speech Recognition and Activity Detection
Automatic speech recognition of a target speaker in the presence of interfering speakers remains a challenging issue. One approach to tackle this problem is target-speaker speech recognition, which conditions the recognition process on an embedding that ...
Takafumi Moriya+4 more
doaj +1 more source
Classic research on the perception of speech sought to identify minimal acoustic correlates of each consonant and vowel. In explaining perception, this view designated momentary components of an acoustic spectrum as cues to the recognition of elementary phonemes.
Robert E. Remez, Emily F. Thomas
openaire +4 more sources
Croatian Speech Recognition [PDF]
In the chapter we describe procedures for Croatian speech recognition which are used in a limited domain spoken dialog system for Croatian speech. The dialog system would provide information about weather in different regions of Croatia for different time periods (Žibert et al., 2003).
Sanda Martinčić-Ipšić, Ivo Ipšić
openaire +4 more sources
Audio-visual speech recognition with background music using single-channel source separation [PDF]
In this paper, we consider audio-visual speech recognition with background music. The proposed algorithm is an integration of audio-visual speech recognition and single channel source separation (SCSS). We apply the proposed algorithm to recognize spoken
Erdogan, Hakan+4 more
core +1 more source
Synopsis on Arabic speech recognition
With the advancement and increased usage of intelligible smart devices, researchers have an intensified interest in the field of large-vocabulary speaker-independent continuous speech recognition.
Fawaz S. Al-Anzi, Dia AbuZeina
doaj +1 more source
Learning speech rate in speech recognition [PDF]
A significant performance reduction is often observed in speech recognition when the rate of speech (ROS) is too low or too high. Most present approaches to addressing ROS variation focus on the change in the dynamic properties of speech signals caused by ROS, and accordingly modify the dynamic model, e.g., the transition probabilities of the hidden
Dong Wang, Shi Yin, Xiangyu Zeng
openaire +3 more sources
Recognition of English speech – using a deep learning algorithm
The accurate recognition of speech is beneficial to the fields of machine translation and intelligent human–computer interaction. After briefly introducing speech recognition algorithms, this study proposed to recognize speech with a recurrent neural ...
Wang Shuyan
doaj +1 more source
Unvoiced Speech Recognition Using Tissue-Conductive Acoustic Sensor
We present the use of stethoscope and silicon NAM (nonaudible murmur) microphones in automatic speech recognition. NAM microphones are special acoustic sensors, which are attached behind the talker's ear and can capture not only normal (audible) speech ...
Hiroshi Saruwatari+3 more
doaj +2 more sources