Results 21 to 30 of about 275,421 (290)

A novel privacy-preserving speech recognition framework using bidirectional LSTM

open access: yesJournal of Cloud Computing: Advances, Systems and Applications, 2020
Utilizing speech as the transmission medium in Internet of things (IoTs) is an effective way to reduce latency while improving the efficiency of human-machine interaction. In the field of speech recognition, Recurrent Neural Network (RNN) has significant
Qingren Wang   +4 more
doaj   +1 more source

Overview of Automatic Speech Recognition, Approaches and Challenges: Way the Future to Turkish Speech Recognition

open access: yesGazi Üniversitesi Fen Bilimleri Dergisi, 2019
Speech is a paramount means of communication among humans, which makes recognition of the speech by computers is a study area of significance. In this research area, many studies have been carried out based on different languages.
Saadin OYUCU   +2 more
doaj   +1 more source

Selection of acoustic modeling unit for Tibetan speech recognition based on deep learning [PDF]

open access: yesMATEC Web of Conferences, 2021
The selection of the speech recognition modeling unit is the primary problem of acoustic modeling in speech recognition, and different acoustic modeling units will directly affect the overall performance of speech recognition.
Gong Baojia   +4 more
doaj   +1 more source

Application of dynamic time warping optimization algorithm in speech recognition of machine translation

open access: yesHeliyon, 2023
Speech recognition is the foundation of human-computer interaction technology and an important aspect of speech signal processing, with broad application prospects. Therefore, it is very necessary to recognize speech.
Shaohua Jiang, Zheng Chen
doaj  

Streaming End-to-End Target-Speaker Automatic Speech Recognition and Activity Detection

open access: yesIEEE Access, 2023
Automatic speech recognition of a target speaker in the presence of interfering speakers remains a challenging issue. One approach to tackle this problem is target-speaker speech recognition, which conditions the recognition process on an embedding that ...
Takafumi Moriya   +4 more
doaj   +1 more source

Optimization of Intelligent English Pronunciation Training System Based on Android Platform

open access: yesComplexity, 2021
Oral English, as a language tool, is not only an important part of English learning but also an essential part. For nonnative English learners, effective and meaningful voice feedback is very important. At present, most of the traditional recognition and
Qianyu Cao, Hanmei Hao
doaj   +1 more source

Large Vocabulary Spontaneous Speech Recognition for Tigrigna [PDF]

open access: yesarXiv, 2023
This thesis proposes and describes a research attempt at designing and developing a speaker independent spontaneous automatic speech recognition system for Tigrigna The acoustic model of the Speech Recognition System is developed using Carnegie Mellon University Automatic Speech Recognition development tool (Sphinx) while the SRIM tool is used for the ...
arxiv  

Recognition of English speech – using a deep learning algorithm

open access: yesJournal of Intelligent Systems, 2023
The accurate recognition of speech is beneficial to the fields of machine translation and intelligent human–computer interaction. After briefly introducing speech recognition algorithms, this study proposed to recognize speech with a recurrent neural ...
Wang Shuyan
doaj   +1 more source

Unvoiced Speech Recognition Using Tissue-Conductive Acoustic Sensor

open access: yesEURASIP Journal on Advances in Signal Processing, 2007
We present the use of stethoscope and silicon NAM (nonaudible murmur) microphones in automatic speech recognition. NAM microphones are special acoustic sensors, which are attached behind the talker's ear and can capture not only normal (audible) speech ...
Hiroshi Saruwatari   +3 more
doaj   +2 more sources

Silent versus modal multi-speaker speech recognition from ultrasound and video [PDF]

open access: yesarXiv, 2021
We investigate multi-speaker speech recognition from ultrasound images of the tongue and video images of the lips. We train our systems on imaging data from modal speech, and evaluate on matched test sets of two speaking modes: silent and modal speech.
arxiv  

Home - About - Disclaimer - Privacy