Results 71 to 80 of about 288,136 (202)
Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting
We propose a max-pooling based loss function for training Long Short-Term Memory (LSTM) networks for small-footprint keyword spotting (KWS), with low CPU, memory, and latency requirements.
Fu, Gengshen +8 more
core +1 more source
Joint intent detection and slot filling with syntactic and semantic features using multichannel CNN-BiLSTM [PDF]
Understanding spoken language is crucial for conversational agents, with intent detection and slot filling being the primary tasks in natural language understanding (NLU).
Yusuf Idris Muhammad +2 more
doaj +2 more sources
Speaker-following Video Subtitles
We propose a new method for improving the presentation of subtitles in video (e.g. TV and movies). With conventional subtitles, the viewer has to constantly look away from the main viewing area to read the subtitles at the bottom of the screen, which ...
Hu, Yongtao +3 more
core +1 more source
Evaluation of spoken document retrieval for historic speech collections [PDF]
The re-use of spoken word audio collections maintained by audiovisual archives is severely hindered by their generally limited access. The CHoral project, which is part of the CATCH program funded by the Dutch Research Council, aims to provide users of ...
Heeren, W. +4 more
core +1 more source
Reports related to community safety crisis incidents are being escalated and shared on social media and other online digital platforms. These reports must be addressed quickly to concerned organizations to provide welfare support to individuals and ...
Yeshanew Ale Wubet, Kuang-Yow Lian
doaj +1 more source
Sign language recognition (SLR) contains the capability to convert sign language gestures into spoken or written language. This technology is helpful for deaf persons or hard of hearing by providing them with a way to interact with people who do not know
Hadeel Alsolai +5 more
doaj +1 more source
Developing a negative speech emotion recognition model for safety systems using deep learning
Growing threats in public spaces have forced people to question personal security, making technology more relevant, especially in speech recognition. This paper proposes a security safety system by considering keyword and negative emotion detection to ...
Shreya Jena +6 more
doaj +1 more source
Spoken Term Detection Based on Improved Index Structure
The performance of keyword spotting system suffers severe degradation when the index stage is so fast that the lattice may lose lots of information to retrieve the spoken terms. In this paper, we focus on this problem and present two algorithm: the first one called unconstraint word graph expansion (UWGE) and the other called dynamic position specific ...
Zhen Zhang +4 more
openaire +1 more source
Constructing sub-word units for spoken term detection
Spoken term detection, especially of out-of-vocabulary (OOV) keywords, benefits from the use of sub-word systems. We experiment with different language-independent approaches to sub-word unit generation, generating both syllable-like and morpheme-like units, and demonstrate how the performance of syllable-like units can be improved by artificially ...
van Heerden, Charl +4 more
openaire +2 more sources
Spoken Term Detection of Zero-Resource Language Using Posteriorgram of Multiple Languages
Satoru Mizuochi +2 more
semanticscholar +1 more source

