Results 11 to 20 of about 288,136 (202)
Lattice Indexing for Spoken Term Detection
This paper considers the problem of constructing an efficient inverted index for the spoken term detection (STD) task. More specifically, we construct a deterministic weighted finite-state transducer storing soft-hits in the form of (utterance ID, start time, end time, posterior score) quadruplets.
Can, Dogan, Saraclar, Murat
openaire +4 more sources
Handling overlaps in spoken term detection [PDF]
Spoken term detection (STD) systems usually arrive at many overlapping detections which are often addressed with some pragmatic approaches, e.g. choosing the best detection to represent all the overlaps. In this paper we present a theoretical study based on a concept of acceptance space.
Wang, Dong +3 more
+6 more sources
The large amount of information stored in audio and video repositories makes search on speech (SoS) a challenging area that is continuously receiving much interest.
Javier Tejedor +4 more
doaj +1 more source
Query-by-Example with Acoustic Word Embeddings Using wav2vec Pretraining [PDF]
Query-by-Example is a popular keyword detection method in the absence of speech resources.It can build a keyword query system with excellent performance when there are few labeled voice resources and a lack of pronunciation dictionaries.In recent years ...
LI Zhao-qi, LI Ta
doaj +1 more source
Deep LSTM Spoken Term Detection using Wav2Vec 2.0 Recognizer [PDF]
In recent years, the standard hybrid DNN-HMM speech recognizers are outperformed by the end-to-end speech recognition systems. One of the very promising approaches is the grapheme Wav2Vec 2.0 model, which uses the self-supervised pretraining approach ...
J. Svec, Jan Lehečka, L. Šmídl
semanticscholar +1 more source
Comparative analysis of machine learning methods to detect fake news in an Urdu language corpus [PDF]
Wide availability and large use of social media enable easy and rapid dissemination of news. The extensive spread of engineered news with intentionally false information has been observed over the past few years.
Adnan Rafique +5 more
doaj +2 more sources
Transformer-based encoder-encoder architecture for Spoken Term Detection [PDF]
The paper presents a method for spoken term detection based on the Transformer architecture. We propose the encoder-encoder architecture employing two BERT-like encoders with additional modifications, including convolutional and upsampling layers ...
J. Svec, L. Šmídl, Jan Lehečka
semanticscholar +1 more source
Amharic Speech Search Using Text Word Query Based on Automatic Sentence-like Segmentation
More than 7000 languages are spoken in the world today. Amharic is one of the languages spoken in the East African country Ethiopia. A lot of speech data is being made every day in different languages as machines are getting better at processing and have
Getnet Mezgebu Brhanemeskel +3 more
doaj +1 more source
Spoken Term Detection and Relevance Score Estimation Using Dot-Product of Pronunciation Embeddings [PDF]
The paper describes a novel approach to Spoken Term Detection (STD) in large spoken archives using deep LSTM networks. The work is based on the previous approach of using Siamese neural networks for STD and naturally extends it to directly localize a ...
J. Svec +3 more
semanticscholar +1 more source
CNN-Based Spoken Term Detection and Localization without Dynamic Programming [PDF]
In this paper, we propose a spoken term detection algorithm for simultaneous prediction and localization of in-vocabulary and out-of-vocabulary terms within an audio segment.
T. Fuchs, Yael Segal, Joseph Keshet
semanticscholar +1 more source

