Spoken term detection - Open Access .click

Results 11 to 20 of about 288,136 (202)

Lattice Indexing for Spoken Term Detection

IEEE Transactions on Audio, Speech, and Language Processing, 2011
This paper considers the problem of constructing an efficient inverted index for the spoken term detection (STD) task. More specifically, we construct a deterministic weighted finite-state transducer storing soft-hits in the form of (utterance ID, start time, end time, posterior score) quadruplets.
Can, Dogan, Saraclar, Murat
openaire +4 more sources

Handling overlaps in spoken term detection [PDF]

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011
Spoken term detection (STD) systems usually arrive at many overlapping detections which are often addressed with some pragmatic approaches, e.g. choosing the best detection to represent all the overlaps. In this paper we present a theoretical study based on a concept of acceptance space.
Wang, Dong +3 more
+6 more sources

The Multi-Domain International Search on Speech 2020 ALBAYZIN Evaluation: Overview, Systems, Results, Discussion and Post-Evaluation Analyses

Applied Sciences, 2021
The large amount of information stored in audio and video repositories makes search on speech (SoS) a challenging area that is continuously receiving much interest.
Javier Tejedor +4 more
doaj +1 more source

Query-by-Example with Acoustic Word Embeddings Using wav2vec Pretraining [PDF]

Jisuanji kexue, 2022
Query-by-Example is a popular keyword detection method in the absence of speech resources.It can build a keyword query system with excellent performance when there are few labeled voice resources and a lack of pronunciation dictionaries.In recent years ...
LI Zhao-qi, LI Ta
doaj +1 more source

Deep LSTM Spoken Term Detection using Wav2Vec 2.0 Recognizer [PDF]

Interspeech, 2022
In recent years, the standard hybrid DNN-HMM speech recognizers are outperformed by the end-to-end speech recognition systems. One of the very promising approaches is the grapheme Wav2Vec 2.0 model, which uses the self-supervised pretraining approach ...
J. Svec, Jan Lehečka, L. Šmídl
semanticscholar +1 more source

Comparative analysis of machine learning methods to detect fake news in an Urdu language corpus [PDF]

PeerJ Computer Science, 2022
Wide availability and large use of social media enable easy and rapid dissemination of news. The extensive spread of engineered news with intentionally false information has been observed over the past few years.
Adnan Rafique +5 more
doaj +2 more sources

Transformer-based encoder-encoder architecture for Spoken Term Detection [PDF]

Asian Conference on Pattern Recognition, 2022
The paper presents a method for spoken term detection based on the Transformer architecture. We propose the encoder-encoder architecture employing two BERT-like encoders with additional modifications, including convolutional and upsampling layers ...
J. Svec, L. Šmídl, Jan Lehečka
semanticscholar +1 more source

Amharic Speech Search Using Text Word Query Based on Automatic Sentence-like Segmentation

Applied Sciences, 2022
More than 7000 languages are spoken in the world today. Amharic is one of the languages spoken in the East African country Ethiopia. A lot of speech data is being made every day in different languages as machines are getting better at processing and have
Getnet Mezgebu Brhanemeskel +3 more
doaj +1 more source

Spoken Term Detection and Relevance Score Estimation Using Dot-Product of Pronunciation Embeddings [PDF]

Interspeech, 2021
The paper describes a novel approach to Spoken Term Detection (STD) in large spoken archives using deep LSTM networks. The work is based on the previous approach of using Siamese neural networks for STD and naturally extends it to directly localize a ...
J. Svec, L. Šmídl, J. Psutka, A. Pražák +3 more
semanticscholar +1 more source

CNN-Based Spoken Term Detection and Localization without Dynamic Programming [PDF]

IEEE International Conference on Acoustics, Speech, and Signal Processing, 2021
In this paper, we propose a spoken term detection algorithm for simultaneous prediction and localization of in-vocabulary and out-of-vocabulary terms within an audio segment.
T. Fuchs, Yael Segal, Joseph Keshet
semanticscholar +1 more source

computer science
linguistics
engineering

deep learning
international evaluation
query-by-example spoken term detection

search on speech
machine learning
fos: computer and information sciences