ALBAYZIN Query-by-example Spoken Term Detection 2016 evaluation [PDF]
Query-by-example Spoken Term Detection (QbE STD) aims to retrieve data from a speech repository given an acoustic (spoken) query containing the term of interest as the input.
Javier Tejedor+9 more
doaj +10 more sources
Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation [PDF]
The huge amount of information stored in audio and video repositories makes search on speech (SoS) a priority area nowadays. Within SoS, Query-by-Example Spoken Term Detection (QbE STD) aims to retrieve data from a speech repository given a spoken query.
Javier Tejedor+6 more
doaj +7 more sources
Neural Network Based End-to-End Query by Example Spoken Term Detection [PDF]
This paper focuses on the problem of query by example spoken term detection (QbE-STD) in zero-resource scenario. State-of-the-art approaches primarily rely on dynamic time warping (DTW) based template matching techniques using phone posterior or bottleneck features extracted from a deep neural network (DNN).
Dhananjay Ram+2 more
+6 more sources
Multilingual Bottleneck Features for Query by Example Spoken Term Detection [PDF]
State of the art solutions to query by example spoken term detection (QbE-STD) usually rely on bottleneck feature representation of the query and audio document to perform dynamic time warping (DTW) based template matching. Here, we present a study on QbE-STD performance using several monolingual as well as multilingual bottleneck features extracted ...
Dhananjay Ram+2 more
+9 more sources
Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluations [PDF]
Xunta de Galicia | Ref.
Javier Tejedor+4 more
openalex +6 more sources
Query-by-example spoken term detection on multilingual unconstrained speech [PDF]
As part of the MediaEval 2013 benchmark evaluation campaign, the objective of the Spoken Web Search (SWS) task was to perform Query-by-Example Spoken Term Detection (QbESTD) using audio queries in a low-resource setting. After two successful editions and a continuously growing interest in the scientific community, a special effort was made in SWS 2013 ...
Xavier Anguera+5 more
openalex +5 more sources
Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection [PDF]
In this paper we present a method for automatically generating acoustic sub-word units that can substitute conventional phone models in a query-by-example spoken term detection system. We generate the sub-word units with a modified version of our speaker diarization system.
Marijn Huijbregts+2 more
openalex +4 more sources
Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection [PDF]
In this paper, we propose a deep convolutional neural network-based acoustic word embedding system on code-switching query by example spoken term detection. Different from previous configurations, we combine audio data in two languages for training instead of only using one single language.
Murong Ma+5 more
openalex +5 more sources
Query-by-example spoken term detection based on phonetic posteriorgram [PDF]
Spoken term detection in low-resource situations is a challenging problem, because traditional large vocabulary continuous speech recognition (LVCSR) approaches are often unusable. This paper introduces a method to use deep neural network (DNN) softmax outputs as input features in a query-by-example (QBE) spoken term detection (STD) system.
Beili Song+4 more
openalex +4 more sources
Query-by-Example Spoken Term Detection Using Attention-Based Multi-Hop Networks [PDF]
Retrieving spoken content with spoken queries, or query-by- example spoken term detection (STD), is attractive because it makes possible the matching of signals directly on the acoustic level without transcribing them into text. Here, we propose an end-to-end query-by-example STD model based on an attention-based multi-hop network, whose input is a ...
Chia-Wei Ao, Hung-yi Lee
openalex +4 more sources