Results 1 to 10 of about 777 (141)

ALBAYZIN Query-by-example Spoken Term Detection 2016 evaluation [PDF]

open access: goldEURASIP Journal on Audio, Speech, and Music Processing, 2018
Query-by-example Spoken Term Detection (QbE STD) aims to retrieve data from a speech repository given an acoustic (spoken) query containing the term of interest as the input.
Javier Tejedor   +9 more
doaj   +10 more sources

Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation [PDF]

open access: goldEURASIP Journal on Audio, Speech, and Music Processing, 2019
The huge amount of information stored in audio and video repositories makes search on speech (SoS) a priority area nowadays. Within SoS, Query-by-Example Spoken Term Detection (QbE STD) aims to retrieve data from a speech repository given a spoken query.
Javier Tejedor   +6 more
doaj   +7 more sources

Neural Network Based End-to-End Query by Example Spoken Term Detection [PDF]

open access: greenIEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020
This paper focuses on the problem of query by example spoken term detection (QbE-STD) in zero-resource scenario. State-of-the-art approaches primarily rely on dynamic time warping (DTW) based template matching techniques using phone posterior or bottleneck features extracted from a deep neural network (DNN).
Dhananjay Ram   +2 more
  +6 more sources

Multilingual Bottleneck Features for Query by Example Spoken Term Detection [PDF]

open access: green2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2019
State of the art solutions to query by example spoken term detection (QbE-STD) usually rely on bottleneck feature representation of the query and audio document to perform dynamic time warping (DTW) based template matching. Here, we present a study on QbE-STD performance using several monolingual as well as multilingual bottleneck features extracted ...
Dhananjay Ram   +2 more
  +9 more sources

Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluations [PDF]

open access: goldEURASIP Journal on Audio, Speech, and Music Processing, 2016
Xunta de Galicia | Ref.
Javier Tejedor   +4 more
openalex   +6 more sources

Query-by-example spoken term detection on multilingual unconstrained speech [PDF]

open access: greenInterspeech 2014, 2014
As part of the MediaEval 2013 benchmark evaluation campaign, the objective of the Spoken Web Search (SWS) task was to perform Query-by-Example Spoken Term Detection (QbESTD) using audio queries in a low-resource setting. After two successful editions and a continuously growing interest in the scientific community, a special effort was made in SWS 2013 ...
Xavier Anguera   +5 more
openalex   +5 more sources

Unsupervised acoustic sub-word unit detection for query-by-example spoken term detection [PDF]

open access: green2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011
In this paper we present a method for automatically generating acoustic sub-word units that can substitute conventional phone models in a query-by-example spoken term detection system. We generate the sub-word units with a modified version of our speaker diarization system.
Marijn Huijbregts   +2 more
openalex   +4 more sources

Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection [PDF]

open access: green2021 12th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2021
In this paper, we propose a deep convolutional neural network-based acoustic word embedding system on code-switching query by example spoken term detection. Different from previous configurations, we combine audio data in two languages for training instead of only using one single language.
Murong Ma   +5 more
openalex   +5 more sources

Query-by-example spoken term detection based on phonetic posteriorgram [PDF]

open access: diamondAdvances in Social Science, Education and Humanities Research, 2015
Spoken term detection in low-resource situations is a challenging problem, because traditional large vocabulary continuous speech recognition (LVCSR) approaches are often unusable. This paper introduces a method to use deep neural network (DNN) softmax outputs as input features in a query-by-example (QBE) spoken term detection (STD) system.
Beili Song   +4 more
openalex   +4 more sources

Query-by-Example Spoken Term Detection Using Attention-Based Multi-Hop Networks [PDF]

open access: green2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018
Retrieving spoken content with spoken queries, or query-by- example spoken term detection (STD), is attractive because it makes possible the matching of signals directly on the acoustic level without transcribing them into text. Here, we propose an end-to-end query-by-example STD model based on an attention-based multi-hop network, whose input is a ...
Chia-Wei Ao, Hung-yi Lee
openalex   +4 more sources

Home - About - Disclaimer - Privacy