Unsupervised decoding of long-term, naturalistic human neural recordings with automated video and audio annotations [PDF]
Fully automated decoding of human activities and intentions from direct neural recordings is a tantalizing challenge in brain-computer interfacing.
Brunton, Bingni W. +4 more
core +3 more sources
New Grapheme Generation Rules for Two-Stage Modelbased Grapheme-to-Phoneme Conversion
The precise conversion of arbitrary text into its corresponding phoneme sequence (grapheme-to-phoneme or G2P conversion) is implemented in speech synthesis and recognition, pronunciation learning software, spoken term detection and spoken document ...
Seng Kheang +3 more
doaj +1 more source
Multilingual Bottleneck Features for Query by Example Spoken Term Detection [PDF]
State of the art solutions to query by example spoken term detection (QbE-STD) rely on bottleneck feature representation of the query and audio document.
Dhananjay Ram +2 more
semanticscholar +1 more source
Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluations [PDF]
Query-by-example spoken term detection (QbE STD) aims at retrieving data from a speech repository given an acoustic query containing the term of interest as input. Nowadays, it is receiving much interest due to the large volume of multimedia information.
Docío-Fernández, Laura +4 more
core +3 more sources
Score distribution based term specific thresholding for spoken term detection [PDF]
The spoken term detection (STD) task aims to return relevant segments from a spoken archive that contain the query terms. This paper focuses on the decision stage of an STD system. We propose a term specific thresholding (TST) method that uses per query posterior score distributions.
Doǧan Can, Murat Saraçlar
openaire +1 more source
Learning Acoustic Word Embeddings With Dynamic Time Warping Triplet Networks
In the last years, acoustic word embeddings (AWEs) have gained significant interest in the research community. It applies specifically to the application of acoustic embeddings in the Query-by-Example Spoken Term Detection (QbE-STD) search and related ...
Denis Shitov +3 more
doaj +1 more source
Overview of the NTCIR-11 SpokenQuery&Doc task [PDF]
This paper presents an overview of the Spoken Query and Spoken Document retrieval (SpokenQuery&Doc) task at the NTCIR-11Workshop. This task included spoken query driven spoken content retrieval (SQ-SCR) as the main sub-task.
Akiba, Tomoyosi +3 more
core
Incorporating visual information for spoken term detection [PDF]
Spoken term detection (STD) is the task of looking up a spoken term in a large volume of speech segments. In order to provide fast search, speech segments are first indexed into an intermediate representation using speech recognition engines which provide multiple hypotheses for each speech segment.
Kalantari, Shahram +2 more
openaire +1 more source
Query-by-example spoken term detection on multilingual unconstrained speech [PDF]
As part of the MediaEval 2013 benchmark evaluation campaign, the objective of the Spoken Web Search (SWS) task was to perform Query-by-Example Spoken Term Detection (QbESTD) using audio queries in a low-resource setting. After two successful editions and a continuously growing interest in the scientific community, a special effort was made in SWS 2013 ...
Anguera, Xavier +5 more
openaire +2 more sources
A Review on Language-Independent Search on Speech and its Applications
A thorough analysis of language-independent search methods and models for speech detection, a crucial task in retrieving audio file from large archives based on spoken queries was presented in this study.
Sushil Venkatesh Kulkarni, Sukomal Pal
doaj +1 more source

