Results 161 to 170 of about 288,136 (202)
Some of the next articles are maybe not open access.

Two-stage spoken term detection system for under-resourced languages

IET Signal Processing, 2020
: Spoken Term Detection (STD) is the process of locating the occurrences of spoken queries in a given speech database. Generally, two methods are adopted for STD: an ASR based sequence matching and ASR-free, feature-based template matching.
Deekshitha G, L. Mary
semanticscholar   +1 more source

Spoken Term Detection Using Visual Spectrogram Matching

2008 Tenth IEEE International Symposium on Multimedia, 2008
This work proposes a novel spoken term detection technique, where the query is in audio format. Detection and retrieval are performed by matching the spectrograms of the spoken document and query as visual images, using ideas from computer vision. Local descriptors are computed on a dense grid over each spectrogram, and the query term is detected using
Nevena Lazic, Parham Aarabi
openaire   +1 more source

Representation Learning for Spoken Term Detection

2016
Spoken Term Detection (STD), which refers to the task of searching for a user audio query in audio data is extremely significant for the management and monitoring of increasing volumes of audio data on the internet. It is affected by channel mismatch, speaker variability and differences in speaking mode/rate.
P. Raghavendra Reddy   +2 more
openaire   +2 more sources

Spoken Term Detection for Turkish Broadcast News

2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008
In this paper, we present a baseline spoken term detection (STD) system for Turkish broadcast news. The agglutinative structure of Turkish causes a high out-of-vocabulary (OOV) rate and increases word error rate (WER) in automatic speech recognition. Several approaches are attempted to reduce this negative effect on the STD system.
Siddika Parlak, Murat Saraclar
openaire   +1 more source

High performance Chinese Spoken Term Detection based on term expansion

2010 7th International Symposium on Chinese Spoken Language Processing, 2010
This paper mainly focuses on improving the performance of Chinese Spoken Term Detection (S TD) systems using words as searching units. These systems are designed to find instances of particular phrases (called Terms) in voices. Terms are usually segmented into word sequences and searched with voices' recognition results.
Wei Li, Ji Wu, Ping Lv
openaire   +1 more source

Vocal Tract Length Normalization using a Gaussian mixture model framework for query-by-example spoken term detection

Computer Speech and Language, 2019
A speech spectrum is known to be changed by the variations in the length of the vocal tract of a speaker. This is because of the fact that speech formants are inversely related to the vocal tract length (VTL).
Maulik C. Madhavi, H. Patil
semanticscholar   +1 more source

Open vocabulary spoken content retrieval by front-ending with spoken term detection

2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
How to deal with speech recognition errors and out-of-vocabulary (OOV) words are common challenging problems in spoken document processing. In this work, we propose the spoken content retrieval (SCR) method that incorporates spoken term detection (STD) as the pre-processing stage.
Tomoko Takigami, Tomoyosi Akiba
openaire   +1 more source

Query-by-Example Spoken Term Detection using Attentive Pooling Networks

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Query-by-example spoken term detection (QbE-STD) is attractive because its a key technology for retrieving and browsing spoken content without transcribing them into text. Several end-to-end models based on encoder architecture have been proposed for QbE-
Kun Zhang   +4 more
semanticscholar   +1 more source

Entropy-based false detection filtering in spoken term detection tasks

2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
This paper describes spoken term detection (STD) and inexistent STD (iSTD) methods using term detection entropy based on a phoneme transition network (PTN)-formed index. Our previously reported STD method uses a PTN derived from multiple automatic speech recognizers (ASRs) as an index.
Satoshi Natori   +3 more
openaire   +1 more source

Using Monolingual Speech Recognition for Spoken Term Detection in Code-switched Hindi-English Speech

2019 International Conference on Data Mining Workshops (ICDMW), 2019
Code-switching is the alternation of two or more languages in a single utterance or a conversation and is prevalent in multilingual communities all over the world. Spoken Term Detection (STD) is the task of detecting a given word or phrase in audio.
Sanket Shah, Sunayana Sitaram
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy