Results 1 to 10 of about 288,136 (202)

Semantically Expanded Spoken Term Detection

open access: yesIEEE Access
Spoken term detection (STD) is effectively implemented using fundamental techniques such as automatic speech recognition (ASR) and information retrieval.
Zhanibek Kozhirbayev   +1 more
doaj   +3 more sources

Optimization of Spoken Term Detection System [PDF]

open access: yesJournal of Applied Mathematics, 2012
Generally speaking, spoken term detection system will degrade significantly because of mismatch between acoustic model and spontaneous speech. This paper presents an improved spoken term detection strategy, which integrated with a novel phoneme confusion
Chuanxu Wang, Pengyuan Zhang
doaj   +3 more sources

ALBAYZIN 2018 spoken term detection evaluation: a multi-domain international evaluation in Spanish [PDF]

open access: yesEURASIP Journal on Audio, Speech, and Music Processing, 2019
Search on speech (SoS) is a challenging area due to the huge amount of information stored in audio and video repositories. Spoken term detection (STD) is an SoS-related task aiming to retrieve data from a speech repository given a textual representation ...
Javier Tejedor   +7 more
doaj   +5 more sources

Whisper-based spoken term detection systems for search on speech ALBAYZIN evaluation challenge

open access: yesEURASIP Journal on Audio, Speech, and Music Processing
The vast amount of information stored in audio repositories makes necessary the development of efficient and automatic methods to search on audio content. In that direction, search on speech (SoS) has received much attention in the last decades.
Javier Tejedor, Doroteo T. Toledano
doaj   +2 more sources

ALBAYZIN Query-by-example Spoken Term Detection 2016 evaluation [PDF]

open access: yesEURASIP Journal on Audio, Speech, and Music Processing, 2018
Query-by-example Spoken Term Detection (QbE STD) aims to retrieve data from a speech repository given an acoustic (spoken) query containing the term of interest as the input.
Javier Tejedor   +9 more
doaj   +8 more sources

Biomimetic Computing for Efficient Spoken Language Identification [PDF]

open access: yesBiomimetics
Spoken Language Identification (SLID)-based applications have become increasingly important in everyday life, driven by advancements in artificial intelligence and machine learning.
Gaurav Kumar, Saurabh Bhardwaj
doaj   +2 more sources

Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation

open access: yesEURASIP Journal on Audio, Speech, and Music Processing, 2019
The huge amount of information stored in audio and video repositories makes search on speech (SoS) a priority area nowadays. Within SoS, Query-by-Example Spoken Term Detection (QbE STD) aims to retrieve data from a speech repository given a spoken query.
Javier Tejedor   +6 more
doaj   +2 more sources

Exploring the Effectiveness of Feature Reduction and Kernel-Based Matching for Query-by- Example Spoken Term Detection Using CNN

open access: yesIEEE Access
Query-by-example spoken term detection (QbE-STD) refers to the search for an audio query in a repository of audio utterances. A common approach for QbE-STD involves computing a matching matrix between the feature representations of the query and the ...
Manisha Naik Gaonkar   +3 more
doaj   +2 more sources

The architecture of a system for full-text search by speech data based on a global search index [PDF]

open access: yesНаучно-технический вестник информационных технологий, механики и оптики, 2021
This paper presents the architecture of a system for full-text search by speech data based on a global search index that combines information about all speech recordings in the archive. The architecture includes two independent blocks: an indexing block,
Oleg E. Petrov
doaj   +1 more source

Detect Multi Spoken Languages Using Bidirectional Long Short-Term Memory [PDF]

open access: yesAl-Rafidain Journal of Computer Sciences and Mathematics, 2023
Many speaker language detection systems depend on deep learning (DL) approaches, and utilize long recorded audio periods to achieve satisfactory accuracy.
Fawziya Ramo, Mohammed Kannah
doaj   +1 more source

Home - About - Disclaimer - Privacy