H-QuEST: Accelerating Query-by-Example Spoken Term Detection with Hierarchical Indexing

open access: yes
Interspeech 2025
Query-by-example spoken term detection (QbE-STD) searches for matching words or phrases in an audio dataset using a sample spoken query. When annotated data is limited or unavailable, QbE-STD is often done using template matching methods like dynamic time warping (DTW), which are computationally expensive and do not scale well.
Singh, Akanksha   +2 more
openaire   +2 more sources

Cross-Lingual Query-by-Example Spoken Term Detection: A Transformer-Based Approach

open access: yes
Query-by-example spoken term detection (QbE-STD) is typically constrained by transcribed data scarcity and language specificity. This paper introduces a novel, language-agnostic QbE-STD model leveraging image processing techniques and transformer architecture.
Fatemeh, Allahdadi   +2 more
openaire   +2 more sources

Providers of relief in distress: RAG-based LLMs as situation and intent-aware assistants.

open access: yes
Front Artif Intell
Nazar AM   +8 more
europepmc   +1 more source

AI-driven audio-to-video generation for dynamic content creation via stable diffusion and CNN-augmented transformers.

open access: yes
Sci Rep
Dharrao D   +6 more
europepmc   +1 more source

Multimodal Cognitive Architecture with Local Generative AI for Industrial Control of Concrete Plants on Edge Devices.

open access: yes
Sensors (Basel)
Hidalgo-Castelo F   +4 more
europepmc   +1 more source
