Results 21 to 30 of about 19,864 (256)
Phonological Posterior Hashing for Query by Example Spoken Term Detection [PDF]
Afsaneh Asaei+2 more
openalex +3 more sources
Query-by-Example with Acoustic Word Embeddings Using wav2vec Pretraining [PDF]
Query-by-Example is a popular keyword detection method in the absence of speech resources.It can build a keyword query system with excellent performance when there are few labeled voice resources and a lack of pronunciation dictionaries.In recent years ...
LI Zhao-qi, LI Ta
doaj +1 more source
The large amount of information stored in audio and video repositories makes search on speech (SoS) a challenging area that is continuously receiving much interest.
Javier Tejedor+4 more
doaj +1 more source
COMBINING TEMPORAL AND SPECTRAL INFORMATION FOR QUERY-BY-EXAMPLE SPOKEN TERM DETECTION
Publication in the conference proceedings of EUSIPCO, Lisbon, Portugal ...
Ciro Gràcia+2 more
openalex +3 more sources
Leveraging pre-trained representations to improve access to untranscribed speech from endangered languages [PDF]
Pre-trained speech representations like wav2vec 2.0 are a powerful tool for automatic speech recognition (ASR). Yet many endangered languages lack sufficient data for pre-training such models, or are predominantly oral vernaculars without a standardised ...
Bartelds, Martijn+11 more
core +1 more source
Learning Acoustic Word Embeddings With Dynamic Time Warping Triplet Networks
In the last years, acoustic word embeddings (AWEs) have gained significant interest in the research community. It applies specifically to the application of acoustic embeddings in the Query-by-Example Spoken Term Detection (QbE-STD) search and related ...
Denis Shitov+3 more
doaj +1 more source
Query-by-Example Speech Search Using Recurrent Neural Acoustic Word Embeddings With Temporal Context
Acoustic word embeddings (AWEs) have been popular in low-resource query-by-example speech search. They are using vector distances to find the spoken query in search content, which has much lower computation than the conventional dynamic time warping (DTW)
Yougen Yuan+4 more
doaj +1 more source
Multimodal Locomotion: Next Generation Aerial–Terrestrial Mobile Robotics
Aerial–terrestrial robots can achieve efficient energy consumption and robust environmental interaction by adding morphological features, adapting forms for locomotion transitions, and integrating multiple platforms. This next generation of mobile robots advances real‐world robotic deployment for operations with complex tasks and tackle environments ...
Jane Pauline Ramirez, Salua Hamaza
wiley +1 more source
Spoken content retrieval: A survey of techniques and technologies [PDF]
Speech media, that is, digital audio and video containing spoken content, has blossomed in recent years. Large collections are accruing on the Internet as well as in private and enterprise settings.
Ani Nenkova+3 more
core +3 more sources
Access to recorded interviews: A research agenda [PDF]
Recorded interviews form a rich basis for scholarly inquiry. Examples include oral histories, community memory projects, and interviews conducted for broadcast media.
Heeren, W.F.L.+3 more
core +3 more sources