Results 41 to 50 of about 288,136 (202)
Spoken content retrieval: A survey of techniques and technologies [PDF]
Speech media, that is, digital audio and video containing spoken content, has blossomed in recent years. Large collections are accruing on the Internet as well as in private and enterprise settings.
Ani Nenkova +3 more
core +3 more sources
Emotion recognition of human speech using deep learning method and MFCC features
Subject matter: Speech emotion recognition (SER) is an ongoing interesting research topic. Its purpose is to establish interactions between humans and computers through speech and emotion.
Sumon Kumar Hazra +4 more
doaj +1 more source
Spoken Term Detection Techniques
The technology of audio search has now improved to search and retrieve any unspecified spoken word from an audio database with reasonable accuracy. This is termed as Spoken Term Detection (STD). STD can be broadly classified into Text-based STD and Query by Example STD (QbE-STD).
Leena Mary, Deekshitha G
openaire +2 more sources
Thai Named Entity Recognition Using BiLSTM-CNN-CRF Enhanced by TCC
The languages spoken in Asia share common morphological analysis errors in word segmentation which normally propagate to higher-level processing, i.e., part-of-speech (POS) tagging, syntactic parsing, word extraction, and named entity recognition (NER ...
Virach Sornlertlamvanich +1 more
doaj +1 more source
A summary of the 2012 JHU CLSP Workshop on Zero Resource Speech Technologies and Models of Early Language Acquisition [PDF]
We summarize the accomplishments of a multi-disciplinary workshop exploring the computational and scientific issues surrounding zero resource (unsupervised) speech technologies and related models of early language acquisition.
Bennett, Erin +26 more
core +1 more source
Query-by-Example Speech Search Using Recurrent Neural Acoustic Word Embeddings With Temporal Context
Acoustic word embeddings (AWEs) have been popular in low-resource query-by-example speech search. They are using vector distances to find the spoken query in search content, which has much lower computation than the conventional dynamic time warping (DTW)
Yougen Yuan +4 more
doaj +1 more source
This paper presents a new approach for unsupervised Spoken Term Detection with spoken queries using multiple sets of acoustic patterns automatically discovered from the target corpus.
Chan, Chun-an +2 more
core +1 more source
Written Term Detection Improves Spoken Term Detection
End-to-end (E2E) approaches to keyword search (KWS) are considerably simpler in terms of training and indexing complexity when compared to approaches which use the output of automatic speech recognition (ASR) systems. This simplification however has drawbacks due to the loss of modularity.
Bolaji Yusuf, Murat Saraçlar
openaire +2 more sources
Searching Spontaneous Conversational Speech [PDF]
The ACM SIGIR Workshop on Searching Spontaneous Conversational Speech was held as part of the 2007 ACM SIGIR Conference in Amsterdam.\ud The workshop program was a mix of elements, including a keynote speech, paper presentations and panel discussions ...
Jong, Franciska de +3 more
core +2 more sources
Neural Network Based End-to-End Query by Example Spoken Term Detection [PDF]
This article focuses on the problem of query by example spoken term detection (QbE-STD) in zero-resource scenario. State-of-the-art approaches primarily rely on dynamic time warping (DTW) based template matching techniques using phone posterior or ...
Dhananjay Ram +2 more
semanticscholar +1 more source

