Results 41 to 50 of about 288,136 (202)

Spoken content retrieval: A survey of techniques and technologies [PDF]

open access: yes, 2012
Speech media, that is, digital audio and video containing spoken content, has blossomed in recent years. Large collections are accruing on the Internet as well as in private and enterprise settings.
Ani Nenkova   +3 more
core   +3 more sources

Emotion recognition of human speech using deep learning method and MFCC features

open access: yesРадіоелектронні і комп'ютерні системи, 2022
Subject matter: Speech emotion recognition (SER) is an ongoing interesting research topic. Its purpose is to establish interactions between humans and computers through speech and emotion.
Sumon Kumar Hazra   +4 more
doaj   +1 more source

Spoken Term Detection Techniques

open access: yes, 2018
The technology of audio search has now improved to search and retrieve any unspecified spoken word from an audio database with reasonable accuracy. This is termed as Spoken Term Detection (STD). STD can be broadly classified into Text-based STD and Query by Example STD (QbE-STD).
Leena Mary, Deekshitha G
openaire   +2 more sources

Thai Named Entity Recognition Using BiLSTM-CNN-CRF Enhanced by TCC

open access: yesIEEE Access, 2022
The languages spoken in Asia share common morphological analysis errors in word segmentation which normally propagate to higher-level processing, i.e., part-of-speech (POS) tagging, syntactic parsing, word extraction, and named entity recognition (NER ...
Virach Sornlertlamvanich   +1 more
doaj   +1 more source

A summary of the 2012 JHU CLSP Workshop on Zero Resource Speech Technologies and Models of Early Language Acquisition [PDF]

open access: yes, 2013
We summarize the accomplishments of a multi-disciplinary workshop exploring the computational and scientific issues surrounding zero resource (unsupervised) speech technologies and related models of early language acquisition.
Bennett, Erin   +26 more
core   +1 more source

Query-by-Example Speech Search Using Recurrent Neural Acoustic Word Embeddings With Temporal Context

open access: yesIEEE Access, 2019
Acoustic word embeddings (AWEs) have been popular in low-resource query-by-example speech search. They are using vector distances to find the spoken query in search content, which has much lower computation than the conventional dynamic time warping (DTW)
Yougen Yuan   +4 more
doaj   +1 more source

Unsupervised Spoken Term Detection with Spoken Queries by Multi-level Acoustic Patterns with Varying Model Granularity

open access: yes, 2015
This paper presents a new approach for unsupervised Spoken Term Detection with spoken queries using multiple sets of acoustic patterns automatically discovered from the target corpus.
Chan, Chun-an   +2 more
core   +1 more source

Written Term Detection Improves Spoken Term Detection

open access: yesIEEE/ACM Transactions on Audio, Speech, and Language Processing
End-to-end (E2E) approaches to keyword search (KWS) are considerably simpler in terms of training and indexing complexity when compared to approaches which use the output of automatic speech recognition (ASR) systems. This simplification however has drawbacks due to the loss of modularity.
Bolaji Yusuf, Murat Saraçlar
openaire   +2 more sources

Searching Spontaneous Conversational Speech [PDF]

open access: yes, 2007
The ACM SIGIR Workshop on Searching Spontaneous Conversational Speech was held as part of the 2007 ACM SIGIR Conference in Amsterdam.\ud The workshop program was a mix of elements, including a keynote speech, paper presentations and panel discussions ...
Jong, Franciska de   +3 more
core   +2 more sources

Neural Network Based End-to-End Query by Example Spoken Term Detection [PDF]

open access: yesIEEE/ACM Transactions on Audio Speech and Language Processing, 2019
This article focuses on the problem of query by example spoken term detection (QbE-STD) in zero-resource scenario. State-of-the-art approaches primarily rely on dynamic time warping (DTW) based template matching techniques using phone posterior or ...
Dhananjay Ram   +2 more
semanticscholar   +1 more source

Home - About - Disclaimer - Privacy