AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline
An open-source Mandarin speech corpus called AISHELL-1 is released. It is by far the largest corpus which is suitable for conducting the speech recognition research and building speech recognition systems for Mandarin.
Bu, Hui +4 more
core +1 more source
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition
We propose a novel approach to semi-supervised automatic speech recognition (ASR). We first exploit a large amount of unlabeled audio data via representation learning, where we reconstruct a temporal slice of filterbank features from past and future ...
Kirchhoff, Katrin +3 more
core +1 more source
Multifunction audio digitizer for communications systems [PDF]
Digitizer accomplishes both N-bit pulse code modulation (PCM) and delta modulation, and provides modulation indicating variable signal gain and variable sidetone.
Monford, L. G., Jr.
core +1 more source
Universal Adversarial Perturbations for Speech Recognition Systems [PDF]
In this work, we demonstrate the existence of universal adversarial audio perturbations that cause mis-transcription of audio signals by automatic speech recognition (ASR) systems. We propose an algorithm to find a single quasi-imperceptible perturbation,
Dubnov, Shlomo +5 more
core +1 more source
Anti-social behavior detection in audio-visual surveillance systems [PDF]
In this paper we propose a general-purpose framework for detection of unusual events. The proposed system is based on the unsupervised method for unusual scene detection in web-cam images that was introduced in [1].
Kelly, Philip +4 more
core +1 more source
This study provides an update on an earlier study in the “Capturing Talk” research topic, which aimed to demonstrate how automatic speech recognition (ASR) systems work with indistinct forensic-like audio, in comparison with good-quality audio.
Debbie Loakes
doaj +1 more source
Exploring the technological dimension of Autonomous sensory meridian response-induced physiological responses [PDF]
Background: In recent years, the scientific community has been captivated by the intriguing Autonomous sensory meridian response (ASMR), a unique phenomenon characterized by tingling sensations originating from the scalp and propagating down the spine ...
Sahar Seifzadeh, Bozena Kostek
doaj +2 more sources
TUGS: I feel what you see [PDF]
This article identifies how navigation aids can assist a wide range of visually impaired individuals, particularly focussing on the currently available GPS (Global Positioning System) linked mobile technology systems.
Gustafson-Pearce, O
core
Chaos-based audio encryption: Efficacy of 2D and 3D hyperchaotic systems
Secure communication in the digital age is necessary; securing audio data becomes very critical since this is normally transmitted across susceptible networks.
Thejas Haridas +4 more
doaj +1 more source
Local Control of Audio Environment: A Review of Methods and Applications
The concept of a local audio environment is to have sound playback locally restricted such that, ideally, adjacent regions of an indoor or outdoor space could exhibit their own individual audio content without interfering with each other.
Jussi Kuutti +2 more
doaj +1 more source

