AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline

open access: yes, 2017
An open-source Mandarin speech corpus called AISHELL-1 is released. It is by far the largest corpus suitable for conducting speech recognition research and building speech recognition systems for Mandarin.
Bu, Hui   +4 more
core   +1 more source

Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition

open access: yes, 2020
We propose a novel approach to semi-supervised automatic speech recognition (ASR). We first exploit a large amount of unlabeled audio data via representation learning, where we reconstruct a temporal slice of filterbank features from past and future ...
Kirchhoff, Katrin   +3 more
core   +1 more source

Multifunction audio digitizer for communications systems

open access: yes, 1971
Digitizer accomplishes both N bit pulse code modulation /PCM/ and delta modulation, and provides modulation indicating variable signal gain and variable sidetone.
Monford, L. G., Jr.
core   +1 more source

Universal Adversarial Perturbations for Speech Recognition Systems

open access: yes, 2019
In this work, we demonstrate the existence of universal adversarial audio perturbations that cause mis-transcription of audio signals by automatic speech recognition (ASR) systems. We propose an algorithm to find a single quasi-imperceptible perturbation,
Dubnov, Shlomo   +5 more
core   +1 more source

Anti-social behavior detection in audio-visual surveillance systems

open access: yes, 2009
In this paper we propose a general-purpose framework for detection of unusual events. The proposed system is based on the unsupervised method for unusual scene detection in web-cam images that was introduced in [1].
Kelly, Philip   +4 more
core   +1 more source

Automatic speech recognition and the transcription of indistinct forensic audio: how do the new generation of systems fare?

open access: yes, Frontiers in Communication
This study provides an update on an earlier study in the “Capturing Talk” research topic, which aimed to demonstrate how automatic speech recognition (ASR) systems work with indistinct forensic-like audio, in comparison with good-quality audio.
Debbie Loakes
doaj   +1 more source

Exploring the technological dimension of Autonomous sensory meridian response-induced physiological responses

open access: yes, PeerJ
Background: In recent years, the scientific community has been captivated by the intriguing Autonomous sensory meridian response (ASMR), a unique phenomenon characterized by tingling sensations originating from the scalp and propagating down the spine ...
Sahar Seifzadeh, Bozena Kostek
doaj   +2 more sources

TUGS: I feel what you see

open access: yes, 2005
This article identifies how navigation aids can assist a wide range of visually impaired individuals, particularly focussing on the currently available GPS (Global Positioning Satellite) linked mobile technology systems.
Gustafson-Pearce, O
core  

Chaos-based audio encryption: Efficacy of 2D and 3D hyperchaotic systems

open access: yes, Franklin Open
Secure communication is necessary in the digital age, and securing audio data is critical because it is normally transmitted across susceptible networks.
Thejas Haridas   +4 more
doaj   +1 more source

Local Control of Audio Environment: A Review of Methods and Applications

open access: yes, Technologies, 2014
The concept of a local audio environment is to have sound playback locally restricted such that, ideally, adjacent regions of an indoor or outdoor space could exhibit their own individual audio content without interfering with each other.
Jussi Kuutti   +2 more
doaj   +1 more source