A Feature Learning Siamese Model for Intelligent Control of the Dynamic Range Compressor
In this paper, a siamese DNN model is proposed to learn the characteristics of the audio dynamic range compressor (DRC). This facilitates an intelligent control system that uses audio examples to configure the DRC, a widely used non-linear audio signal ...
Fazekas, György, Sheng, Di
core +1 more source
Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging [PDF]
Environmental audio tagging aims to predict only the presence or absence of certain acoustic events in the interested acoustic scene. In this paper we make contributions to audio tagging in two parts, respectively, acoustic modeling and feature learning.
Foster, Peter +6 more
core +3 more sources
Eventness: Object Detection on Spectrograms for Temporal Localization of Audio Events
In this paper, we introduce the concept of Eventness for audio event detection, which can, in part, be thought of as an analogue to Objectness from computer vision.
Das, Samarjit +3 more
core +1 more source
Urban environments are characterized by a complex interplay of various sound sources, which significantly influence the overall soundscape quality. This study presents a methodology that combines the intermittency ratio (IR) metric for acoustic event ...
Vidaña-Vila Ester +2 more
doaj +1 more source
Learning Audio Sequence Representations for Acoustic Event Classification
Acoustic Event Classification (AEC) has become a significant task for machines to perceive the surrounding auditory scene. However, extracting effective representations that capture the underlying characteristics of the acoustic events is still ...
Han, Jing +4 more
core +1 more source
Cutting tracks, making CDs: a comparative study of audio time-correction techniques in the desktop age. [PDF]
Producers have long sought to “tighten” studio performances. Software-based DAW’s now come with proprietary functions to facilitate this, but only the latest generation of platforms allow relative ease of use on longer takes.
Paterson, Justin
core +1 more source
Drum Transcription via Classification of Bar-level Rhythmic Patterns [PDF]
acceptedMatthias Mauch is supported by a Royal Academy of Engineering Research ...
15th International Society for Music Information Retrieval Conference +3 more
core
Deep Learning for Audio Signal Processing
Given the recent surge in developments of deep learning, this article provides a review of the state-of-the-art deep learning techniques for audio signal processing.
Chang, Shuo-yiin +5 more
core +1 more source
DCASE 2018 Challenge Surrey Cross-Task convolutional neural network baseline [PDF]
The Detection and Classification of Acoustic Scenes and Events (DCASE) consists of five audio classification and sound event detection tasks: 1) Acoustic scene classification, 2) General-purpose audio tagging of Freesound, 3) Bird audio detection, 4 ...
Iqbal, Turab +4 more
core +1 more source
USM-SED - A Dataset for Polyphonic Sound Event Detection in Urban Sound Monitoring Scenarios
This paper introduces a novel dataset for polyphonic sound event detection in urban sound monitoring use-cases. Based on isolated sounds taken from the FSD50k dataset, 20,000 polyphonic soundscapes are synthesized with sounds being randomly positioned in the stereo panorama using different loudness levels.
openaire +2 more sources

