Results 31 to 40 of about 313,816 (278)
In this paper, feature extraction methods are developed based on the non-negative matrix factorization (NMF) algorithm to be applied in weakly supervised sound event detection. Recently, the development of various features and systems have been attempted
Seokjin Lee +4 more
doaj +1 more source
MTF-CRNN: Multiscale Time-Frequency Convolutional Recurrent Neural Network for Sound Event Detection
To reduce neural network parameter counts and improve sound event detection performance, we propose a multiscale time-frequency convolutional recurrent neural network (MTF-CRNN) for sound event detection.
Keming Zhang +4 more
doaj +1 more source
Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection [PDF]
Sound events often occur in unstructured environments where they exhibit wide variations in their frequency content and temporal structure. Convolutional neural networks (CNN) are able to extract higher level features that are invariant to local spectral
Heittola, Toni +4 more
core +2 more sources
Sound Event Detection Using Spatial Features and Convolutional Recurrent Neural Network [PDF]
This paper proposes to use low-level spatial features extracted from multichannel audio for sound event detection. We extend the convolutional recurrent neural network to handle more than one type of these multichannel features by learning from each of ...
Adavanne, Sharath +2 more
core +2 more sources
A Comprehensive Review of Polyphonic Sound Event Detection
One of the most amazing functions of the human auditory system is the ability to detect all kinds of sound events in the environment. With the technologies and hardware advances, polyphonic Sound Event Detection (SED) can be developed to mimic the ...
T. K. Chan, Cheng Siong Chin
doaj +1 more source
Peer Collaborative Learning for Polyphonic Sound Event Detection
This paper describes that semi-supervised learning called peer collaborative learning (PCL) can be applied to the polyphonic sound event detection (PSED) task, which is one of the tasks in the Detection and Classification of Acoustic Scenes and Events (DCASE) challenge.
Helen Bear +2 more
openaire +2 more sources
Large-scale weakly supervised audio classification using gated convolutional neural network [PDF]
In this paper, we present a gated convolutional neural network and a temporal attention-based localization method for audio classification, which won the 1st place in the large-scale weakly supervised sound event detection task of Detection and ...
Kong, Qiuqiang +3 more
core +2 more sources
Seed: Sound Event Early Detection Via Evidential Uncertainty
Sound Event Early Detection (SEED) is an essential task in recognizing the acoustic environments and soundscapes. However, most of the existing methods focus on the offline sound event detection, which suffers from the over-confidence issue of early-stage event detection and usually yield unreliable results.
Zhao, Xujiang +6 more
openaire +2 more sources
Audio Captioning Using Sound Event Detection
This technical report proposes an audio captioning system for DCASE 2021 Task 6 audio captioning challenge. Our proposed model is based on an encoder-decoder architecture with bi-directional Gated Recurrent Units (BiGRU) using pretrained audio features and sound event detection.
Eren, Ayşegül Özkaya, Sert, Mustafa
openaire +3 more sources

