Dataset for polyphonic sound event detection tasks in urban soundscapes: The synthetic polyphonic ambient sound source (SPASS) dataset [PDF]
This paper presents the Synthetic Polyphonic Ambient Sound Source (SPASS) dataset, a publicly available synthetic polyphonic audio dataset. SPASS was designed to train deep neural networks effectively for polyphonic sound event detection (PSED) in urban ...
Rhoddy Viveros-Muñoz +8 more
doaj +4 more sources
Polyphonic Sound Event Detection Using Temporal-Frequency Attention and Feature Space Attention [PDF]
The complexity of polyphonic sounds imposes numerous challenges on their classification. Especially in real life, polyphonic sound events have discontinuity and unstable time-frequency variations.
Ye Jin +4 more
doaj +4 more sources
Metrics for Polyphonic Sound Event Detection [PDF]
This paper presents and discusses various metrics proposed for evaluation of polyphonic sound event detection systems used in realistic situations where there are typically multiple sound sources active simultaneously.
Annamaria Mesaros +2 more
doaj +4 more sources
A Comprehensive Review of Polyphonic Sound Event Detection [PDF]
One of the most amazing functions of the human auditory system is the ability to detect all kinds of sound events in the environment. With the technologies and hardware advances, polyphonic Sound Event Detection (SED) can be developed to mimic the ...
T. K. Chan, Cheng Siong Chin
doaj +3 more sources
Polyphonic Sound Event Detection by using Capsule Neural Networks [PDF]
Artificial sound event detection (SED) has the aim to mimic the human ability to perceive and understand what is happening in the surroundings. Nowadays, Deep Learning offers valuable techniques for this goal such as Convolutional Neural Networks (CNNs).
Gabrielli, Leonardo +3 more
core +2 more sources
Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection [PDF]
Sound events often occur in unstructured environments where they exhibit wide variations in their frequency content and temporal structure. Convolutional neural networks (CNN) are able to extract higher level features that are invariant to local spectral
Heittola, Toni +4 more
core +6 more sources
Analysis and interpretation of joint source separation and sound event detection in domestic environments. [PDF]
In recent years, the relation between Sound Event Detection (SED) and Source Separation (SSep) has received a growing interest, in particular, with the aim to enhance the performance of SED by leveraging the synergies between both tasks.
Diego de Benito-Gorrón +2 more
doaj +2 more sources
Peer Collaborative Learning for Polyphonic Sound Event Detection
This paper describes that semi-supervised learning called peer collaborative learning (PCL) can be applied to the polyphonic sound event detection (PSED) task, which is one of the tasks in the Detection and Classification of Acoustic Scenes and Events (DCASE) challenge.
Helen Bear +2 more
openaire +2 more sources
A System for the Detection of Polyphonic Sound on a University Campus Based on CapsNet-RNN
In recent decades, surveillance and home security systems based on video analysis have been proposed for the automatic detection of abnormal situations.
Liyan Luo +6 more
doaj +1 more source
Improved capsule routing for weakly labeled sound event detection
Polyphonic sound event detection aims to detect the types of sound events that occur in given audio clips, and their onset and offset times, in which multiple sound events may occur simultaneously. Deep learning–based methods such as convolutional neural
Haitao Li, Shuguo Yang, Wenwu Wang
doaj +1 more source

